Check out the paper and feel free to reach out! Incredibly grateful to my collaborators Lokesh and Nikhil at the Dynamic Robotics and Control Laboratory (DRCL), USC. Special thanks to Prof. Quan Nguyen for his valuable inputs throughout the project! (6/n)
26.11.2024 00:15
Policies may get stuck in local optima by choosing an undesirable sequence of modes. To mitigate this, we introduce a task-agnostic mode-switching preference reward using mode ranks specified by the user. (5/n)
26.11.2024 00:15
We propose an oracle-guided policy optimization framework leveraging a hybrid automata perspective to design multi-mode oracles. This abstraction results in structured exploration using a single oracle across different tasks and robots. (4/n)
26.11.2024 00:15
Dynamic loco-manipulation calls for whole-body and contact-rich control: a hard exploration problem for deep RL due to susceptibility to local optima. Current approaches rely on:
- Task-specific reward shaping
- Multiple low-level/skill policies (3/n)
26.11.2024 00:15
Introducing an oracle-guided policy optimization framework for synthesizing RL policies to tackle dynamic loco-manipulation:
- Single multi-mode policy per task
- One oracle, same reward weights & hyperparameters across different robots & tasks
(2/n)
26.11.2024 00:15
Excited to share our recent work:
Dynamic Bipedal Loco-manipulation using Oracle Guided Multi-mode Policies with Mode-transition Preference
Website: indweller.github.io/ogmplm/
Preprint: arxiv.org/abs/2410.01030
Video: youtu.be/gfDaRqobheg?...
🧵 (1/n)
26.11.2024 00:15
Assistant professor at Princeton CS working on reinforcement learning and AI/ML.
Site: https://ben-eysenbach.github.io/
Lab: https://princeton-rl.github.io/
Adiabatic and Ecstatic
QEC @ qc.design | Formerly TUM Physics and latticesurgery.com
A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef
Writes http://interconnects.ai
At Ai2 via HuggingFace, Berkeley, and normal places
Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning.
Lover of Linux 🐧, coffee ☕, and retro gaming. Big fan of open-source. #gohabsgo 🇨🇦
For more info: https://linktr.ee/sharky6000
Principal Researcher @ Microsoft Research.
AI, RL, cog neuro, philosophy.
www.momen-nejad.org
Interested in cognition and artificial intelligence. Research Scientist at Google DeepMind. Previously cognitive science at Stanford. Posts are mine.
lampinen.github.io
Staff research scientist at Google DeepMind. AI and neuro.
Former physicist, current human.
Find more at www.janexwang.com
AI, RL, NLP, Games Asst Prof at UCSD
Research Scientist at Nvidia
Lab: http://pearls.ucsd.edu
Personal: prithvirajva.com
professor at university of washington and founder at csm.ai. computational cognitive scientist. working on social and artificial intelligence and alignment.
http://faculty.washington.edu/maxkw/
AI and Games Researcher at NYU. Head of AI at Nof1.
Professor, Department of Psychology and Center for Brain Science, Harvard University
https://gershmanlab.com/
web: http://maxim.ece.illinois.edu
substack: https://realizable.substack.com
Researcher in robotics and machine learning (Reinforcement Learning). Maintainer of Stable-Baselines (SB3).
https://araffin.github.io/
Research Engineer at Google DeepMind
A special snowflake existing in 196883 dimensions
#ActuallyAutistic He/Him
AGI research @DeepMind.
Ex cofounder & CTO Vicarious AI (acqd by Alphabet),
Cofounder Numenta
Triply EE (BTech IIT-Mumbai, MS&PhD Stanford). #AGIComics
blog.dileeplearning.com
Researching Artificial General Intelligence Safety, via thinking about neuroscience and algorithms, at Astera Institute. https://sjbyrnes.com/agi.html
comp neuro, neural manifolds, neuroAI, physics of learning
assistant professor @ harvard (physics, center for brain science, kempner institute) + @ Flatiron Institute
https://www.sychung.org
Safe and robust AI/ML, computational sustainability. Former President AAAI and IMLS. Distinguished Professor Emeritus, Oregon State University. https://web.engr.oregonstate.edu/~tgd/
assistant prof at USC Data Sciences and Operations and Computer Science; phd Cornell ORIE.
data-driven decision-making, operations research/management, causal inference, algorithmic fairness/equity
bureaucratic justice warrior
angelamzhou.github.io
Full of childlike wonder. Building friendly robots. UT Austin PhD student, MIT ’20.