Harsh Mahesheka's Avatar

Harsh Mahesheka

@harshmahesheka.bsky.social

Tinkering with bots and brains πŸ› οΈπŸ€– | EE @ IIT BHU πŸŽ“ | Previous Robot Learning @ Uni Freiburg, ASU, IIIT-H | GSoC '22 @ Open Robotics | Teaching bots to fetch me a beer 🍺 Visit me at - https://harshmahesheka.github.io/

862 Followers  |  289 Following  |  18 Posts  |  Joined: 17.11.2024  |  2.029

Latest posts by harshmahesheka.bsky.social on Bluesky

Found a hack for bookmark feature, just repost itπŸ˜‚

25.11.2024 18:16 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Check out Pygame_spiel by @giogix2.bsky.social !

This is a Pygame-based library to play games from the OpenSpiel suite against AI agents. 🀩😍

github.com/giogix2/pyga...

25.11.2024 18:07 β€” πŸ‘ 25    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1

Migrating to Bluesky feels like upgrading your codebase to that new software in the market. It's painful, but you know you will have to do it at some point.

24.11.2024 12:24 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

You just witnessed the birth of Skynet

24.11.2024 08:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
e introduce the effective horizon, a property of
MDPs that controls how difficult RL is. Our analysis is mo-
tivated by Greedy Over Random Policy (GORP), a simple
Monte Carlo planning algorithm (left) that exhaustively ex-
plores action sequences of length k and then uses m random
rollouts to evaluate each leaf node. The effective horizon
combines both k and m into a single measure. We prove
sample complexity bounds based on the effective horizon that
correlate closely with the real performance of PPO, a deep
RL algorithm, on our BRIDGE dataset of 155 deterministic
MDPs (right).

e introduce the effective horizon, a property of MDPs that controls how difficult RL is. Our analysis is mo- tivated by Greedy Over Random Policy (GORP), a simple Monte Carlo planning algorithm (left) that exhaustively ex- plores action sequences of length k and then uses m random rollouts to evaluate each leaf node. The effective horizon combines both k and m into a single measure. We prove sample complexity bounds based on the effective horizon that correlate closely with the real performance of PPO, a deep RL algorithm, on our BRIDGE dataset of 155 deterministic MDPs (right).

Kind of a broken record here but proceedings.neurips.cc/paper_files/...
is totally fascinating in that it postulates two underlying, measurable structures that you can use to assess if RL will be easy or hard in an environment

23.11.2024 18:18 β€” πŸ‘ 152    πŸ” 28    πŸ’¬ 8    πŸ“Œ 2

πŸ‘‹

24.11.2024 06:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Great list!! Could you add me?
I work on Multi-Agent Reinforcement Learning and it's applications in robotics. Would love to connect with community.

23.11.2024 12:58 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
mesa-examples/rl at main Β· projectmesa/mesa-examples User and showcase agent-based models developed using Mesa - projectmesa/mesa-examples

Love to get added!!

A few of my open source projects πŸ€–-

1. Mesa_RL- github.com/projectmesa/...
2. Household Bot - github.com/harshmaheshe...
3. Gazebo Pkg_Create - github.com/gazebosim/gz...

23.11.2024 12:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

This summer, I collaborated with Project Mesa to create Mesa-RL framework that seamlessly integrates multi-agent reinforcement learning into Mesa's agent-based modeling environments. πŸ€–πŸŒ

Go try RL with your social agent models!

Link - github.com/projectmesa/...

#ReinforcementLearning #MultiAgent

23.11.2024 12:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Great List! Would love to be added and connect with the community.

23.11.2024 07:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I work in robot learning. Would love to be added in the list.

21.11.2024 22:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

HelloπŸ‘‹, Would love to be added.

21.11.2024 22:22 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Hi, Thanks for the list.
Would love to be added to this list. I work on robot learning and its applications.

21.11.2024 22:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Great List!!

Would love to be added @chrispaxton.bsky.social

21.11.2024 21:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

By chaining a VLM and LLM in a bi-level framework, we use the β€œchain rule” to guide reward search directly from video demos.

This work was done under guidance of Prof. Wanxin Jin and Prof. Zhorang Wang at Intelligent Robotics and Interactive Systems (IRIS) Lab, Arizona State University.

21.11.2024 19:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Can robots learn skills from YouTube without complex video processing? Our LLM-driven bi-level programming shows it’s possible! Check out our RL agents learning skills from their biological counterparts!πŸ’‘

Preprint: arxiv.org/abs/2410.09286

#ReinforcementLearning #LLM #Robotics #AI

21.11.2024 19:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This was work with my great colleagues Harsh Mahesheka, Jan Ole von Hartz, Tim Welschehold and Abhinav Valada at the Technische FakultΓ€t der UniversitΓ€t Freiburg, University of Freiburg.

21.11.2024 19:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

By modularizing commands, the operator can fully focus on the task relevant end-effector motions. This even enables kinesthetic teaching of mobile manipulators in cluttered environments. We show that this allows us to rapidly learn whole-body mobile manipulation skills with less than ten minutes.

21.11.2024 19:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Can we operate mobile manipulators without expensive exoskeletons or tracking setups? πŸ“œ

In our latest work, Zero-Cost Whole-Body Teleoperation for Mobile Manipulation, we do exactly this.

Website and full videos: moma-teleop.cs.uni-freiburg.de
Arxiv: arxiv.org/abs/2409.150...

#AI #Robotics

21.11.2024 19:03 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

I’m an Electrical Engineering student at IIT Varanasi, on a mission to teach robots human-like skills. Mimicking the most complex machineβ€”usβ€”is no small feat, but reinforcement learning makes it exciting (and intuitive!). πŸš€πŸ€–
#AI #ReinforcementLearning #Robotics

21.11.2024 18:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A starter pack of starter packs:

Robotics and AI go.bsky.app/DfAoaJ1
Computer Vision go.bsky.app/PkAKJu5
Computer Graphics Research go.bsky.app/ckQ1u9
Grumpy Machine Learners go.bsky.app/6ddpivr
Reinforcement Learning go.bsky.app/3WPHcHg

19.11.2024 04:36 β€” πŸ‘ 96    πŸ” 29    πŸ’¬ 8    πŸ“Œ 3

@harshmahesheka is following 18 prominent accounts