If you were at the last RLC NeurIPS event, you know it's not to be missed. Invite any RL researcher you know, no invite-only parties here.
10.12.2024 21:56 β π 18 π 2 π¬ 0 π 0@chanpyb.bsky.social
PhD student at University of Alberta. Interested in reinforcement learning, imitation learning, machine learning theory, and robotics https://chanb.github.io/
If you were at the last RLC NeurIPS event, you know it's not to be missed. Invite any RL researcher you know, no invite-only parties here.
10.12.2024 21:56 β π 18 π 2 π¬ 0 π 0Robotics manipulation to be specific :)
02.12.2024 15:29 β π 3 π 0 π¬ 0 π 0Can someone point me to any paper that uses RL on real-life (image-based) environments without sim2real/imitation learning? For good reasons I am told that this is pretty common but Iβve only found a handful of papers (CQN, QT-Opt, SAC-X)
02.12.2024 15:26 β π 3 π 0 π¬ 1 π 0Hi Csaba :)
30.11.2024 04:30 β π 1 π 0 π¬ 0 π 0Streaming Deep Reinforcement Learning Finally Works, by
M. Elsayed, G. Vasan, A. R. Mahmood, is one of those papers I wish I had written π
This paper seems to allow us to do RL with NNs as it should have always been done. Everyone should read it!
arxiv.org/abs/2410.14606
I also think many robot learning papers are overclaiming what they can doβ¦ the paper task setup can be very easy in comparison to real production systems even for pick-n-placeβ¦ but itβs hard to see this difference in the presented videos (if any)
27.11.2024 14:27 β π 3 π 0 π¬ 1 π 0I feel like many works do have experiments on sim but they donβt seem to transfer to real life (not in terms of sim2real but applying the same algo). I wonder how much of it comes from delays or us being overprotective of the robot in real life. Maybe evals need to include these components.
27.11.2024 14:23 β π 2 π 0 π¬ 2 π 0I thought that was just me π was trying it on an uncluttered single item picking task
27.11.2024 13:23 β π 6 π 0 π¬ 1 π 0Perhaps itβs βnecessaryβ to have it as a baseline (arguably TD3 is just as important imo but SAC seems to be more commonly used), and itβs hard to convince people which one is stronger than SAC. I think recently there are few e.g. ACE at Neurips. Generally feels like a popularity game to me
23.11.2024 17:29 β π 1 π 0 π¬ 0 π 0Let me try, weβve been very quiet historically π
12.11.2024 14:58 β π 0 π 0 π¬ 0 π 0Can I get added please :)
12.11.2024 14:16 β π 0 π 0 π¬ 1 π 0Please see me :)
12.11.2024 14:15 β π 0 π 0 π¬ 0 π 0Please add me, Iβve been doing robot learning with RL/IL!
12.11.2024 14:10 β π 1 π 0 π¬ 1 π 0