
Bryan Chan

@chanpyb.bsky.social

PhD student at University of Alberta. Interested in reinforcement learning, imitation learning, machine learning theory, and robotics https://chanb.github.io/

1,095 Followers  |  131 Following  |  11 Posts  |  Joined: 10.11.2024

Latest posts by chanpyb.bsky.social on Bluesky

If you were at the last RLC NeurIPS event, you know it's not to be missed. Invite any RL researcher you know, no invite-only parties here.

10.12.2024 21:56 — 👍 18    🔁 2    💬 0    📌 0

Robotics manipulation to be specific :)

02.12.2024 15:29 — 👍 3    🔁 0    💬 0    📌 0

Can someone point me to any paper that uses RL on real-life (image-based) environments without sim2real/imitation learning? For good reasons, I am told that this is pretty common, but I’ve only found a handful of papers (CQN, QT-Opt, SAC-X)

02.12.2024 15:26 — 👍 3    🔁 0    💬 1    📌 0

Hi Csaba :)

30.11.2024 04:30 — 👍 1    🔁 0    💬 0    📌 0
Preview
Streaming Deep Reinforcement Learning Finally Works: Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning ...

Streaming Deep Reinforcement Learning Finally Works, by M. Elsayed, G. Vasan, and A. R. Mahmood, is one of those papers I wish I had written 😅

This paper seems to let us do RL with NNs the way it should always have been done. Everyone should read it!

arxiv.org/abs/2410.14606

27.11.2024 23:09 — 👍 92    🔁 20    💬 2    📌 0

I also think many robot learning papers are overclaiming what they can do… the paper task setup can be very easy in comparison to real production systems even for pick-n-place… but it’s hard to see this difference in the presented videos (if any)

27.11.2024 14:27 — 👍 3    🔁 0    💬 1    📌 0

I feel like many works do have experiments in sim, but they don’t seem to transfer to real life (not in terms of sim2real, but applying the same algorithm). I wonder how much of it comes from delays or us being overprotective of the robot in real life. Maybe evals need to include these components.

27.11.2024 14:23 — 👍 2    🔁 0    💬 2    📌 0

I thought that was just me 😅 was trying it on an uncluttered single-item picking task

27.11.2024 13:23 — 👍 6    🔁 0    💬 1    📌 0

Perhaps it’s “necessary” to have it as a baseline (arguably TD3 is just as important imo, but SAC seems to be more commonly used), and it’s hard to convince people that a new method is stronger than SAC. I think there are a few recent ones, e.g. ACE at NeurIPS. Generally feels like a popularity game to me

23.11.2024 17:29 — 👍 1    🔁 0    💬 0    📌 0

Let me try, we’ve been very quiet historically 😂

12.11.2024 14:58 — 👍 0    🔁 0    💬 0    📌 0

Can I get added please :)

12.11.2024 14:16 — 👍 0    🔁 0    💬 1    📌 0

Please see me :)

12.11.2024 14:15 — 👍 0    🔁 0    💬 0    📌 0

Please add me, I’ve been doing robot learning with RL/IL!

12.11.2024 14:10 — 👍 1    🔁 0    💬 1    📌 0
