Robotiq directly integrating touch, robustly and reliably (hopefully), is a big deal. Looking forward to playing with these.
Now reannotating old teleop data might be unfeasible, but RL...
www.linkedin.com/posts/vision...
@mwulfmeier.bsky.social
Large-Scale Robot Decision Making @GoogleDeepMind European @ELLISforEurope - imitation interaction transfer - priors: @oxfordrobots @berkeley_ai @ETH @MIT
Nice perspective from Lu Li, Tianwei Ni, Yihao Sun, and Pierre-Luc Bacon on different conditions for offline-to-online learning.
lnkd.in/dUKc4jyV
We have been heavily relying on replay across experiments (lnkd.in/duVpCUru). Few experiments get started these days without initial datasets available.
Distributional RL has been the foundation of our recent projects (e.g. lnkd.in/eYiCp-y8).
Nice to see a distributional perspective on improvement for LLMs from Amin Rakhsha, Kanika Madan, Tianyu Zhang, Amir Farahmand, and Amir Khasahmadi, via mode estimation rather than best-of-N: lnkd.in/d4NpyegJ
Curriculum matters more for active data generation than for fixed datasets. Data defines what you can learn and RL curriculum enables finding different datasets.
Great to see our paper mentioned in @dwarkesh_sp's blog post on RL efficiency.
dwarkesh.com/p/bits-per-sam...
arxiv.org/pdf/1707.05300
Didn't spend much time here and just had a look back: since when is most posting on bsky so aggressive and vile? And why? How can this get worse than x?
08.01.2026 19:34
Thrilled with the possibilities of the new @BostonDynamics design for AI research.
07.01.2026 18:33
While the human form is brilliant, it's ultimately limited.
The real future of robotics lies in the permission to move beyond our own biological constraints and optimize.
Note, we didn't achieve supersonic flight by mimicking the flapping of wings.
Our own work took this to the extreme by fully replacing the SFT stage with (inverse) RL lnkd.in/d_VeJ9FG
It's still an open question where on this spectrum we will eventually converge, but my estimate is we'll land much closer to full RL across the entire pipeline.
We're moving toward a unified paradigm where supervised learning adopts RL ideas for a better conditioned pipeline. Thrilled to see this space growing, e.g. via PPO-style clipping (Proximal SFT - Zhu et al.), direct KL regularization (Anchored SFT - Li et al.), and importance sampling (Direct Fine-Tuning - Wu et al.).
07.01.2026 14:46
Taking parental leave for some deeply needed reading and it's fascinating to see the walls between SFT and RL crumbling.
🧵
Featuring: Nihar B. Shah, Nitya Thakkar, Yutaro Yamada, Joelle Pineau, and a panel including Chris Bregler, Tom Dietterich, Andrew McCallum, Nathan Srebro, Markus Wulfmeier, & James Zou
03.12.2025 16:14
Join our NeurIPS social event: The Role of AI in Scientific Peer Review.
lnkd.in/dgdBiBqM
Help build community and explore solutions for a fair, efficient, and transparent peer review system.
Wed. Dec. 3rd, 7:00 PM – 9:00 PM (Upper Level Ballroom 6CDEF)
Join us to find out what's still missing share.google/AJZyTvg4HUPNZA...
(5+ role types for #Gemini #robotics @GoogleDeepMind)
Progress on robotics and AI has never been more closely linked. Access to leading frontier models has the potential to define what's possible in the physical world.
I'll be at #NeurIPS2025 for a couple of days next week. Find me if you want to know more.
x.com/sundarpichai...
Hindsight is 2025
28.11.2025 18:57
I'll be at #NeurIPS2025 next week!
Looking forward to catching up with old and new friends - on AI for robots, Gemini Robotics, the RLnaissance in AI, and bad puns.
Bon appetit!
Expensive to run
04.11.2025 19:17
Congrats Nathan!
21.10.2025 20:06
Truly enjoyed discussing the consolidation of specialist and generalist approaches to physical AI at #IROS2025.
Hoping to visit Hangzhou in physical rather than digital form myself in the not too distant future - second IROS AC dinner missed in a row.
#Robotics #physicalAI
The explosion of AI capability and complexity demands better understanding.
I firmly believe in studying large model behavior - from the perspective of artificial sequential decision making (like Inverse RL) to now linking it with human decision-making and cognitive models.
bit.ly/3Wqrtxl
'Papa, was ist das?'
As a parent, I've found that Gemini has made my life massively easier. We found this caterpillar earlier and the answer is in fact correct (verification is much easier than generation thanks to classical Google search).
Very curious about how LLMs are continuing to change the ways we learn!
The position paper track at #NeurIPS2025 was a great idea, the acceptance rate of under 6% not so much!
This is unnecessarily low and will reduce interest in any future iterations of the track.
(Disclaimer: our team is part of the 94%)
@NeurIPSConf
If you want one brain for any robot, cross-embodiment learning is the key. Check out the new model and tech report for sparks of it share.google/H5RBiwtCWnW7...
And catch up with the team (unfortunately without me this year) at #CORL2025!
More here:
x.com/sundarpichai...
Control problems are everywhere, but some applications of Reinforcement Learning are truly out of this world!
Our team's latest research @ Google DeepMind in #Science shows RL can improve sensitivity by 30-100x.
See how RL can accelerate cosmic discovery! (Image #nanobanana)
lnkd.in/e8r7t3NJ
Other companies folded or dropped their divisions.
They've given up too early. Congratulations, not easy to accept you have been wrong and lucky I had no bets. Lots of learnings. Now to scaling the rest of robotics!
#Robotics is hard, and so is #AutonomousDriving! Massive congratulations to my friends at #Waymo for proving me wrong!
x.com/Waymo/status...
There was a time a couple years back when I was getting skeptical about the scale of both technological and societal challenges for scaling autonomy....
Massive props to everyone organizing the #RSS2025 demo. I'm having a massive amount of FOMO here.
27.06.2025 17:58
Large language and vision models alone don't solve the whole #robotics problem.
But they surely have a massive impact on generalisation and robustness!
New scenes, new backgrounds, new objects, new people, new language and audio....
x.com/ayzwah/statu...
Don't have a robot? Try our newest Gemini Robotics on-Device VLA in simulation!
Or become a trusted tester and tune and adapt the model yourself!
www.youtube.com/watch?v=nVMY...
Thrilled to share something new: Fast, robust, and increasingly intelligent 🧠
Truly proud of our team and hoping you'll enjoy the model!
x.com/GoogleDeepMi...