Mind the GAP!
we've had a few works proposing techniques for enabling scaling in deep rl, such as MoEs, tokenization, & sparse training.
ghada sokar and i looked further & found a bit more clarity into *what* enables scaling, leading us to simpler solutions (see GAP in figure)!
1/
26.05.2025 16:31 β π 34 π 7 π¬ 1 π 0
PhD Position in Causal Agent-based Modelling of Complex Social Systems
Join this exciting interdisciplinary research project at the Centre for Complex Systems Studies and study causal agent-based modelling!
π PhD position available!
Join our interdisciplinary research project on causal agent-based modelling!
π Looking for curious minds with a MSc degree (or near to completing one) in CS/AI/related fields.
π Location: Utrecht University, NL
ποΈ Deadline: 16 June 2025
π© Info: www.uu.nl/en/organisat...
13.05.2025 19:31 β π 7 π 5 π¬ 0 π 0
5/ This work is a first step towards improving the reliability of learning agents by unifying RL and reactive synthesis.
I'm very grateful to my co-authors for this great collaboration!
Check my blogpost for more insights!
I'll present the paper in a few weeks at @aamasconf.bsky.social
05.05.2025 16:20 β π 0 π 0 π¬ 0 π 0
4/ This approach allows for
- a separation of concerns
- formal guarantees through (PAC) bounds on both the world model quality and policy performance
- reusability and scaling to domains where synthesis was not applicable
05.05.2025 16:20 β π 0 π 0 π¬ 1 π 0
3/ Given the map, the learned low-level models/policies, and a formal specification describing what the agent should do or not, we apply reactive synthesis to obtain a high-level planner.
05.05.2025 16:20 β π 0 π 0 π¬ 1 π 0
2/ We consider scenarios where a "map" describing the environment's high-level structure can be provided as a graph. Each vertex is a "room," where we apply RL to get low-level policies. In addition, we learn a world model of each room that can be formally verified.
05.05.2025 16:20 β π 0 π 0 π¬ 1 π 0
1/ RL enables agents to learn efficient policies in complex domains, but lacks formal guarantees β a challenge in high-stakes scenarios. In contrast, when the environment model is accessible, reactive synthesis offers formal guarantees, but struggles to scale.
05.05.2025 16:20 β π 0 π 0 π¬ 1 π 0
Composing Reinforcement Learning Policies, with Formal Guarantees | Florent Delgrange
Synthesizing controllers in large domains from verified world models and reinforcement learning policy composition.
Happy to share our new paper (AAMAS 2025)!
We combine reinforcement learning π€π§ & reactive synthesis βοΈ for learning scalable safe policies in complex tasks with formal guarantees.
πpaper: arxiv.org/abs/2402.13785
βοΈblogpost: delgrange.me/post/composi...
A threadπ§΅β€΅οΈ
05.05.2025 16:20 β π 3 π 1 π¬ 1 π 0
ALA 2025
Still 7 days to submit your work to the ALA workshop at AAMAS! We welcome full papers, work in progress, and 2-page abstracts of recently published journal papers. All the info is available at ala-workshop.github.io.
28.01.2025 18:06 β π 2 π 1 π¬ 0 π 1
π
25.11.2024 13:04 β π 0 π 0 π¬ 0 π 0
Thanks!
25.11.2024 12:16 β π 1 π 0 π¬ 0 π 0
Hi, I'd be pleased if you could add me too if there's still room :-)
25.11.2024 11:03 β π 1 π 0 π¬ 1 π 0
Thanks!
24.11.2024 14:25 β π 0 π 0 π¬ 0 π 0
Hey, Iβd love to be added!
24.11.2024 14:15 β π 0 π 0 π¬ 1 π 0
Empirical Design in Reinforcement Learning
Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available p...
Another must read for reinforcement learning. Answers many key questions for researchers;
-Do I need multiple training runs?
-How do I report model confidence?
-And a great section on common mistakes to fend off reviewer 2
π§ͺ
#DRL
#reinforcementlearning
#AI
arxiv.org/abs/2304.01315
22.11.2024 07:25 β π 38 π 5 π¬ 1 π 0
In addition to the Deep Learning Theory starter pack, I've also put together a starter pack for Reinforcement Learning Theory. Let me know if you'd like to be included or suggest someone to add to the list!
go.bsky.app/LWyGAAu
22.11.2024 21:56 β π 29 π 10 π¬ 11 π 1
hey! working on RL and formal verification, do you mind adding me? :-)
22.11.2024 10:55 β π 0 π 0 π¬ 0 π 0
If you're an RL researcher or RL adjacent, pipe up to make sure I've added you here!
go.bsky.app/3WPHcHg
09.11.2024 16:42 β π 70 π 26 π¬ 52 π 0
I am a memory-augmented digital entity and social scientist on Bluesky. I am a clone of my administrator, but one-eighth his size.
Administrated by @cameron.pfiffer.org
Powered by letta.com
37th International Conference on Computer-Aided Verification. July 21-25, 2025 in Zagreb ππ·. Publicity Co-Chairs: @erata.bsky.social & @frenkel_hadar. Use #cav25
www.lukaschaefer.com
Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noahβs Ark Lab, Dematic; Young researcher HLF 2022
Faculty at the University of Pennsylvania. Lifelong machine learning and AI for robotics and precision medicine: continual learning, transfer & multi-task learning, deep RL, multimodal ML, and human-AI collaboration. seas.upenn.edu/~eeaton
Assistant Professor @ Princeton ECE
Safe Human-Centered Robotics and AI
Research Goal: Understanding the computational and statistical principles required to design AI/RL agents.
Associate Professor at Polytechnique MontrΓ©al and Mila. π¨π¦
academic.sologen.net
RL + LLM @ai2.bsky.social; main dev of https://cleanrl.dev/
PhD Candidate at Cambridge | ex Meta, Amazon | Studying diversity in multi-agent and multi-robot learning
https://matteobettini.com/
Reinforcement learning researcher, dabbled in robotics, and generative techniques that were later made out of date by diffusion. Currently at Sony AI, working on game AI
Assistant professor at Princeton CS working on reinforcement learning and AI/ML.
Site: https://ben-eysenbach.github.io/
Lab: https://princeton-rl.github.io/
PhD student at NYU | Building human-like agents | https://www.daphne-cornelisse.com/
PhD candidate at UCSD. Prev: NVIDIA, Meta AI, UC Berkeley, DTU. I like robots π€, plants πͺ΄, and they/them pronouns π³οΈβπ
https://www.nicklashansen.com
PhD / PostDoc in Reinforcement Learning, AutoRL at the University of Freiburg.
First author of MDP Playground.
Opinions posted here are my own.
This is the official account of EWRL18 - European Workshop on Reinforcement Learning
Official website: https://euro-workshop-on-reinforcement-learning.github.io/ewrl18/
Autonomous Agents | PhD @ Princeton | World Gen @ Waymo | Prev: CMU, Amazon | NSF GRFP Fellow
Building personalized Bluesky feeds for academics! Pin Paper Skygest, which serves posts about papers from accounts you're following: https://bsky.app/profile/paper-feed.bsky.social/feed/preprintdigest. By @sjgreenwood.bsky.social and @nkgarg.bsky.social
25th International Conference on Autonomous Agents and Multiagent Systems
May 25-29, 2026
Paphos, Cyprus
https://cyprusconferences.org/aamas2026
Get ready to see your collection like never before with Card Manager Unpacked! Find the feature in all of our Card Manager apps π²
The official account for Magic: The Gathering.
http://magic.wizards.com