🇨🇦 Heading to @rl-conference.bsky.social next week to present HyperMARL (@cocomarl-workshop.bsky.social) and Remember Markov (Finding The Frame Workshop).
If you are around, hmu, happy to chat about Multi-Agent Systems (MARL, agentic systems), open-endedness, environments, or anything related! 🎉
03.08.2025 10:41 — 👍 9 🔁 2 💬 0 📌 2
Will be at ICML and looking to hire a postdoc to help us scale up and deploy RL in self-driving. So, hit me up to chat.
10.07.2025 23:24 — 👍 23 🔁 6 💬 1 📌 1
The Edinburgh RL Reading group is back with a fresh new website 👏
Anyone is welcome to attend!
10.07.2025 11:09 — 👍 4 🔁 0 💬 0 📌 0
Love these curated and shareable feeds on here. Such a good feature to give more control to the users and community to make the experience what they want it to be rather than leaving it in the control of the platform!
29.06.2025 10:25 — 👍 2 🔁 0 💬 0 📌 0
Eugene is awesome! — if you are interested in autonomous driving and RL, and New York sounds like an exciting place for a postdoc then this is an amazing opportunity! 👇
23.06.2025 09:10 — 👍 1 🔁 0 💬 0 📌 0
Robust Autonomy Emerges from Self-Play
Self-play has powered breakthroughs in two-player and multi-player games. Here we show that self-play is a surprisingly effective strategy in another domain. We show that robust and naturalistic drivi...
Hiring a postdoc to scale up and deploy RL-based planning onto some self-driving cars! We'll be building on arxiv.org/abs/2502.03349 and learn what the limits and challenges of RL planning are. Shoot me a message if interested and help spread the word please!
Full posting to come in a bit.
21.06.2025 17:14 — 👍 60 🔁 24 💬 4 📌 1
Thanks! @sharky6000.bsky.social
19.06.2025 15:15 — 👍 1 🔁 0 💬 0 📌 0
GitHub - marl-book/marl-book-exercises: Code exercises for the MARL Textbook
Code exercises for the MARL Textbook. Contribute to marl-book/marl-book-exercises development by creating an account on GitHub.
We also updated and moved our code exercises which we had built for a summer school to the marl-book github page so they can be easily found at github.com/marl-book/ma...
Thanks @sacha2.bsky.social for the reminder on that! They had previously been hidden inside my GitHub account 😅
19.06.2025 08:29 — 👍 2 🔁 0 💬 0 📌 0
Our textbook “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” has sold out! Another print round with minor corrections is in production with @mitpress.bsky.social and coming 👏
An errata of these corrections and the updated book PDF can already be found at www.marl-book.com
19.06.2025 08:26 — 👍 32 🔁 6 💬 2 📌 0
GitHub - LukasSchaefer/marl-book-exercises: Code exercises for the MARL Textbook
Code exercises for the MARL Textbook. Contribute to LukasSchaefer/marl-book-exercises development by creating an account on GitHub.
Awesome to see the slides publicly available, wasn’t aware they are now!
And yes actually, the code exercises are here: github.com/LukasSchaefe...
Good point though, I wanted to migrate them to the book github project. I’ll have a stab at that later!
04.06.2025 21:38 — 👍 3 🔁 0 💬 0 📌 1
📜🤖 Can a shared multi-agent RL policy support both specialised & homogeneous team behaviours -- without changing the learning objective, requiring preset diversity levels or sequential updates? Our preprint “𝘏𝘺𝘱𝘦𝘳𝘔𝘈𝘙𝘓: 𝘈𝘥𝘢𝘱𝘵𝘪𝘷𝘦 𝘏𝘺𝘱𝘦𝘳𝘯𝘦𝘵𝘸𝘰𝘳𝘬𝘴 𝘧𝘰𝘳 𝘔𝘶𝘭𝘵𝘪-𝘈𝘨𝘦𝘯𝘵 𝘙𝘓” explores this!
27.05.2025 11:07 — 👍 10 🔁 2 💬 1 📌 2
Thanks, will check it out!
23.05.2025 15:37 — 👍 1 🔁 0 💬 0 📌 0
That sounds exciting! Are there any recordings or slides available to check this out? 👀
23.05.2025 15:14 — 👍 0 🔁 0 💬 1 📌 0
Today, I’ll be presenting our work on exploration in MARL using ensembles here! 👇
Multiagent Learning Session
Where: Ambassador Ballroom 1 & 2
When: 14:00 - 14:13
I’ll also present the poster later at the Learn track of the poster session at 15:45 - 16:30
21.05.2025 15:18 — 👍 2 🔁 0 💬 0 📌 0
Thanks!
We wanted to have a game with more realistic visuals. We decided on CS:GO mainly since there already existed an open dataset to train on and compare results to prior methods that used the same dataset.
Also, we finished initial experiments actually on the same day as CS2 released 😅
21.05.2025 12:06 — 👍 1 🔁 0 💬 1 📌 0
Thanks for the encouragement Marc — I’ll look out for you!
18.05.2025 21:53 — 👍 1 🔁 0 💬 0 📌 0
Thank!
18.05.2025 09:53 — 👍 1 🔁 0 💬 0 📌 0
Thanks to all the co-authors and collaborators!
Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, David Bignell, Siddhartha Sen, Andrea Treviño Gavito, and first and foremost Sam Devlin
It's been an absolute joy working with this group of kind folks 👏
18.05.2025 09:40 — 👍 2 🔁 0 💬 0 📌 0
At the Adaptive and Learning Agents Workshop I'll be presenting our comprehensive study on the efficacy of different visual encoders for imitation learning in modern video games.
I'll be presenting the work as a short talk and poster at the ALA workshop on Monday!
18.05.2025 09:40 — 👍 2 🔁 1 💬 1 📌 0
This has been a long time coming, thanks a lot for my collaborators for all their help! Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V Albrecht, and David Mguni
It's actually going to be my first ever oral presentation, so excited (and nervous) about that 👀
18.05.2025 09:40 — 👍 2 🔁 0 💬 2 📌 0
At the main conference, I'll be presenting our work on using ensembles of value functions for multi-agent exploration!
I'll be presenting the oral at the Multi-agent Learning 1 session on Wednesday (2:00 - 3:45pm), and the poster after 3:45pm!
Paper: arxiv.org/abs/2302.03439
18.05.2025 09:40 — 👍 3 🔁 1 💬 1 📌 1
On my way to Detroit for @aamasconf.bsky.social! Looking forward to presenting the last work from my PhD at the main conference, and work from @msftresearch.bsky.social at the Adaptive and Learning Agents Workshop. More info 👇
If you'd like to chat, feel free to DM me!
18.05.2025 09:40 — 👍 8 🔁 2 💬 2 📌 0
Congrats Matthew, looks awesome! 👏
05.05.2025 14:22 — 👍 1 🔁 0 💬 0 📌 0
I’ll be coming to Detroit! Would be awesome to have chance to meet and chat in person :)
04.05.2025 17:47 — 👍 2 🔁 0 💬 1 📌 0
Workshop website screenshot showing the title and important dates of the website.
Call for papers! 🎉 If you have felt confused about evaluation in explainable AI, this workshop is for you! We are excited to go to Bologna @ ECAI'25 as we try to build better methods for evaluating XAI. Can't wait to see what ideas the community will bring!
sites.google.com/view/excd-20...
14.03.2025 15:41 — 👍 6 🔁 3 💬 1 📌 0
Job Details
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
12.03.2025 15:17 — 👍 19 🔁 13 💬 0 📌 0
Why we're training AI on video games
AI is already a powerful tool for collaboration — but this is just the tip of the iceberg, says AI researcher Katja Hofmann. She describes her team's work training AI on years of human gameplay data f...
What happens when we train generative AI models to predict everything that happens in a modern video game? In my TED AI talk I share insights from how AI can learn the physics of a game, and how this research could empower game creatives: www.ted.com/talks/katja_...
07.03.2025 18:00 — 👍 6 🔁 1 💬 2 📌 0
Wow! Awesome to see this recognition 👏
05.03.2025 10:48 — 👍 2 🔁 0 💬 0 📌 0
EurIPS is a community-organized, NeurIPS-endorsed conference in Copenhagen where you can present papers accepted at @neuripsconf.bsky.social
eurips.cc
RL & Agents Reading Group @ University of Edinburgh
We regularly discuss recent papers in RL, MARL & related
https://edinburgh-rl.github.io/reading-group
PhD Student doing RL in POMDP at the University of Liège - Intern at McGill - gsprd.be
PhD student at @cmurobotics.bsky.social working on interactive algorithms for agentic alignment (e.g. imitation/RLHF). no model is an island. https://gokul.dev/.
Global Health. Energy Innovation. Alzheimer’s. https://gatesnot.es/tgn
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence.
Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Associate Professor - University of Alberta
Canada CIFAR AI Chair with Amii
Machine Learning and Program Synthesis
he/him; ele/dele 🇨🇦 🇧🇷
https://www.cs.ualberta.ca/~santanad
Research Scientist at Google DeepMind | Building Gemini
guabhinav.com
postdoc @ ai lab, Vrije Universiteit Brussel
working on providing reliable and verifiable ai mechanisms
#RL & formal methods
delgrange.me
Postdoc at @kthuniversity.bsky.social.
Past: @gaipslab.bsky.social; Sony AI.
Interested in all things RL and Multimodal.
miguelvasco.com
phd student building computational models of social cognition @ edinburgh | prev imperial, ucl, inria
https://maxtaylordavi.es
Year Progress Bot. Maintained by @haider.bsky.social
Buy me a Coffee: https://buymeacoffee.com/halipunjabi
AITHYRA is a new dynamic research institute for biomedical AI in Vienna. AITHYRA seeks to build Europe’s premier institute for AI-driven biological and medical research, uniting computer scientists, engineers, and biologists in a collaborative environment.
A world-class research hub in AI and machine learning, in partnership with universities, RDI organizations and businesses in Finland. We are the 2nd institute in the @ellis.eu network.
🔗 ellisinstitute.fi
GM Gaming AI, Innovation & Science. Robot school dropout.
I work at Sakana AI 🐟🐠🐡 → @sakanaai.bsky.social
https://sakana.ai/careers
Researching planning, reasoning, and RL in LLMs. Previously: Google DeepMind, UC Berkeley, MIT. I post about: AI 🤖, flowers 🌷, parenting 👶, public transit 🚆. She/her.
http://www.jesshamrick.com
Building personalized Bluesky feeds for academics! Pin Paper Skygest, which serves posts about papers from accounts you're following: https://bsky.app/profile/paper-feed.bsky.social/feed/preprintdigest. By @sjgreenwood.bsky.social and @nkgarg.bsky.social
Research director @Inria, Head of @flowersInria
lab, prev. @MSFTResearch @SonyCSLParis
Artificial intelligence, cognitive sciences, sciences of curiosity, language, self-organization, autotelic agents, education, AI and society
http://www.pyoudeyer.com
Professor, Santa Fe Institute. Research on AI, cognitive science, and complex systems.
Website: https://melaniemitchell.me
Substack: https://aiguide.substack.com/