Fingers crossed it’ll actually live up to the hype accumulated over all these years 🙏
26.08.2025 08:51 — 👍 1 🔁 0 💬 0 📌 0@lukaschaefer.bsky.social
www.lukaschaefer.com Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noah’s Ark Lab, Dematic; Young researcher HLF 2022
Fingers crossed it’ll actually live up to the hype accumulated over all these years 🙏
26.08.2025 08:51 — 👍 1 🔁 0 💬 0 📌 0Thanks for sharing, hadn’t seen this before and definitely plan to catch up!
26.08.2025 08:49 — 👍 0 🔁 0 💬 1 📌 0🇨🇦 Heading to @rl-conference.bsky.social next week to present HyperMARL (@cocomarl-workshop.bsky.social) and Remember Markov (Finding The Frame Workshop).
If you are around, hmu, happy to chat about Multi-Agent Systems (MARL, agentic systems), open-endedness, environments, or anything related! 🎉
Will be at ICML and looking to hire a postdoc to help us scale up and deploy RL in self-driving. So, hit me up to chat.
10.07.2025 23:24 — 👍 23 🔁 6 💬 1 📌 1The Edinburgh RL Reading group is back with a fresh new website 👏
Anyone is welcome to attend!
Love these curated and shareable feeds on here. Such a good feature to give more control to the users and community to make the experience what they want it to be rather than leaving it in the control of the platform!
29.06.2025 10:25 — 👍 2 🔁 0 💬 0 📌 0Eugene is awesome! — if you are interested in autonomous driving and RL, and New York sounds like an exciting place for a postdoc then this is an amazing opportunity! 👇
23.06.2025 09:10 — 👍 1 🔁 0 💬 0 📌 0Hiring a postdoc to scale up and deploy RL-based planning onto some self-driving cars! We'll be building on arxiv.org/abs/2502.03349 and learn what the limits and challenges of RL planning are. Shoot me a message if interested and help spread the word please!
Full posting to come in a bit.
Thanks! @sharky6000.bsky.social
19.06.2025 15:15 — 👍 1 🔁 0 💬 0 📌 0We also updated and moved our code exercises which we had built for a summer school to the marl-book github page so they can be easily found at github.com/marl-book/ma...
Thanks @sacha2.bsky.social for the reminder on that! They had previously been hidden inside my GitHub account 😅
Our textbook “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” has sold out! Another print round with minor corrections is in production with @mitpress.bsky.social and coming 👏
An errata of these corrections and the updated book PDF can already be found at www.marl-book.com
Awesome to see the slides publicly available, wasn’t aware they are now!
And yes actually, the code exercises are here: github.com/LukasSchaefe...
Good point though, I wanted to migrate them to the book github project. I’ll have a stab at that later!
📜🤖 Can a shared multi-agent RL policy support both specialised & homogeneous team behaviours -- without changing the learning objective, requiring preset diversity levels or sequential updates? Our preprint “𝘏𝘺𝘱𝘦𝘳𝘔𝘈𝘙𝘓: 𝘈𝘥𝘢𝘱𝘵𝘪𝘷𝘦 𝘏𝘺𝘱𝘦𝘳𝘯𝘦𝘵𝘸𝘰𝘳𝘬𝘴 𝘧𝘰𝘳 𝘔𝘶𝘭𝘵𝘪-𝘈𝘨𝘦𝘯𝘵 𝘙𝘓” explores this!
27.05.2025 11:07 — 👍 11 🔁 2 💬 1 📌 2Thanks, will check it out!
23.05.2025 15:37 — 👍 1 🔁 0 💬 0 📌 0That sounds exciting! Are there any recordings or slides available to check this out? 👀
23.05.2025 15:14 — 👍 0 🔁 0 💬 1 📌 0Today, I’ll be presenting our work on exploration in MARL using ensembles here! 👇
Multiagent Learning Session
Where: Ambassador Ballroom 1 & 2
When: 14:00 - 14:13
I’ll also present the poster later at the Learn track of the poster session at 15:45 - 16:30
Thanks!
We wanted to have a game with more realistic visuals. We decided on CS:GO mainly since there already existed an open dataset to train on and compare results to prior methods that used the same dataset.
Also, we finished initial experiments actually on the same day as CS2 released 😅
Thanks for the encouragement Marc — I’ll look out for you!
18.05.2025 21:53 — 👍 1 🔁 0 💬 0 📌 0Thank!
18.05.2025 09:53 — 👍 1 🔁 0 💬 0 📌 0Thanks to all the co-authors and collaborators!
Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, David Bignell, Siddhartha Sen, Andrea Treviño Gavito, and first and foremost Sam Devlin
It's been an absolute joy working with this group of kind folks 👏
Paper: arxiv.org/abs/2312.02312
It took some time but I'm super excited that now our code is also open-source and available for everyone to use at: github.com/microsoft/im...
At the Adaptive and Learning Agents Workshop I'll be presenting our comprehensive study on the efficacy of different visual encoders for imitation learning in modern video games.
I'll be presenting the work as a short talk and poster at the ALA workshop on Monday!
This has been a long time coming, thanks a lot for my collaborators for all their help! Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V Albrecht, and David Mguni
It's actually going to be my first ever oral presentation, so excited (and nervous) about that 👀
At the main conference, I'll be presenting our work on using ensembles of value functions for multi-agent exploration!
I'll be presenting the oral at the Multi-agent Learning 1 session on Wednesday (2:00 - 3:45pm), and the poster after 3:45pm!
Paper: arxiv.org/abs/2302.03439
On my way to Detroit for @aamasconf.bsky.social! Looking forward to presenting the last work from my PhD at the main conference, and work from @msftresearch.bsky.social at the Adaptive and Learning Agents Workshop. More info 👇
If you'd like to chat, feel free to DM me!
Congrats Matthew, looks awesome! 👏
05.05.2025 14:22 — 👍 1 🔁 0 💬 0 📌 0I’ll be coming to Detroit! Would be awesome to have chance to meet and chat in person :)
04.05.2025 17:47 — 👍 2 🔁 0 💬 1 📌 0We are making available an experimental and interactive real-time gameplay experience in Copilot Labs, powered by our Muse family of world models. Learn more about the research underpinning this experience: www.microsoft.com/en-us/resear...
07.04.2025 19:07 — 👍 9 🔁 3 💬 0 📌 0Workshop website screenshot showing the title and important dates of the website.
Call for papers! 🎉 If you have felt confused about evaluation in explainable AI, this workshop is for you! We are excited to go to Bologna @ ECAI'25 as we try to build better methods for evaluating XAI. Can't wait to see what ideas the community will bring!
sites.google.com/view/excd-20...
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
12.03.2025 15:17 — 👍 19 🔁 13 💬 0 📌 0