Lukas Schäfer's Avatar

Lukas Schäfer

@lukaschaefer.bsky.social

www.lukaschaefer.com Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noah’s Ark Lab, Dematic; Young researcher HLF 2022

268 Followers  |  162 Following  |  48 Posts  |  Joined: 26.11.2024  |  2.1508

Latest posts by lukaschaefer.bsky.social on Bluesky

🇨🇦 Heading to @rl-conference.bsky.social next week to present HyperMARL (@cocomarl-workshop.bsky.social) and Remember Markov (Finding The Frame Workshop).

If you are around, hmu, happy to chat about Multi-Agent Systems (MARL, agentic systems), open-endedness, environments, or anything related! 🎉

03.08.2025 10:41 — 👍 9    🔁 2    💬 0    📌 2

Will be at ICML and looking to hire a postdoc to help us scale up and deploy RL in self-driving. So, hit me up to chat.

10.07.2025 23:24 — 👍 23    🔁 6    💬 1    📌 1

The Edinburgh RL Reading group is back with a fresh new website 👏
Anyone is welcome to attend!

10.07.2025 11:09 — 👍 4    🔁 0    💬 0    📌 0

Love these curated and shareable feeds on here. Such a good feature to give more control to the users and community to make the experience what they want it to be rather than leaving it in the control of the platform!

29.06.2025 10:25 — 👍 2    🔁 0    💬 0    📌 0

Eugene is awesome! — if you are interested in autonomous driving and RL, and New York sounds like an exciting place for a postdoc then this is an amazing opportunity! 👇

23.06.2025 09:10 — 👍 1    🔁 0    💬 0    📌 0
Preview
Robust Autonomy Emerges from Self-Play Self-play has powered breakthroughs in two-player and multi-player games. Here we show that self-play is a surprisingly effective strategy in another domain. We show that robust and naturalistic drivi...

Hiring a postdoc to scale up and deploy RL-based planning onto some self-driving cars! We'll be building on arxiv.org/abs/2502.03349 and learn what the limits and challenges of RL planning are. Shoot me a message if interested and help spread the word please!

Full posting to come in a bit.

21.06.2025 17:14 — 👍 60    🔁 24    💬 4    📌 1

Thanks! @sharky6000.bsky.social

19.06.2025 15:15 — 👍 1    🔁 0    💬 0    📌 0
Preview
GitHub - marl-book/marl-book-exercises: Code exercises for the MARL Textbook Code exercises for the MARL Textbook. Contribute to marl-book/marl-book-exercises development by creating an account on GitHub.

We also updated and moved our code exercises which we had built for a summer school to the marl-book github page so they can be easily found at github.com/marl-book/ma...

Thanks @sacha2.bsky.social for the reminder on that! They had previously been hidden inside my GitHub account 😅

19.06.2025 08:29 — 👍 2    🔁 0    💬 0    📌 0

Our textbook “Multi-Agent Reinforcement Learning: Foundations and Modern Approaches” has sold out! Another print round with minor corrections is in production with @mitpress.bsky.social and coming 👏

An errata of these corrections and the updated book PDF can already be found at www.marl-book.com

19.06.2025 08:26 — 👍 32    🔁 6    💬 2    📌 0
Preview
GitHub - LukasSchaefer/marl-book-exercises: Code exercises for the MARL Textbook Code exercises for the MARL Textbook. Contribute to LukasSchaefer/marl-book-exercises development by creating an account on GitHub.

Awesome to see the slides publicly available, wasn’t aware they are now!

And yes actually, the code exercises are here: github.com/LukasSchaefe...

Good point though, I wanted to migrate them to the book github project. I’ll have a stab at that later!

04.06.2025 21:38 — 👍 3    🔁 0    💬 0    📌 1
Video thumbnail

📜🤖 Can a shared multi-agent RL policy support both specialised & homogeneous team behaviours -- without changing the learning objective, requiring preset diversity levels or sequential updates? Our preprint “𝘏𝘺𝘱𝘦𝘳𝘔𝘈𝘙𝘓: 𝘈𝘥𝘢𝘱𝘵𝘪𝘷𝘦 𝘏𝘺𝘱𝘦𝘳𝘯𝘦𝘵𝘸𝘰𝘳𝘬𝘴 𝘧𝘰𝘳 𝘔𝘶𝘭𝘵𝘪-𝘈𝘨𝘦𝘯𝘵 𝘙𝘓” explores this!

27.05.2025 11:07 — 👍 10    🔁 2    💬 1    📌 2

Thanks, will check it out!

23.05.2025 15:37 — 👍 1    🔁 0    💬 0    📌 0

That sounds exciting! Are there any recordings or slides available to check this out? 👀

23.05.2025 15:14 — 👍 0    🔁 0    💬 1    📌 0

Today, I’ll be presenting our work on exploration in MARL using ensembles here! 👇

Multiagent Learning Session
Where: Ambassador Ballroom 1 & 2
When: 14:00 - 14:13

I’ll also present the poster later at the Learn track of the poster session at 15:45 - 16:30

21.05.2025 15:18 — 👍 2    🔁 0    💬 0    📌 0

Thanks!

We wanted to have a game with more realistic visuals. We decided on CS:GO mainly since there already existed an open dataset to train on and compare results to prior methods that used the same dataset.

Also, we finished initial experiments actually on the same day as CS2 released 😅

21.05.2025 12:06 — 👍 1    🔁 0    💬 1    📌 0

Thanks for the encouragement Marc — I’ll look out for you!

18.05.2025 21:53 — 👍 1    🔁 0    💬 0    📌 0

Thank!

18.05.2025 09:53 — 👍 1    🔁 0    💬 0    📌 0

Thanks to all the co-authors and collaborators!
Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, David Bignell, Siddhartha Sen, Andrea Treviño Gavito, and first and foremost Sam Devlin

It's been an absolute joy working with this group of kind folks 👏

18.05.2025 09:40 — 👍 2    🔁 0    💬 0    📌 0
Preview
GitHub - microsoft/imitation_learning_in_modern_video_games: Accompanying code for "Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games" publication Accompanying code for "Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games" publication - microsoft/imitation_learning_in_modern_video_games

Paper: arxiv.org/abs/2312.02312

It took some time but I'm super excited that now our code is also open-source and available for everyone to use at: github.com/microsoft/im...

18.05.2025 09:40 — 👍 4    🔁 0    💬 1    📌 1
Post image

At the Adaptive and Learning Agents Workshop I'll be presenting our comprehensive study on the efficacy of different visual encoders for imitation learning in modern video games.

I'll be presenting the work as a short talk and poster at the ALA workshop on Monday!

18.05.2025 09:40 — 👍 2    🔁 1    💬 1    📌 0

This has been a long time coming, thanks a lot for my collaborators for all their help! Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V Albrecht, and David Mguni

It's actually going to be my first ever oral presentation, so excited (and nervous) about that 👀

18.05.2025 09:40 — 👍 2    🔁 0    💬 2    📌 0
Post image

At the main conference, I'll be presenting our work on using ensembles of value functions for multi-agent exploration!

I'll be presenting the oral at the Multi-agent Learning 1 session on Wednesday (2:00 - 3:45pm), and the poster after 3:45pm!

Paper: arxiv.org/abs/2302.03439

18.05.2025 09:40 — 👍 3    🔁 1    💬 1    📌 1

On my way to Detroit for @aamasconf.bsky.social! Looking forward to presenting the last work from my PhD at the main conference, and work from @msftresearch.bsky.social at the Adaptive and Learning Agents Workshop. More info 👇

If you'd like to chat, feel free to DM me!

18.05.2025 09:40 — 👍 8    🔁 2    💬 2    📌 0

Congrats Matthew, looks awesome! 👏

05.05.2025 14:22 — 👍 1    🔁 0    💬 0    📌 0

I’ll be coming to Detroit! Would be awesome to have chance to meet and chat in person :)

04.05.2025 17:47 — 👍 2    🔁 0    💬 1    📌 0
Preview
WHAMM! Real-time world modelling of interactive environments. - Microsoft Research Today we are making available an interactive real-time gameplay experience in Copilot Labs, powered by our Muse family of world models. This blog post provides a deeper look at the research underpinni...

We are making available an experimental and interactive real-time gameplay experience in Copilot Labs, powered by our Muse family of world models. Learn more about the research underpinning this experience: www.microsoft.com/en-us/resear...

07.04.2025 19:07 — 👍 8    🔁 3    💬 0    📌 0
Workshop website screenshot showing the title and important dates of the website.

Workshop website screenshot showing the title and important dates of the website.

Call for papers! 🎉 If you have felt confused about evaluation in explainable AI, this workshop is for you! We are excited to go to Bologna @ ECAI'25 as we try to build better methods for evaluating XAI. Can't wait to see what ideas the community will bring!
sites.google.com/view/excd-20...

14.03.2025 15:41 — 👍 6    🔁 3    💬 1    📌 0
Job Details

My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...

12.03.2025 15:17 — 👍 19    🔁 13    💬 0    📌 0
Preview
Why we're training AI on video games AI is already a powerful tool for collaboration — but this is just the tip of the iceberg, says AI researcher Katja Hofmann. She describes her team's work training AI on years of human gameplay data f...

What happens when we train generative AI models to predict everything that happens in a modern video game? In my TED AI talk I share insights from how AI can learn the physics of a game, and how this research could empower game creatives: www.ted.com/talks/katja_...

07.03.2025 18:00 — 👍 6    🔁 1    💬 2    📌 0

Wow! Awesome to see this recognition 👏

05.03.2025 10:48 — 👍 2    🔁 0    💬 0    📌 0

@lukaschaefer is following 20 prominent accounts