Marc Lanctot's Avatar

Marc Lanctot

@sharky6000.bsky.social

Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning. Lover of Linux 🐧, coffee β˜•, and retro gaming. Big fan of open-source. #gohabsgo πŸ‡¨πŸ‡¦ For more info: https://linktr.ee/sharky6000

8,732 Followers  |  425 Following  |  2,159 Posts  |  Joined: 29.12.2023
Posts Following

Posts by Marc Lanctot (@sharky6000.bsky.social)

Pretty nice looking thesis, thanks!

01.03.2026 22:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

If they're basically solving a slightly perturbed game, that would be great news.. because then I believe it would easy to have an "active" version (in the sense of bsky.app/profile/shar...) based on adversarial bandits. Will have to dig into the detail and ask Serena about it. πŸ˜€

01.03.2026 17:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Here's the overview. I highlighted one aspect of this that I really like, because vanilla VasE does not do anything special to handle the statistical uncertainty that is present in the scores out-of-the-box, which could be quite relevant when comparing agents.

01.03.2026 17:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Except it does it in a more sophisticated way with targeted ambiguity sets *and* it maintains some properties similar to the classical maximal lotteries.

01.03.2026 17:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

If I am right about that, it is similar in spirit to the motivation behind "Projected Replicator Dynamics" in the PSRO paper which simulated a constrained equation that had a lower bound on the probabilities. Or how in Nash averaging they use maximum entropy Nash equilibrium.

01.03.2026 17:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0

Yes we use LPs to solve the maximal lotteries objectives too (they are basically two-player zero-sum games). Problem is that makes them sensitive to small changes. My first take was this seems like a way to redesign the LP to spread weight elsewhere...? To avoid the sensitivities.

01.03.2026 17:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yeah we have been in touch with Serena Wang so we know of this work but I have only skimmed it so far. Looks neat!

01.03.2026 17:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Spoke to the authors this week! Nice work. They presented in London.

27.02.2026 22:10 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

This is amazing.

www.getyourfuckingmoneyback.com

27.02.2026 17:49 β€” πŸ‘ 36673    πŸ” 11868    πŸ’¬ 492    πŸ“Œ 798

So indeed you can go a long way with just proper modeling. What I am curious about is whether predictive modeling like this will generalize outside RRPS. I expect that it will. And indeed maybe it already covers much of the gain we expect from search/reasoning.

27.02.2026 21:56 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yeah, that is a great question! In our RRPS paper from '23, we ran RL in a self-play setting where the "predictive agent" was endowed with the ability to predict which bot it was playing against. And when we tested it again held-out, unknown bots it did much better than standard self-play bots.

27.02.2026 21:56 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Ah yes that is a good point. The classical one is zero-sum but a different one-- which is not zero-sum-- (with the same equilibrim) was used when soliciting the human data.

27.02.2026 21:46 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Hey what leaderboard / site is this from?

25.02.2026 23:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Omg, I can def relate to this...

25.02.2026 23:42 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
AI models that simulate internal debate dramatically improve accuracy on complex tasks A new study reveals that top models like DeepSeek-R1 succeed by simulating internal debates. Here is how enterprises can harness this "society of thought" to build more robust, self-correcting agents.

Grateful to @venturebeat.com for featuring our Paradigms of Intelligence team’s research on β€œsocieties of thought,” or internal multi-agent dialogues.

Read the full piece, which includes a thoughtful quote from my friend & colleague James Evans: bit.ly/3ZN4oa5

25.02.2026 16:01 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

What???

A Cirque show.. named Ludo.. at a conference banquet dinner!!

🀯🀯🀯

So cool! Can't wait for this! πŸ₯°

25.02.2026 23:39 β€” πŸ‘ 9    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Getting a lot easier to avoid tbh because the pro-AI contingent is greater in numbers than before.

But the last few times it happened, it was triggered exactly by this kind of question... but it was posed quite rudely, so maybe you have not (yet) triggered them...? πŸ˜…πŸ‘

25.02.2026 01:48 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ’”

22.02.2026 15:54 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

😱😭

22.02.2026 15:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ’―!!

22.02.2026 15:51 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Ridiculous save by Connor Hellebuyck keeps the game tied #milanocortina2026 #cbcsports
YouTube video by CBC Sports Ridiculous save by Connor Hellebuyck keeps the game tied #milanocortina2026 #cbcsports

I know, insane!! Did you see that stick stop?? 🀯🀯🀯

youtube.com/shorts/UczOl...

22.02.2026 15:41 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Omg.. overtime.

And the shots are now over 40!! 🀩

22.02.2026 15:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Still tied... with 8 minutes to go!

Shots are 37 - 17 !! 🀯

22.02.2026 15:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Cale Makar ties it for Canada against USA 1-1 in men's hockey final | Day 16 | Milano Cortina 2026
YouTube video by CBC Sports Cale Makar ties it for Canada against USA 1-1 in men's hockey final | Day 16 | Milano Cortina 2026

Canada πŸ‡¨πŸ‡¦ / USA πŸ‡ΊπŸ‡Έ gold medal πŸ… hockey game tied 1-1 heading into the third!!

youtu.be/IlrAWd-yYD0?...

22.02.2026 14:53 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Preview
Steal a Brainrot - Wikipedia

462K current active players right now in Steal a Brainrot: en.wikipedia.org/wiki/Steal_a...

That's 0.005% of the world's population. πŸ€―πŸ˜…

I think it's still the most popular game in the world. It set a world record of 20M concurrent users in August (still the standing record IIRC)

22.02.2026 13:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Haha, nope, at least not yet in my son's friend groups πŸ˜…

There are still timed special events in Roblox every Saturday

22.02.2026 13:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Aha, global knowledge spreading of hockey by Bluesky... I approve! 🫢 Thanks for the clarification.

Speaking of which... gold medal Olympics between πŸ‡¨πŸ‡¦ and πŸ‡ΊπŸ‡Έ today, starting momentarily! πŸ’πŸ…πŸ€©

22.02.2026 13:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Especially if you have kids around the ages of 7-10 you may know about Italian Brainrot (en.wikipedia.org/wiki/Italian...).

Well, today I discovered my new favorite character: Pingu Amore, a cupid penguin will cool shades! πŸΉβ€οΈπŸ§πŸ˜ŽπŸ˜‡

22.02.2026 13:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Or are the Habs really global? 😁

22.02.2026 12:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Haha, noted βœ…οΈ πŸ‘Œ

Somehow I had gotten the impression that you were Australian.. is that not right?

(Surprised that you know about the Habs.)

Are you an expat teaching is Australia?

22.02.2026 12:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0