Antoine Moulin's Avatar

Antoine Moulin

@antoine-mln.bsky.social

doing a phd in RL/online learning on questions related to exploration and adaptivity > https://antoine-moulin.github.io/

2,223 Followers  |  235 Following  |  67 Posts  |  Joined: 17.11.2024  |  2.0363

Latest posts by antoine-mln.bsky.social on Bluesky

Some text in heuristics/separation of results appear twice? Good post otherwise

14.12.2025 18:38 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Thank you for coming and for the great discussion!!

07.12.2025 22:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Schedule | ARLET The session will cover invited talks, contributed talks and posters. The tentative schedule in Pacific Daylight Time (GMTβˆ’7) can be found below.

So excited for the RL theory meets experiment workshop tomorrow at NeurIPS: arlet-workshop.github.io/neurips2025/...
Talks look AMAZING and you can hear me be the foolish experimentalist on a panel

06.12.2025 07:18 β€” πŸ‘ 16    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1

The paper looks cool! And it seems well-written :). I did not expect these exponents in Theorem 4.2 πŸ˜†

26.10.2025 14:56 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Very excited to share our preprint: Self-Speculative Masked Diffusions

We speed up sampling of masked diffusion models by ~2x by using speculative sampling and a hybrid non-causal / causal transformer

arxiv.org/abs/2510.03929

w/ @vdebortoli.bsky.social, Jiaxin Shi, @arnauddoucet.bsky.social

07.10.2025 22:09 β€” πŸ‘ 13    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Preview
Percepta | A General Catalyst Transformation Company Transforming critical institutions using applied AI. Let's harness the frontier.

We're finally out of stealth: percepta.ai
We're a research / engineering team working together in industries like health and logistics to ship ML tools that drastically improve productivity. If you're interested in ML and RL work that matters, come join us πŸ˜€

02.10.2025 15:35 β€” πŸ‘ 118    πŸ” 18    πŸ’¬ 7    πŸ“Œ 2
Preview
Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs Optimal transport between graphs, based on Gromov-Wasserstein and other extensions, is a powerful tool for comparing and aligning graph structures. However, solving the associated non-convex optimizat...

I am happy to share that our paper "Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs" was accepted at Neurips 2025 ! πŸ₯³

Huge thanks to my co-authors @rflamary.bsky.social and Bertrand Thirion !

arxiv.org/abs/2506.12025

(1/5)

29.09.2025 08:54 β€” πŸ‘ 31    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Post image

News πŸŽ‰ We’re thrilled to announce our final panelist: David Silver!
Don’t miss David and our amazing lineup of speakersβ€”submit your latest RL work to our NeurIPS workshop.
πŸ“… Extended deadline: Sept 2 (AoE)

28.08.2025 19:55 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Call for Papers | ARLET A simple, whitespace theme for academics. Based on [*folio](https://github.com/bogoli/-folio) design.

We've extended the deadline for our workshop's calls for papers/ideas! Submit your work by August 29 AoE. Instructions on the website: arlet-workshop.github.io/neurips2025/...

18.08.2025 10:46 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

The OpenReview link for our calls (for papers and ideas) is available, submit here: openreview.net/group?id=Neu...

We look forward to receiving your submissions!

05.08.2025 15:03 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

last year's edition was so much fun I'm really looking forward to this one!! join us in San Diego :))

28.07.2025 18:06 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Was it recorded? πŸ€”

19.06.2025 15:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Join us for Nneka's presentation tomorrow! Last talk before the summer break.

09.06.2025 17:43 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Post image

Join us tomorrow for Dave's talk! He will present his recent work on randomised exploration, which received an outstanding paper award at ALT 2025 earlier this year.

02.06.2025 14:55 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Inverse Q-Learning Done Right: Offline Imitation Learning in $Q^Ο€$-Realizable MDPs We study the problem of offline imitation learning in Markov decision processes (MDPs), where the goal is to learn a well-performing policy given a dataset of state-action pairs generated by an expert...

link: arxiv.org/abs/2505.19946

27.05.2025 07:16 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

new preprint with the amazing @lviano.bsky.social and @neu-rips.bsky.social on offline imitation learning! learned a lot :)

when the expert is hard to represent but the environment is simple, estimating a Q-value rather than the expert directly may be beneficial. lots of open questions left though!

27.05.2025 07:12 β€” πŸ‘ 18    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Preview
Nonlinear Meta-Learning Can Guarantee Faster Rates Many recent theoretical works on \emph{meta-learning} aim to achieve guarantees in leveraging similar representational structures from related tasks towards simplifying a target task. The main aim of ...

🚨 New paper accepted at SIMODS! 🚨
β€œNonlinear Meta-learning Can Guarantee Faster Rates”

arxiv.org/abs/2307.10870

When does meta learning work? Spoiler: generalise to new tasks by overfitting on your training tasks!

Here is why:
πŸ§΅πŸ‘‡

26.05.2025 16:50 β€” πŸ‘ 9    πŸ” 7    πŸ’¬ 2    πŸ“Œ 1

Dhruv Rohatgi will be giving a lecture on our recent work on comp-stat tradeoffs in next-token prediction at the RL Theory virtual seminar series (rl-theory.bsky.social) tomorrow at 2pm EST! Should be a fun talk---come check it out!!

26.05.2025 19:19 β€” πŸ‘ 11    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

new work on computing distances between stochastic processes ***based on sample paths only***! we can now:
- learn distances between Markov chains
- extract "encoder-decoder" pairs for representation learning
- with sample- and computational-complexity guarantees
read on for some quick details..
1/n

26.05.2025 13:26 β€” πŸ‘ 37    πŸ” 10    πŸ’¬ 1    πŸ“Œ 0
Preview
Discrete Diffusion: Continuous-Time Markov Chains A tutorial explaining some key intuitions behind continuous time Markov chains for machine learners interested in discrete diffusion models: alternative representations, connections to point processes...

A new blog post with intuitions behind continuous-time Markov chains, a building block of diffusion language models, like @inceptionlabs.bsky.social's Mercury and Gemini Diffusion. This post touches on different ways of looking at Markov chains, connections to point processes, and more.

22.05.2025 15:14 β€” πŸ‘ 22    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

oh the inference blog posts are back πŸ₯°

22.05.2025 15:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Mattes Mollenhauer, Nicole M\"ucke, Dimitri Meunier, Arthur Gretton: Regularized least squares learning with heavy-tailed noise is minimax optimal https://arxiv.org/abs/2505.14214 https://arxiv.org/pdf/2505.14214 https://arxiv.org/html/2505.14214

21.05.2025 06:14 β€” πŸ‘ 6    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1
Post image

Later today, Sikata and Marcel will talk about their recent work on oracle-efficient RL with ensembles. Join us!

20.05.2025 15:48 β€” πŸ‘ 6    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Excited to share what I've been up to: bringing text diffusion to Gemini!

Diffusion models are _fast_, and hold immense promise to challenge autoregressive models as the de facto standard for language modeling.

20.05.2025 18:52 β€” πŸ‘ 15    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

omg thanks

16.05.2025 00:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Community events and tutorials, list from the website

Community events and tutorials, list from the website

Workshops, list from the website

Workshops, list from the website

The tutorials, workshops, and community events for #COLT2025 have been announced!

Exciting topics, and impressive slate of speakers and events, on June 30! The workshops have calls for contributions (⏰ May 16, 19, and 25): check them out!
learningtheory.org/colt2025/ind...

10.05.2025 01:51 β€” πŸ‘ 20    πŸ” 7    πŸ’¬ 2    πŸ“Œ 0
Post image

Announcing the first workshop on Foundations of Post-Training (FoPT) at COLT 2025!

πŸ“ Soliciting abstracts/posters exploring theoretical & practical aspects of post-training and RL with language models!

πŸ—“οΈ Deadline: May 19, 2025

09.05.2025 17:09 β€” πŸ‘ 17    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1
Post image

looking forward to giving a talk at the 2025 GHOST day in the beautiful city of Poznan (Poland) --- hope to see you there this weekend!

06.05.2025 19:06 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

Is Best-of-N really the best we can do for language model inference?

New paper (appearing at ICML) led by the amazing Audrey Huang (ahahaudrey.bsky.social) with Adam Block, Qinghua Liu, Nan Jiang, and Akshay Krishnamurthy (akshaykr.bsky.social).

1/11

03.05.2025 17:40 β€” πŸ‘ 22    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1
Post image

Spectral Representation for Causal Estimation with Hidden Confounders
at #AISTATS2025

A spectral method for causal effect estimation with hidden confounders, for instrumental variable and proxy causal learning
arxiv.org/abs/2407.10448

Haotian Sun, @antoine-mln.bsky.social, Tongzheng Ren, Bo Dai

02.05.2025 12:36 β€” πŸ‘ 3    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

@antoine-mln is following 20 prominent accounts