Anton Baumann's Avatar

Anton Baumann

@antonbaumann.bsky.social

18 Followers  |  40 Following  |  3 Posts  |  Joined: 10.12.2024  |  1.8302

Latest posts by antonbaumann.bsky.social on Bluesky

Actual problems like AI in space?
www.spacex.com/updates#xai-...

03.02.2026 17:42 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Excited to share our new work on Self-Distillation Policy Optimization (SDPO)!

SDPO is a simple algorithm that turns textual feedback into logit-level learning signals, enabling sample-efficient RL from runtime errors, LLM judges, and even binary feedback.

Preprint: arxiv.org/abs/2601.20802

30.01.2026 11:37 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

SDPO enables RL agents to learn from rich feedback (i.e., not only whether an attempt failed, but why it failed, such as error messages). Even without such rich feedback, SDPO can reflect on past attempts and outperform GRPO. SDPO also accelerates solution discovery at test time!

30.01.2026 07:17 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Training LLMs with verifiable rewards uses 1bit signal per generated response. This hides why the model failed.

Today, we introduce a simple algorithm that enables the model to learn from any rich feedback!
And then turns it into dense supervision.

(1/n)

29.01.2026 19:38 โ€” ๐Ÿ‘ 10    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

This has now been accepted at @iclr-conf.bsky.social !

26.01.2026 15:52 โ€” ๐Ÿ‘ 34    ๐Ÿ” 2    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

It's really hard to tell nowadays what is a made-up joke and what is reality.

16.01.2026 09:41 โ€” ๐Ÿ‘ 7    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The Nobel Prize committee should announce the World Cup winner tomorrow

06.12.2025 04:29 โ€” ๐Ÿ‘ 38765    ๐Ÿ” 7495    ๐Ÿ’ฌ 510    ๐Ÿ“Œ 302

Super interesting! Will the talk be recorded, or will the slides be available afterward?

06.12.2025 16:09 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I am hiring a PhD & postdoc to work together with me at KTH on probabilistic machine learning. Both positions are fully funded and part of WASP.

I will be attending @euripsconf.bsky.social, if you are around and want to talk about the positions or what we do at KTH, then ping me and we can meet.

23.11.2025 11:39 โ€” ๐Ÿ‘ 34    ๐Ÿ” 14    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Home Martin Trapp - Assistant Professor in Machine Learning at KTH Royal Institute of Technology.

Want to work on Trustworthy AI? ๐Ÿš€

I'm seeking exceptional candidates to apply for the Digital Futures Postdoctoral Fellowship to work with me on Uncertainty Quantification, Bayesian Deep Learning, and Reliability of ML Systems.

The position will be co-advised by Hossein Azizpour or Henrik Bostrรถm.

02.10.2025 14:46 โ€” ๐Ÿ‘ 11    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Post-hoc Probabilistic Vision-Language Models Vision-language models (VLMs), such as CLIP and SigLIP, have found remarkable success in classification, retrieval, and generative tasks. For this, VLMs deterministically map images and text descripti...

Unfortunately, our submission to #NeurIPS didnโ€™t go through with (5,4,4,3). But because I think itโ€™s an excellent paper, I decided to share it anyway.

We show how to efficiently apply Bayesian learning in VLMs, improve calibration, and do active learning. Cool stuff!

๐Ÿ“ arxiv.org/abs/2412.06014

18.09.2025 20:34 โ€” ๐Ÿ‘ 51    ๐Ÿ” 16    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Preview
Opinion | Stop Acting Like This Is Normal

www.nytimes.com/2025/09/07/o...

07.09.2025 11:48 โ€” ๐Ÿ‘ 897    ๐Ÿ” 237    ๐Ÿ’ฌ 104    ๐Ÿ“Œ 100
Post image Post image

I'm very excited to share notes on Probabilistic AI that I have been writing with @arkrause.bsky.social ๐Ÿฅณ

arxiv.org/pdf/2502.05244

These notes aim to give a graduate-level introduction to probabilistic ML + sequential decision-making.
I'm super glad to be able to share them with all of you now!

11.02.2025 08:19 โ€” ๐Ÿ‘ 121    ๐Ÿ” 25    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 3

Tomorrow Iโ€™ll be presenting our recent work on improving LLMs via local transductive learning in the FITML workshop at NeurIPS.
Join us for our โœจoralโœจ at 10:30am in east exhibition hall A.

Joint work with my fantastic collaborators Sascha Bongni, @idoh.bsky.social, @arkrause.bsky.social

13.12.2024 18:32 โ€” ๐Ÿ‘ 5    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I will present โœŒ๏ธ BDU workshop papers @ NeurIPS: one by Rui Li (looking for internships) and one by Anton Baumann.

๐Ÿ”— to extended versions:

1. ๐Ÿ™‹ "How can we make predictions in BDL efficiently?" ๐Ÿ‘‰ arxiv.org/abs/2411.18425

2. ๐Ÿ™‹ "How can we do prob. active learning in VLMs" ๐Ÿ‘‰ arxiv.org/abs/2412.06014

10.12.2024 15:18 โ€” ๐Ÿ‘ 18    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

@antonbaumann is following 20 prominent accounts