Francis Bach's Avatar

Francis Bach

@bachfrancis.bsky.social

Researcher in machine learning

2,258 Followers  |  14 Following  |  14 Posts  |  Joined: 17.11.2024
Posts Following

Posts by Francis Bach (@bachfrancis.bsky.social)

Post image Post image Post image

Still using temperature scaling?
With @dholzmueller.bsky.social, Michael I. Jordan and @bachfrancis.bsky.social we argue that with well designed regularization, more expressive models like matrix scaling can outperform simpler ones across calibration set sizes, data dimensions, and applications.

13.11.2025 12:27 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Frederik Kunstner

Together with the great Frederik Kunstner (fkunstner.github.io)

27.09.2025 14:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Not all scaling laws are nice power laws. This month’s blog post: Zipf’s law in next-token prediction and why Adam (ok, sign descent) scales better to large vocab sizes than gradient descent: francisbach.com/scaling-laws...

27.09.2025 14:57 β€” πŸ‘ 47    πŸ” 12    πŸ’¬ 1    πŸ“Œ 0
Post image

Avec en prime une photo du lac de Roselend.

21.07.2025 06:58 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Jamais deux sans trois. Le dΓ©partement d’informatique de l’ENS toujours en forme. 131 km, 4500m de dΓ©nivelΓ©, cette fois-ci avec du Beaufort aux ravitaillements!
Bravo Olivier CappΓ©! #LEtapeduTour

21.07.2025 06:56 β€” πŸ‘ 18    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/

18.07.2025 14:39 β€” πŸ‘ 24    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0
Post image

Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/

18.07.2025 14:24 β€” πŸ‘ 19    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Preview
A Collectivist, Economic Perspective on AI Information technology is in the midst of a revolution in which omnipresent data collection and machine learning are impacting the human world as never before. The word "intelligence" is being used as...

What if AI isn’t about building solo geniuses, but designing social systems?
Michael Jordan advocates blending ML, economics, and uncertainty management to prioritize social welfare over mere prediction.
A must-read rethink.
arxiv.org/abs/2507.062...

13.07.2025 13:10 β€” πŸ‘ 37    πŸ” 9    πŸ’¬ 0    πŸ“Œ 0

Big thanks to the COLT 2025 organizers for an awesome event in Lyon! Here are the slides from my keynote this morning in case you’re curious about the references I mentioned: www.di.ens.fr/~fbach/fbach...

01.07.2025 21:11 β€” πŸ‘ 19    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

Register at PAISS 1-5 Sept 2025 @inria_grenoble with very talented speakers this year πŸ™‚
paiss.inria.fr
cc @mvladimirova.bsky.social

09.06.2025 20:54 β€” πŸ‘ 8    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Post image

The PAISS summer school is back with an incredible line of speakers (and more to come). Spread the word !

05.05.2025 16:34 β€” πŸ‘ 23    πŸ” 11    πŸ’¬ 1    πŸ“Œ 3
Post image

Γ‰pisode 5 de notre sΓ©rie "Les nouveaux visages de l’AcadΓ©mie des sciences" : La statistique, une science en son temps.

Retrouvez la vidΓ©o sur Youtube : www.youtube.com/watch?v=2AhT...

06.05.2025 12:42 β€” πŸ‘ 13    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Mathematical Aspects of Data Science Graduate Summer School - EPFL - Sept. 1-5, 2025

Announcing : The 2nd International Summer School on Mathematical Aspects of Data Science
mathsdata2025.github.io
EPFL, Sept 1–5, 2025

Speakers:
Bach @bachfrancis.bsky.social
Bandeira
Mallat
Montanari
PeyrΓ© @gabrielpeyre.bsky.social

For PhD students & early-career researchers
Apply before May 15!

14.04.2025 17:00 β€” πŸ‘ 46    πŸ” 24    πŸ’¬ 1    πŸ“Œ 1

[NOUVELLE SERIE "LES NOUVEAUX VISAGES DE L'ACADEMIE DES SCIENCES]
Episode nΒ°1 : Anne Canteaut : une architecte de la cryptographie moderne
www.youtube.com/watch?v=xyGC...

01.04.2025 10:01 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

Futur best seller!

28.03.2025 08:08 β€” πŸ‘ 37    πŸ” 6    πŸ’¬ 2    πŸ“Œ 0
Post image

Characterizing finely the decay of eigenvalues of kernel matrices: many people need it, but explicit references are hard to find. This blog post reviews amazing asymptotic results from Harold Widom (1963!) and proposes new non-asymptotic bounds.
francisbach.com/spectrum-ker...

24.03.2025 14:26 β€” πŸ‘ 50    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

The must-read introduction to PAC-Bayes!

05.03.2025 04:30 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ”¬βœ¨ JournΓ©e des Femmes et des filles de scienceβœ¨πŸ”¬

Γ€ travers leurs parcours inspirants et leurs engagements, les acadΓ©miciennes, et toutes les femmes et filles de science faΓ§onnent la recherche d’aujourd’hui et de demain.

Pour aller plus loin : urls.fr/PK5xg9

11.02.2025 09:05 β€” πŸ‘ 48    πŸ” 26    πŸ’¬ 3    πŸ“Œ 4
Preview
Building Bridges between Regression, Clustering, and Classification Regression, the task of predicting a continuous scalar target y based on some features x is one of the most fundamental tasks in machine learning and statistics. It has been observed and...

Check out our paper, with Lawrence Stewart and @bachfrancis.bsky.social

Link: arxiv.org/abs/2502.02996

1/8

10.02.2025 12:00 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Preview
Behind the Term Sheet: Bioptimus’ $41M Series A The GPT of Biology Raising the Bar of AI-Driven Scientific Research

If you're curious of what is "behind a term sheet", don't miss this account by the Cathay innovation team who led our recent series A at Bioptimus

medium.com/cathay-innov...

09.02.2025 10:54 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

An inspirational talk by Michael Jordan: a refreshing, deep, and forward-looking vision for AI beyond LLMs.
www.youtube.com/live/W0QLq4q...

07.02.2025 07:56 β€” πŸ‘ 27    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Preview
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training We show that learning-rate schedules for large model training behave surprisingly similar to a performance bound from non-smooth convex optimization theory. We provide a bound for the constant schedul...

Learning rate schedules seem mysterious? Why is the loss going down so fast during cooldown?
Turns out that this behaviour can be described with a bound from *convex, nonsmooth* optimization.

A short thread on our latest paper 🚞

arxiv.org/abs/2501.18965

05.02.2025 10:13 β€” πŸ‘ 30    πŸ” 6    πŸ’¬ 2    πŸ“Œ 0
Post image Post image

Early stopping on validation loss? This leads to suboptimal calibration and refinement errorsβ€”but you can do better!
With @dholzmueller.bsky.social, Michael I. Jordan, and @bachfrancis.bsky.social, we propose a method that integrates with any model and boosts classification performance across tasks.

03.02.2025 13:03 β€” πŸ‘ 18    πŸ” 9    πŸ’¬ 4    πŸ“Œ 0
My book is (at last) out! – Machine Learning Research Blog

I've been eagerly awaiting this book for years! At last, a standalone and meticulous exposition of the current mathematical principles of machine learning, adorned with beautiful proofs. Well done and thank you,
@bachfrancis.bsky.social . Discover more here: francisbach.com/my-book-is-o...

05.01.2025 17:55 β€” πŸ‘ 15    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

A happy author discovering the first hard copies

21.12.2024 15:24 β€” πŸ‘ 96    πŸ” 16    πŸ’¬ 3    πŸ“Œ 1
Post image

My book is (at last) out, just in time for Christmas!
A blog post to celebrate and present it: francisbach.com/my-book-is-o...

21.12.2024 15:23 β€” πŸ‘ 140    πŸ” 35    πŸ’¬ 2    πŸ“Œ 3
Post image

My book is (at last) out, just in time for Christmas!
A blog post to celebrate and present it: francisbach.com/my-book-is-o...

21.12.2024 15:23 β€” πŸ‘ 140    πŸ” 35    πŸ’¬ 2    πŸ“Œ 3
Post image

Breaking news : L'AcadΓ©mie des sciences accueille 18 nouveaux membres dΓ¨s 2025, avec une majoritΓ© fΓ©minine pour la 1Γ¨re fois depuis 1666 πŸ‘©β€πŸ”¬ : un symbole fort pour la paritΓ© en science ! πŸ’₯

πŸ”— En savoir plus sur les nouveaux membres : urlr.me/mntDHX

17.12.2024 12:50 β€” πŸ‘ 17    πŸ” 7    πŸ’¬ 0    πŸ“Œ 1
| Laurent Pfeiffer

New opening! Post-doctoral position on relaxation methods for large-scale optimization and the management of electrical systems, in collaboration between EDF and Inria Saclay and Paris. See more details here: laurentpfeiffer.github.io/postdoc/

27.11.2024 07:11 β€” πŸ‘ 53    πŸ” 6    πŸ’¬ 2    πŸ“Œ 1