Francis Bach

Francis Bach

@bachfrancis.bsky.social

Researcher in machine learning

2,271 Followers 14 Following 15 Posts Joined Nov 2024
6 days ago
Video thumbnail

Looking for alternatives to quadratic functions for closed-form analysis in optimization? This post explores matrix Riccati dynamics and their applications to neural networks. francisbach.com/closed-form-...

16 2 0 0
3 months ago
Post image Post image Post image

Still using temperature scaling?
With @dholzmueller.bsky.social, Michael I. Jordan and @bachfrancis.bsky.social we argue that with well designed regularization, more expressive models like matrix scaling can outperform simpler ones across calibration set sizes, data dimensions, and applications.

5 2 1 0
5 months ago
Frederik Kunstner

Together with the great Frederik Kunstner (fkunstner.github.io)

2 0 0 0
5 months ago
Video thumbnail

Not all scaling laws are nice power laws. This month’s blog post: Zipf’s law in next-token prediction and why Adam (ok, sign descent) scales better to large vocab sizes than gradient descent: francisbach.com/scaling-laws...

47 12 1 0
7 months ago
Post image

Avec en prime une photo du lac de Roselend.

4 0 0 0
7 months ago
Post image

Jamais deux sans trois. Le département d’informatique de l’ENS toujours en forme. 131 km, 4500m de dénivelé, cette fois-ci avec du Beaufort aux ravitaillements!
Bravo Olivier Cappé! #LEtapeduTour

18 0 1 0
7 months ago
Video thumbnail

Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/

24 5 0 0
7 months ago
Post image

Tired of lengthy computations to derive scaling laws? This post is made for you: discover the sharpness of the z-transform!
francisbach.com/z-transform/

19 4 0 0
7 months ago
Preview
A Collectivist, Economic Perspective on AI Information technology is in the midst of a revolution in which omnipresent data collection and machine learning are impacting the human world as never before. The word "intelligence" is being used as...

What if AI isn’t about building solo geniuses, but designing social systems?
Michael Jordan advocates blending ML, economics, and uncertainty management to prioritize social welfare over mere prediction.
A must-read rethink.
arxiv.org/abs/2507.062...

37 9 0 0
8 months ago

Big thanks to the COLT 2025 organizers for an awesome event in Lyon! Here are the slides from my keynote this morning in case you’re curious about the references I mentioned: www.di.ens.fr/~fbach/fbach...

19 2 0 0
9 months ago
Post image

Register at PAISS 1-5 Sept 2025 @inria_grenoble with very talented speakers this year 🙂
paiss.inria.fr
cc @mvladimirova.bsky.social

8 4 1 0
10 months ago
Post image

The PAISS summer school is back with an incredible line of speakers (and more to come). Spread the word !

23 11 1 3
10 months ago
Post image

Épisode 5 de notre série "Les nouveaux visages de l’Académie des sciences" : La statistique, une science en son temps.

Retrouvez la vidéo sur Youtube : www.youtube.com/watch?v=2AhT...

13 5 1 0
10 months ago
Mathematical Aspects of Data Science Graduate Summer School - EPFL - Sept. 1-5, 2025

Announcing : The 2nd International Summer School on Mathematical Aspects of Data Science
mathsdata2025.github.io
EPFL, Sept 1–5, 2025

Speakers:
Bach @bachfrancis.bsky.social
Bandeira
Mallat
Montanari
Peyré @gabrielpeyre.bsky.social

For PhD students & early-career researchers
Apply before May 15!

46 24 1 1
11 months ago

[NOUVELLE SERIE "LES NOUVEAUX VISAGES DE L'ACADEMIE DES SCIENCES]
Episode n°1 : Anne Canteaut : une architecte de la cryptographie moderne
www.youtube.com/watch?v=xyGC...

9 3 1 0
11 months ago
Post image

Futur best seller!

37 6 2 0
11 months ago
Post image

Characterizing finely the decay of eigenvalues of kernel matrices: many people need it, but explicit references are hard to find. This blog post reviews amazing asymptotic results from Harold Widom (1963!) and proposes new non-asymptotic bounds.
francisbach.com/spectrum-ker...

50 7 0 0
1 year ago

The must-read introduction to PAC-Bayes!

6 0 0 0
1 year ago

🔬✨ Journée des Femmes et des filles de science✨🔬

À travers leurs parcours inspirants et leurs engagements, les académiciennes, et toutes les femmes et filles de science façonnent la recherche d’aujourd’hui et de demain.

Pour aller plus loin : urls.fr/PK5xg9

48 26 3 4
1 year ago
Preview
Building Bridges between Regression, Clustering, and Classification Regression, the task of predicting a continuous scalar target y based on some features x is one of the most fundamental tasks in machine learning and statistics. It has been observed and...

Check out our paper, with Lawrence Stewart and @bachfrancis.bsky.social

Link: arxiv.org/abs/2502.02996

1/8

8 2 1 0
1 year ago
Preview
Behind the Term Sheet: Bioptimus’ $41M Series A The GPT of Biology Raising the Bar of AI-Driven Scientific Research

If you're curious of what is "behind a term sheet", don't miss this account by the Cathay innovation team who led our recent series A at Bioptimus

medium.com/cathay-innov...

6 1 0 0
1 year ago
Post image

An inspirational talk by Michael Jordan: a refreshing, deep, and forward-looking vision for AI beyond LLMs.
www.youtube.com/live/W0QLq4q...

27 1 2 0
1 year ago
Preview
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training We show that learning-rate schedules for large model training behave surprisingly similar to a performance bound from non-smooth convex optimization theory. We provide a bound for the constant schedul...

Learning rate schedules seem mysterious? Why is the loss going down so fast during cooldown?
Turns out that this behaviour can be described with a bound from *convex, nonsmooth* optimization.

A short thread on our latest paper 🚞

arxiv.org/abs/2501.18965

30 6 2 0
1 year ago
Post image Post image

Early stopping on validation loss? This leads to suboptimal calibration and refinement errors—but you can do better!
With @dholzmueller.bsky.social, Michael I. Jordan, and @bachfrancis.bsky.social, we propose a method that integrates with any model and boosts classification performance across tasks.

18 9 4 0
1 year ago
My book is (at last) out! – Machine Learning Research Blog

I've been eagerly awaiting this book for years! At last, a standalone and meticulous exposition of the current mathematical principles of machine learning, adorned with beautiful proofs. Well done and thank you,
@bachfrancis.bsky.social . Discover more here: francisbach.com/my-book-is-o...

15 1 0 0
1 year ago

A happy author discovering the first hard copies

96 16 3 1
1 year ago
Post image

My book is (at last) out, just in time for Christmas!
A blog post to celebrate and present it: francisbach.com/my-book-is-o...

139 35 2 3
1 year ago
Post image

My book is (at last) out, just in time for Christmas!
A blog post to celebrate and present it: francisbach.com/my-book-is-o...

139 35 2 3
1 year ago
Post image

Breaking news : L'Académie des sciences accueille 18 nouveaux membres dès 2025, avec une majorité féminine pour la 1ère fois depuis 1666 👩‍🔬 : un symbole fort pour la parité en science ! 💥

🔗 En savoir plus sur les nouveaux membres : urlr.me/mntDHX

17 7 0 1
1 year ago
| Laurent Pfeiffer

New opening! Post-doctoral position on relaxation methods for large-scale optimization and the management of electrical systems, in collaboration between EDF and Inria Saclay and Paris. See more details here: laurentpfeiffer.github.io/postdoc/

53 6 2 1