Han Bao's Avatar

Han Bao

@han-b.bsky.social

Associate Professor@The Institute of Statistical Mathematics, working in ML theory https://hermite.jp/

52 Followers  |  38 Following  |  92 Posts  |  Joined: 15.12.2024  |  1.8713

Latest posts by han-b.bsky.social on Bluesky

書き捨てのPoCはLLMが圧倒的に向いている一方で、プロダクト意識する規模だとコードベース全体の一貫性を保つ必要があって、long context にまだ難のあるLLMだとそれが難しいのかなと思ってるんですけど、どうですか(僕は後者をやらないので想像で喋っている)

19.10.2025 05:20 — 👍 0    🔁 0    💬 0    📌 0

Haha, even for those in Japan, the current political scene is highly unpredictable😂

12.10.2025 04:18 — 👍 1    🔁 0    💬 0    📌 0
Post image

This paper studies why Adam occasionally causes loss spikes, which is attributed to the edge of stability phenomenon. As seen from the figure, once hitting EOS (see b) a loss spike is triggered. An interesting experimental report!

arxiv.org/abs/2506.04805

10.10.2025 07:55 — 👍 5    🔁 1    💬 0    📌 0
Post image

A nice paper in ICML2025, generalizing the smoothness notion, for which GD convergence is provided. The convergence proof is very transparent: descent lemma + telescoping + estimation of stepsize.
arxiv.org/abs/2412.11773

06.10.2025 22:47 — 👍 2    🔁 0    💬 0    📌 0
Preview
Prediction of Human Pharmacokinetics From Chemical Structure: Combining Mechanistic Modeling with Machine Learning Pharmacokinetics (PK) is the result of a complex interplay between compound properties and physiology, and a detailed characterization of a molecule's PK during preclinical research is key to understa...

Yesterday I learned from a pharmacologist that an integration of surrogate models and pharmacokinetics (PK) has been emerging. For example this (jpharmsci.org/article/S002... ). I just wonder if we can refine it by PINN-like approach.

26.09.2025 07:08 — 👍 3    🔁 0    💬 0    📌 0

Thrilled to share our paper is accepted to #NeurIPS2025 as a Spotlight! 🎉 Big thanks to my awesome collaborators and the program committee. See you in San Diego!

22.09.2025 05:19 — 👍 8    🔁 1    💬 1    📌 0

Why don't we implement ACL Findings type publication models to the ML venues? I don't see any outstanding reason to be reluctant. I know borderline papers sometimes do not provide "surprising" insights, but this ought to be judged by the test of time.

20.09.2025 06:20 — 👍 3    🔁 0    💬 0    📌 0
Post image

This is the metareview

18.09.2025 18:50 — 👍 21    🔁 2    💬 3    📌 2

This is not OK.

I don't submit often to NeurIPS, but I reviewed papers for this conference almost every year. As a reviewer, why would I spend time trying to give a fair opinion on papers if it's what happens in the end???

20.09.2025 06:10 — 👍 51    🔁 11    💬 3    📌 1

Thank you, and the same to you! It is truly well-deserved to be selected as a spotlight!

18.09.2025 21:56 — 👍 1    🔁 0    💬 0    📌 0

Je suis tellement content que mon papier est accepté à JMLR après plus d’un an! C’était très long…

18.09.2025 12:16 — 👍 7    🔁 0    💬 4    📌 0

おめでとうございます!今後とも活躍を楽しみにしています。
(僕もちょうどストロガッツ気になってるとこでした)

18.09.2025 06:02 — 👍 1    🔁 0    💬 1    📌 0
Post image

2) Metastability of Lorenz attractor
Metastability can even seen in the classical Lorentz system. Here's ρ=23.2. Yorke & Yorke (1979) fit least squares to the so-called "peak return map", and (semi-)analytically derived the transient time.

link.springer.com/content/pdf/...

15.09.2025 05:22 — 👍 1    🔁 0    💬 0    📌 0

Recently, Otto & Reznikoff (2006) studied the metastability in a general setup. They consider gradient-flow-based systems and demonstrated: if the energy function is δ-Lipschitz and satisfies a variant of KL-type inequality, the system is metastable for t=O(1/δ)!

webdoc.sub.gwdg.de/ebook/serien...

15.09.2025 05:22 — 👍 1    🔁 0    💬 1    📌 0

A-C eq. is dynamics based on a reaction-diffusion system. It has a "metastability" phenomenon, i.e., the system state is trapped in an initial condition for a quite long time, which is apparently the stable state, but exhibits a drastic transient behavior at some time. See the first fig.

15.09.2025 05:22 — 👍 1    🔁 0    💬 1    📌 0
Post image

In this weekend I attended a workshop among mathematicians, where I was discussing dynamical systems. During the workshop, there are numerous novel insights to me, which I want to briefly share.

1) metastability in Allen-Cahn equation

(fig based on people.maths.ox.ac.uk/trefethen/pd...)

15.09.2025 05:22 — 👍 4    🔁 0    💬 1    📌 0
Post image

Today's my favorites: "Clustering with Bregman Divergences: an Asymptotic Analysis" (Liu & Belkin, 2016)
proceedings.neurips.cc/paper_files/...

It's concerned with the limiting distribution of k-means (or Bregman) centroids (w/ n, k -> \infty). This is an escort distribution! (maybe overlooked)

02.09.2025 22:44 — 👍 6    🔁 1    💬 0    📌 0
Post image

Today I learned that the continuous-time limit of Nesterov's accelerated gradient is Bessel's differential equation, which can be solved analytically. That's unexpectedly a beautiful result to me...

web.stanford.edu/~boyd/papers...

28.08.2025 14:20 — 👍 9    🔁 4    💬 0    📌 0

烏丸より西側は総じて平和ですよ

19.08.2025 00:27 — 👍 1    🔁 0    💬 1    📌 0
Post image Post image

3) Angkor Wat, of course (in daylight and sunrise each)

10.08.2025 09:19 — 👍 3    🔁 0    💬 0    📌 0
Post image

2) Kulen Mountain

10.08.2025 09:17 — 👍 1    🔁 0    💬 1    📌 0
Post image

I was on a three-day vacation in Siem Reap 🇰🇭
Here are a few views of my favorites:

1) Baphuon

10.08.2025 09:16 — 👍 4    🔁 0    💬 1    📌 0
Deep Learning Theory Workshop 2025

I participated in Deep Learning Theory Workshop 2025 held in RIKEN AIP. I enjoyed lively discussions among the participants.
delta-workshop.github.io/deep-learnin...

07.08.2025 10:16 — 👍 5    🔁 0    💬 0    📌 0

I thought IIIT Hyderabad and IIIT Delhi are branches of IIIT (not IIT to make sure!), but it turns out to be that the first I are different!

07.08.2025 06:22 — 👍 3    🔁 0    💬 0    📌 0
Preview
Abadie-Type Constraint Qualification for Mathematical Programs with Equilibrium Constraints - Journal of Optimization Theory and Applications Mathematical programs with equilibrium constraints (MPEC) are nonlinear programs which do not satisfy any of the common constraint qualifications (CQ). In order to obtain first-order optimality condit...

I'm recently learning MPEC (mathematical programming with equilibrium constraints), which is basically a bilevel programming. Since MPEC is known to violate LICQ (I didn't realize it before), it's much harder. Interestingly, MPEC-ver of LICQ etc. is known instead.
link.springer.com/article/10.1...

04.08.2025 11:09 — 👍 2    🔁 0    💬 0    📌 0

AISTATS 2026 will be in Morocco!

30.07.2025 08:07 — 👍 35    🔁 11    💬 0    📌 0

I went last year, right before MLSS Okinawa! Indeed it was nearly 30 years after this film, but some filming locations (including the tango bar) still exist. But Hong-Kong must be developing way faster...

26.07.2025 23:54 — 👍 1    🔁 0    💬 0    📌 0

I really love this film! Actually this film finally brought me to Argentina 😂

26.07.2025 14:35 — 👍 1    🔁 0    💬 1    📌 0

Thanks, I’m gonna keep learning!

23.07.2025 07:19 — 👍 1    🔁 0    💬 0    📌 0

ありがとう!試験もの久しぶりでかなり嬉しい

22.07.2025 11:30 — 👍 0    🔁 0    💬 1    📌 0

@han-b is following 19 prominent accounts