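LLMs are overwhelmingly well suited to throwaway PoCs, but at a scale where you have a product in mind you need to keep the whole codebase consistent, and I suspect that's hard for LLMs, which still struggle with long context. What do you think? (I don't do the latter myself, so I'm speaking from imagination.)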
19.10.2025 05:20 — 👍 0 🔁 0 💬 0 📌 0
@han-b.bsky.social
Associate Professor@The Institute of Statistical Mathematics, working in ML theory https://hermite.jp/
Haha, even for those in Japan, the current political scene is highly unpredictable😂
12.10.2025 04:18 — 👍 1 🔁 0 💬 0 📌 0
This paper studies why Adam occasionally causes loss spikes, attributing them to the edge-of-stability (EOS) phenomenon. As seen in the figure, once EOS is hit (see b), a loss spike is triggered. An interesting experimental report!
arxiv.org/abs/2506.04805
A nice ICML 2025 paper that generalizes the notion of smoothness and proves GD convergence under it. The convergence proof is very transparent: descent lemma + telescoping + estimation of stepsize.
arxiv.org/abs/2412.11773
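For readers who haven't seen the "descent lemma + telescoping" pattern, here is a rough sketch for the classical case. Note the assumptions: this uses standard L-smoothness with the fixed stepsize η = 1/L and a finite lower bound f^*, not the generalized smoothness notion or the stepsize estimate from the paper.

```latex
% Assumption: f is L-smooth (classical notion), bounded below by f^*,
% and GD runs as x_{k+1} = x_k - \eta \nabla f(x_k) with \eta = 1/L.
\begin{align*}
% Descent lemma applied to one GD step
f(x_{k+1}) &\le f(x_k) - \eta\Bigl(1 - \tfrac{L\eta}{2}\Bigr)\|\nabla f(x_k)\|^2
            = f(x_k) - \tfrac{1}{2L}\|\nabla f(x_k)\|^2, \\
% Telescoping over k = 0, \dots, K-1
\tfrac{1}{2L}\sum_{k=0}^{K-1}\|\nabla f(x_k)\|^2 &\le f(x_0) - f(x_K) \le f(x_0) - f^*, \\
% Rate for the best iterate
\min_{0 \le k < K}\|\nabla f(x_k)\|^2 &\le \frac{2L\,\bigl(f(x_0) - f^*\bigr)}{K}.
\end{align*}
```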
Yesterday I learned from a pharmacologist that integrating surrogate models with pharmacokinetics (PK) is an emerging direction. For example, this (jpharmsci.org/article/S002... ). I just wonder if we can refine it with a PINN-like approach.
26.09.2025 07:08 — 👍 3 🔁 0 💬 0 📌 0
Thrilled to share our paper is accepted to #NeurIPS2025 as a Spotlight! 🎉 Big thanks to my awesome collaborators and the program committee. See you in San Diego!
22.09.2025 05:19 — 👍 8 🔁 1 💬 1 📌 0
Why don't we adopt an ACL Findings-type publication model at ML venues? I don't see any strong reason to be reluctant. I know borderline papers sometimes do not provide "surprising" insights, but this ought to be judged by the test of time.
20.09.2025 06:20 — 👍 3 🔁 0 💬 0 📌 0
This is the metareview
18.09.2025 18:50 — 👍 21 🔁 2 💬 3 📌 2
This is not OK.
I don't submit often to NeurIPS, but I have reviewed papers for this conference almost every year. As a reviewer, why would I spend time trying to give a fair opinion on papers if this is what happens in the end???
Thank you, and the same to you! It is truly well-deserved to be selected as a spotlight!
18.09.2025 21:56 — 👍 1 🔁 0 💬 0 📌 0
I'm so happy that my paper has been accepted to JMLR after more than a year! It took a very long time…
18.09.2025 12:16 — 👍 7 🔁 0 💬 4 📌 0
Congratulations! I look forward to your continued success.
(I was just getting curious about Strogatz myself)
2) Metastability of Lorenz attractor
Metastability can even be seen in the classical Lorenz system. Here's ρ=23.2. Yorke & Yorke (1979) fit least squares to the so-called "peak return map", and (semi-)analytically derived the transient time.
link.springer.com/content/pdf/...
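To make the "peak return map" concrete, here is a minimal sketch that integrates the Lorenz system at ρ=23.2 and pairs each local maximum of z with the next one. The values σ=10 and β=8/3 are the classical choices and are my assumption (the post only gives ρ), as are the initial condition, step size, and the helper names `lorenz`, `rk4_step`, and `z_peaks`. It only builds the map; it does not reproduce the least-squares fit or the transient-time estimate of Yorke & Yorke.

```python
def lorenz(state, sigma=10.0, rho=23.2, beta=8.0 / 3.0):
    """Right-hand side of the Lorenz system (sigma, beta assumed classical)."""
    x, y, z = state
    return (sigma * (y - x), x * (rho - z) - y, x * y - beta * z)


def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step for an autonomous ODE."""
    def shifted(k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))

    k1 = f(state)
    k2 = f(shifted(k1, 0.5 * dt))
    k3 = f(shifted(k2, 0.5 * dt))
    k4 = f(shifted(k3, dt))
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def z_peaks(state, dt=0.01, steps=20000):
    """Integrate the system and return the successive local maxima of z(t)."""
    zs = []
    for _ in range(steps):
        state = rk4_step(lorenz, state, dt)
        zs.append(state[2])
    return [zs[i] for i in range(1, len(zs) - 1) if zs[i - 1] < zs[i] >= zs[i + 1]]


# Hypothetical initial condition; the peak return map is the set of
# pairs (n-th peak of z, (n+1)-th peak of z).
peaks = z_peaks((1.0, 1.0, 1.0))
return_map = list(zip(peaks[:-1], peaks[1:]))
print(len(peaks), return_map[:3])
```

Plotting `return_map` as a scatter plot gives the curve to which a least-squares fit could be applied.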
More recently, Otto & Reznikoff (2006) studied metastability in a general setup. They considered gradient-flow-based systems and showed: if the energy function is δ-Lipschitz and satisfies a variant of KL-type inequality, the system is metastable for t=O(1/δ)!
webdoc.sub.gwdg.de/ebook/serien...
The A-C eq. describes dynamics based on a reaction-diffusion system. It shows a "metastability" phenomenon: the system state stays trapped near the initial condition for quite a long time, so that it looks like a stable state, and then exhibits a drastic transition at some point. See the first fig.
15.09.2025 05:22 — 👍 1 🔁 0 💬 1 📌 0
This weekend I attended a workshop among mathematicians, where I discussed dynamical systems. During the workshop I picked up numerous insights that were new to me, which I want to briefly share.
1) metastability in Allen-Cahn equation
(fig based on people.maths.ox.ac.uk/trefethen/pd...)
Today's favorite: "Clustering with Bregman Divergences: an Asymptotic Analysis" (Liu & Belkin, 2016)
proceedings.neurips.cc/paper_files/...
It's concerned with the limiting distribution of k-means (or Bregman) centroids (w/ n, k -> \infty). This is an escort distribution! (maybe overlooked)
Today I learned that the continuous-time limit of Nesterov's accelerated gradient is Bessel's differential equation, which can be solved analytically. That's an unexpectedly beautiful result to me...
web.stanford.edu/~boyd/papers...
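A rough sketch of where the Bessel equation comes from. Assumptions on my part: that the ODE meant here is the second-order limit from Su, Boyd & Candès, and that the objective is a one-dimensional quadratic f(x) = λx²/2 with λ > 0. The substitution Y = tX and the symbol u are introduced only for this illustration.

```latex
\begin{align*}
% Continuous-time limit of Nesterov's method (assumed form)
&\ddot{X}(t) + \frac{3}{t}\,\dot{X}(t) + \nabla f\bigl(X(t)\bigr) = 0,
  \qquad X(0) = x_0,\ \dot{X}(0) = 0, \\
% Quadratic objective f(x) = \lambda x^2 / 2
&\ddot{X} + \frac{3}{t}\,\dot{X} + \lambda X = 0, \\
% Substituting Y = tX and u = \sqrt{\lambda}\, t gives Bessel's equation of order 1
&u^2\,\frac{d^2 Y}{du^2} + u\,\frac{dY}{du} + (u^2 - 1)\,Y = 0, \\
% The solution bounded at t = 0 and matching X(0) = x_0
&X(t) = \frac{2 x_0}{\sqrt{\lambda}\, t}\, J_1\bigl(\sqrt{\lambda}\, t\bigr).
\end{align*}
```

Here J_1 is the Bessel function of the first kind of order one. Since J_1(u)/u → 1/2 as u → 0, the formula indeed gives X(0) = x_0.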
The area west of Karasuma is peaceful on the whole.
19.08.2025 00:27 — 👍 1 🔁 0 💬 1 📌 0
3) Angkor Wat, of course (in daylight and at sunrise)
10.08.2025 09:19 — 👍 3 🔁 0 💬 0 📌 0
2) Kulen Mountain
10.08.2025 09:17 — 👍 1 🔁 0 💬 1 📌 0
I was on a three-day vacation in Siem Reap 🇰🇭
Here are a few of my favorite views:
1) Baphuon
I participated in the Deep Learning Theory Workshop 2025 held at RIKEN AIP. I enjoyed lively discussions among the participants.
delta-workshop.github.io/deep-learnin...
I thought IIIT Hyderabad and IIIT Delhi were branches of the same IIIT (not IIT, to be clear!), but it turns out that the first "I" is different in each!
07.08.2025 06:22 — 👍 3 🔁 0 💬 0 📌 0
I've recently been learning MPEC (mathematical programming with equilibrium constraints), which is basically bilevel programming. Since MPEC is known to violate LICQ (I hadn't realized this before), it's much harder. Interestingly, MPEC versions of LICQ etc. are known instead.
link.springer.com/article/10.1...
AISTATS 2026 will be in Morocco!
30.07.2025 08:07 — 👍 35 🔁 11 💬 0 📌 0
I went last year, right before MLSS Okinawa! It was nearly 30 years after this film, but some filming locations (including the tango bar) still exist. But Hong Kong must be developing way faster...
26.07.2025 23:54 — 👍 1 🔁 0 💬 0 📌 0
I really love this film! Actually this film finally brought me to Argentina 😂
26.07.2025 14:35 — 👍 1 🔁 0 💬 1 📌 0
Thanks, I’m gonna keep learning!
23.07.2025 07:19 — 👍 1 🔁 0 💬 0 📌 0
Thanks! It's been a while since I took any kind of exam, so I'm pretty happy.
22.07.2025 11:30 — 👍 0 🔁 0 💬 1 📌 0