Many more details on derivation, intuition, implementation details and proofs can be found in our paper arxiv.org/pdf/2507.08956 with the amazing Zhenghan Fang, Sam Buchanan and Mateo Diaz @ @jhu.edu @hopkinsdsai.bsky.social
22.07.2025 19:25 β π 0 π 0 π¬ 0 π 0
2) In theory, we show that prox-diff sampling requires only O(d/sqrt(eps)) to produce a distribution epsilon-away (in KL) from a target one (assuming oracle prox), faster than the score version (resp. O(d/eps)). Technical assumptions differ in all papers though, so exact comparison is hard (10/n)
22.07.2025 19:25 β π 1 π 0 π¬ 1 π 0
1) In practice, ProxDM can provide samples from the data distribution much faster than comparable methods based on the score like DDPM from (Ho et al, 2020) and even comparable to their ODE alternatives (which are much faster) (9/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
In this way, we generalize the Proximal Matching Loss from (Fang et al, 2024) to learn time-specific proximal operators for the densities at each discrete time. The result is Proximal Diffusion Models: sampling by using proximal operators instead of the score. This has 2 main advantages: (8/n)
22.07.2025 19:25 β π 1 π 0 π¬ 1 π 0
So, in order to implement a Proximal/backward version of diffusion models, we need a (cheap!) way of solving this optimization problem, i.e. computing the proximal of the log densities at every single time step. If only there was a way⦠oh, in come Learned Proximal Networks (7/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
What are proximal operators? You can think of them as generalizations of projection operators. For a given (proximable) functional \rho(x), its proximal is defined by the solution of a simple optimization problem: (6/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
Backward discretization of diff. eqs. has been long studied (c.f. gradient descent vs proximal point method). Letβs go ahead and discretize the same SDE, but backwards! One problem: the update is defined implicitly... But it does admit a close form expression in terms of proximal operators! (5/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
Crucially, this step relies on being able to compute the score function. Luckily, Minimum Mean Squared Estimate (MMSE) denoisers can do just that (at least asymptotically). But, couldnβt there be a different discretization strategy for this SDE, you ask? Great question! Let's go *back*... (4/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
While elegant in continuous time, one needs to discretize the SDE to implement it in practice. In DF, this has always been done through forward discretization (e.g Euler-Maruyama), which combines a gradient step of the data distribution at the discrete time t (the *score*), and Gaussian noise: (3/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
First, a (very) brief overview of diffusion models (DM). DM work by simulating a process that converts samples from a distribution (random noise) to samples from target distribution of interest. This process is modeled mathematically with a stochastic differential equation (SDE) (2/n)
22.07.2025 19:25 β π 0 π 0 π¬ 1 π 0
Check this out π’ Score-based diffusion models are powerfulβbut slow to sample. Could there be something better? Drop the scores, use proximals instead!
We present Proximal Diffusion Models, providing a faster alternative both in theory* and practice. Hereβs how it works π§΅(1/n)
22.07.2025 19:25 β π 1 π 0 π¬ 1 π 0
Awesome to see our cover in @cp-patterns.bsky.social finally out! And kudos go to Zhenzhen Wang for her massive work on biomarker discovery for breast cancer
www.cell.com/patterns/ful...
14.03.2025 16:09 β π 2 π 1 π¬ 0 π 0
Today, on #WomenInScience day, this paper on biomarker discovery for breast cancer, by my amazing student Zhenzhen, has just appeared in @cp-patterns.bsky.social
π Her work shows how to construct fully interpretable biomarkers employing bi-level graph learning! @jhu.edu @hopkinsdsai.bsky.social
12.02.2025 02:30 β π 2 π 1 π¬ 0 π 1
Wanna bet? Testing conceptual importance for more explainable AI
Johns Hopkins researchers used betting strategies to help clarify AI modelsβ decision-making processes.
Nice write-up by @JHUCompSci about @JacopoTeneggi's work. Puch-line: interpretability of opaque ML models can be posed as hypothesis tests, for which online (efficient) testing procedures can be derived! www.cs.jhu.edu/news/wanna-b...
13.12.2024 17:47 β π 1 π 0 π¬ 0 π 0
π£ What should *ML explanations* convey, and how does one report these precisely and rigorously? @neuripsconf.bsky.social
come check
Jacopo Teneggi's work on Testing for Explanations via betting this afternoon! I *bet* you'll like it :) openreview.net/pdf?id=A0HSm... @hopkinsdsai.bsky.social
11.12.2024 18:12 β π 1 π 0 π¬ 1 π 0
realSEUDO for real-time calcium imaging analysis
Closed-loop neuroscience experimentation, where recorded neural activity is used to modify the experiment on-the-fly, is critical for deducing causal connections and optimizing experimental time. A cr...
NeurIPS paper: Excited for our work (with Iuliia Dmitrieva+Sergey Babkin) on
"realSEUDO for real-time calcium imaging analysis"
arxiv.org/abs/2405.15701
to be presented tomorrow (Thu 4:30-7:30PM). realSEUDO is a fully on-line method for cell detection and activity estimation that runs at >30Hz.
11.12.2024 14:23 β π 3 π 2 π¬ 1 π 0
A National Center devoted to developing MRI-based biomarkers to assess brain disease (P41 EB031771)
Visit us at: www.kennedykrieger.org/physiologic-metabolic-anatomic-biomarkers
Assistant Professor at UC San Diego
Bloomberg Distinguished Professor (SAIS & Carey Business School), Johns Hopkins University. Political economy, Brazil, and a little bit of futebol.
π²π½ β’ Postdoc at the Kuchibhotla lab
Waitress turned Congresswoman for the Bronx and Queens. Grassroots elected, small-dollar supported. A better world is possible.
ocasiocortez.com
John C Malone Professor at Johns Hopkins Computer Science, Center for Language and Speech Processing, Malone Center for Engineering in Healthcare.
Parttime: Bloomberg LP #nlproc
Professor at Johns Hopkins Biomedical Engineering and CS, Director Malone Center for Engineering in Healthcare, Research Director (Strategy and Partnerships) Data Science and AI Institute
Johns Hopkins Chemical & Biomolecular Engineering Professor, Co-Director of RosettaCommons, protein engineer, Baltimore resident, father of two. he/him
Research Saves Lives
Johns Hopkins Whiting School of Engineering
Baltimore, Maryland
https://engineering.jhu.edu
A peer-reviewed #openaccess data science journal from @cellpress.bsky.social
Editor-in-Chief: Andrew L Hufton (@alhufton.bsky.social)
Visit us online at https://www.cell.com/patterns/
Associate Professor at University of Pennsylvania
Americaβs first research university. Leading discovery and sharing knowledge to better the world since 1876. With campuses & centers in Baltimore & around the world.
San Diego Dec 2-7, 25 and Mexico City Nov 30-Dec 5, 25. Comments to this account are not monitored. Please send feedback to townhall@neurips.cc.
neuroscientist (postdoc). motor learning and control. sleep. memory consolidation. π¦π· CONICET - UNSAM
Bioengineer (UNER)
looking for a better place for interacting
source: https://arxiv.org/rss/stat.ML
maintainer: @tmaehara.bsky.social
Distinguished Scientist at Google. Computational Imaging, Machine Learning, and Vision. Posts are personal opinions. May change or disappear over time.
http://milanfar.org
Research Scientist at Google. All opinions are my own (π). πΊπΎ
PhD student in Computer Science @UCSD. Studying interpretable AI and RL to improve people's decision-making.
Assistant Prof at UCSD. I work on safety, interpretability, and fairness in machine learning. www.berkustun.com