Ian Dunn's Avatar

Ian Dunn

@ian-dunn.bsky.social

PhD Candidate in Computational Biology @ University of Pittsburgh. Working on deep generative models for molecular structure. iandunn.io

450 Followers  |  222 Following  |  16 Posts  |  Joined: 14.11.2024  |  1.6886

Latest posts by ian-dunn.bsky.social on Bluesky

Preview
CACHE Challenge #1: Docking with GNINA Is All You Need We describe our winning submission to the first Critical Assessment of Computational Hit-Finding Experiments (CACHE) challenge. In this challenge, 23 participants employed a diverse array of structure...

😎 pubs.acs.org/doi/10.1021/...

22.09.2025 22:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
FlowMol3: Flow Matching for 3D De Novo Small-Molecule Generation A generative model capable of sampling realistic molecules with desired properties could accelerate chemical discovery across a wide range of applications. Toward this goal, significant effort has foc...

Preprint: arxiv.org/abs/2508.12629
Code: github.com/Dunni3/FlowMol

02.09.2025 19:10 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

The performance gains over previous FlowMol versions are due to 3 techniques which are cheap and architecture agnostic. We hypothesize that these techniques operate synergistically to reduce a common pathology in transport-based generative models.

02.09.2025 19:10 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

I'm excited to share FlowMol3! The 3rd (and final) version of our flow matching model for 3D de novo, small-molecule generation. FlowMol3 achieves state of the art performance over a broad range of evaluations while having β‰ˆ10x fewer parameters than comparable models.

02.09.2025 19:10 β€” πŸ‘ 10    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1
Video thumbnail

Our new preprint PharmacoForge: Pharmacophore Generation with Diffusion Models is out now! PharmacoForge quickly generates pharmacophores for a given protein pocket that identify key binding features and find useful compounds in a pharmacophore search. Check it out! πŸ§ͺ doi.org/10.26434/che...

27.05.2025 19:11 β€” πŸ‘ 21    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0
Post image

New "blogpost" from our lab, that got accepted at ICLR 2025! We compare an old MCMC method known as Sequential Monte Carlo to generative models trained on energy functions (iDEM/iEFM) and show that MCMC does better. Check it out here: rishalaggarwal.github.io/ebmvsmcmc/

31.03.2025 16:57 β€” πŸ‘ 4    πŸ” 4    πŸ’¬ 1    πŸ“Œ 2
Video thumbnail

Structural biology is in an era of dynamics & assemblies but turning raw experimental data into atomic models at scale remains challenging. @minhuanli.bsky.social and I present ROCKETπŸš€: an AlphaFold augmentation that integrates crystallographic and cryoEM/ET data with room for more! 1/14.

24.02.2025 12:22 β€” πŸ‘ 139    πŸ” 60    πŸ’¬ 6    πŸ“Œ 5

Thank you!

15.12.2024 21:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thanks Pat!

15.12.2024 21:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

MLSB + the AI4Science field are clearly outgrowing the ML conference workshop format

15.12.2024 18:38 β€” πŸ‘ 11    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

FlowMol at your fingertips! We just released a colab notebook to make using FlowMol super easy. Come chat with us tomorrow at @workshopmlsb ! #NeurIPS2024 πŸ§ͺ colab.research.google.com/github/Dunni...

15.12.2024 00:34 β€” πŸ‘ 19    πŸ” 5    πŸ’¬ 1    πŸ“Œ 1

Thanks Alex!

12.12.2024 03:18 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - Dunni3/FlowMol: Mixed continous/categorical flow-matching model for de novo molecule generation. Mixed continous/categorical flow-matching model for de novo molecule generation. - Dunni3/FlowMol

Our work is fully open-source and we invite feedback from the community. Code is available here: github.com/Dunni3/FlowMol

11.12.2024 21:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This opens a new set of questions, gives researchers a new way to quantify molecule quality, and the ability to test hypotheses as we further push de novo models to more faithfully match the distribution of real molecules.

11.12.2024 21:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

But that's not the whole story. We introduce methods to quantify molecule quality at the level of functional groups + ring systems. "Valid" generated molecules tend to contain significantly more reactive functional groups than in the training data.

11.12.2024 21:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We test a handful of discrete flow matching methods for 3D de novo molecule design and provide some explanations for their differing performance. The result of this is a version of FlowMol with CTMC flows that achieves SOTA validity with fewer learnable parameters.

11.12.2024 21:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

I'm presenting a new paper "Exploring Discrete Flow Matching for 3D De Novo Molecule Generation" at @workshopmlsb.bsky.social this week! More info in this thread but reach out if want to chat at NeurIPS about generative models or molecular design. arxiv.org/abs/2411.16644

11.12.2024 21:21 β€” πŸ‘ 51    πŸ” 12    πŸ’¬ 2    πŸ“Œ 3

Congrats!

10.12.2024 15:31 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
CACHE Challenge #1: Docking with GNINA Is All You Need We describe our winning submission to the first Critical Assessment of Computational Hit-Finding Experiments (CACHE) challenge. In this challenge, 23 participants employed a diverse array of structure...

Our paper describing our winning submission (tied with @olexandr.bsky.social) is out with some extra computational analysis of the predicted binding modes. We didn't do anything fancy (but the hits weren't that great either...).

pubs.acs.org/doi/10.1021/...

10.12.2024 12:26 β€” πŸ‘ 31    πŸ” 7    πŸ’¬ 0    πŸ“Œ 1

formal post coming soon :p

29.11.2024 00:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Here is how Boltz-1 (green), DynamicBind (magenta), and GNINA (blue) dock a collection of random molecules. GNINA, using a classical sampling algorithm (MCMC) hits all concave regions while the ML samplers have distinct preferences. Boltz is the most likely to induce a fit.

22.11.2024 18:27 β€” πŸ‘ 42    πŸ” 15    πŸ’¬ 0    πŸ“Œ 1

@ian-dunn is following 20 prominent accounts