For more details about the work on learning uniformly accurate interatomic potentials from scratch I'll present in B1.36:
π Paper: www.nature.com/articles/s41...
π» Code: github.com/nec-research...
@viktorzaverkin.bsky.social
Research Scientist @ NEC Labs Europe, Ph.D. in Theoretical Chemistry @ SimTech & @unistuttgart.bsky.social, ML/DL for Chemistry & Materials Science
For more details about the work on learning uniformly accurate interatomic potentials from scratch I'll present in B1.36:
π Paper: www.nature.com/articles/s41...
π» Code: github.com/nec-research...
Or stop by poster B1.36 (Thu, Aug 28)!
#PsiK2025 #AI4Science
Iβll be at the Psi-k conference next week!
Letβs chat about ML potentials, uncertainty quantification (ensemble-free, gradient-based: Laplace approx., NTKs, batch selection, β¦), uncertainty-biased MD, message-passing architectures, particle-mesh long-range methods, etc.
π§΅ TL;DR:
β
Benchmark metrics improve with model size and electrostatics
β These gains don't always translate to improved simulation outcomes
β οΈ Training data & evaluation practices remain key bottlenecks
πPreprint: arxiv.org/abs/2508.10841
π»Code: github.com/nec-research...
-/
With @matheusfferraz.bsky.social, @falesiani.bsky.social, @mniepert.bsky.social
15/
Moreover, simulation results are sensitive to training data composition.
E.g., water density predictions depend on whether NaCl-water clusters were in the training set: compare ICTP-LR(M) vs. ICTP-LR(M)*.
Legend: Solid green - ICTP-LR(M); dashed green - ICTP-LR(M)*.
14/
Without DFT-level simulations or other baselines, it is difficult to assess to what extent universal ML potentials improve on classical FFs in realistic biomolecular settings.
While their qualitative advantages are often evident, quantitative validation remains challenging.
13/
These results highlight the limitations of current evaluation practices.
12/
In Trp-cage, simulations with explicit long-range electrostatics exhibit greater conformational variability.
However, the origin of these effects remains unclear without DFT-level simulations.
11/
For Crambin, no significant differences are observed for the vibrational spectrum.
10/
For Ala3, larger models better reproduce experimental J-couplings.
9/
For water and NaCl-water mixtures:
- Larger models don't consistently outperform smaller ones
- Increasing model size doesn't yield systematic convergence
- Explicit electrostatics shifts density predictions from overestimation to underestimation, without consistent gains.
8/
BUT: These improvements do not consistently translate into more accurate physical observables in simulations.
Densities, radial distribution functions, and conformational ensembles show inconsistent trends with model size and long-range electrostatics.
7/
As expected, benchmark metrics (e.g., energy & force RMSEs) systematically improve with increasing model size and the inclusion of explicit long-range interactions.
6/
We use DIMOS for our simulations:
πPreprint: arxiv.org/abs/2503.20541
π»Code: github.com/nec-research...
5/
We assess the impact of model size, dataset composition, and explicit long-range electrostatics across:
π Benchmark datasets
π§ Pure liquid water
π§ NaCl-water mixtures
𧬠Small peptides (blocked and cationic Ala3)
π§ͺ Small proteins (Trp-cage, Crambin)
4/
DFT-level simulations and other high-quality baselines are unavailable or infeasible for biomolecular systems.
A more reliable evaluation should consider how model expressivity (model size, explicit long-range interactions) affects prediction errors and simulation results.
3/
π Preprint: arxiv.org/abs/2508.10841
π» Code: github.com/nec-research...
2/
π¨ New preprint: How well do universal ML potentials perform in biomolecular simulations under realistic conditions?
There's growing excitement around ML potentials trained on large datasets.
But do they deliver in simulations of biomolecular systems?
Itβs not so clear. π§΅
1/
πPaper: pubs.acs.org/doi/abs/10.1... (but check also pubs.acs.org/doi/abs/10.1...)
π»Code(s): gitlab.com/zaverkin_v/g... (TensorFlow), github.com/nec-research... (PyTorch), and github.com/apax-hub/apax (JAX)
Many thanks to everyone who has read, cited, or built on it. I hope it continues to be helpful!
25.06.2025 14:42 β π 0 π 0 π¬ 1 π 0We proposed using full tensor contractions to construct many-body features, thereby avoiding expensive sums over triplets, quadruplets, and so on. I am thrilled to see that similar ideas are now an integral part of state-of-the-art architectures, such as MACE, CACE, and so on. πͺ
25.06.2025 14:42 β π 0 π 0 π¬ 1 π 0πMy first PhD paper just reached 100 citations, which is a small but very special milestone for me!
Our paper introduces Gaussian moments as molecular descriptors and uses them to build ML potentials with an impressive balance between accuracy and computational efficiency.
π Apply Now: International Master's Chemical Sciences! π
The application portal for the English-conducted M.Sc. Chemical Sciences at @unistuttgart.bsky.social are officially open! π¬β¨
π Visit our program website for further details: www.uni-stuttgart.de/en/study/stu...
I'll present our paper in the afternoon poster session at 4:30pm - 7:30 pm in East Exhibit Hall A-C, poster 3304!
12.12.2024 18:54 β π 3 π 2 π¬ 0 π 0Neural surrogates can accelerate PDE solving but need expensive ground-truth training data. Can we reduce the training data size with active learning (AL)? In our NeurIPS D3S3 poster, we introduce AL4PDE, an extensible AL benchmark for autoregressive neural PDE solvers. π§΅
11.12.2024 18:22 β π 12 π 3 π¬ 1 π 2Join us today at #NeurIPS2024 for our poster presentation:
Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing
ποΈ When: Wed, Dec 11, 11 a.m. β 2 p.m. PST
π Where: East Exhibit Hall A-C, Poster #4107
#MachineLearning #InteratomicPotentials #Equivariance #GraphNeuralNetworks
@falesiani.bsky.social
11.12.2024 01:20 β π 1 π 0 π¬ 0 π 0@takashimal.bsky.social
08.12.2024 11:03 β π 0 π 0 π¬ 1 π 0Good question! There is a certain connection between Cartesian and geometric products through those operations: geometric product for an n-dimensional vector could be seen as an "outer" product (up to n-rank tensors) + a "contraction". However, I don't know of any systematic study of the relation.
08.12.2024 10:23 β π 0 π 0 π¬ 0 π 0