CSML IIT Lab

@pontilgroup.bsky.social

Computational Statistics and Machine Learning (CSML) Lab | PI: Massimiliano Pontil | Webpage: csml.iit.it | Active research lines: Learning theory, ML for dynamical systems, ML for science, and optimization.

613 Followers  |  15 Following  |  34 Posts  |  Joined: 24.11.2024

Almost 5 years in the making... "Hyperparameter Optimization in Machine Learning" is finally out! 📘

We designed this monograph to be self-contained, covering: Grid, Random & Quasi-random search, Bayesian & Multi-fidelity optimization, Gradient-based methods, Meta-learning.

arxiv.org/abs/2410.22854
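
As a rough illustration of the two simplest families listed above, here is a minimal sketch contrasting grid search with random search. It is not taken from the monograph: `validation_loss` is a hypothetical stand-in for training a model and scoring it on held-out data, and the search ranges are assumptions chosen only for the example.

```python
import itertools
import random

def validation_loss(lr, reg):
    """Hypothetical objective standing in for 'train, then evaluate on validation data'."""
    return 1e3 * (lr - 0.03) ** 2 + (reg - 0.5) ** 2

def grid_search(lrs, regs):
    """Evaluate every combination on a fixed grid and keep the best configuration."""
    return min(itertools.product(lrs, regs), key=lambda cfg: validation_loss(*cfg))

def random_search(n_trials, seed=0):
    """Sample configurations independently and keep the best one."""
    rng = random.Random(seed)
    candidates = [
        (10 ** rng.uniform(-4, 0), rng.uniform(0.0, 1.0))  # log-uniform lr, uniform reg
        for _ in range(n_trials)
    ]
    return min(candidates, key=lambda cfg: validation_loss(*cfg))

# Same budget of 9 evaluations for both strategies.
best_grid = grid_search([1e-3, 1e-2, 1e-1], [0.0, 0.5, 1.0])
best_random = random_search(n_trials=9)
```

With the same budget, the grid only ever tries three distinct learning rates, while random search tries nine; which one wins on a given problem depends on the objective and the seed.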

17.12.2025 09:54
Hyperparameter Optimization in Machine Learning Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values ...

🚨 OpenReview might have leaked names, but it won't leak the best hyperparameters, unfortunately! 😅

Tired of the drama? Solve your HPO problems before the ICML deadline with this new monograph by our own Luca Franceschi & Massimiliano Pontil (& colleagues).

arxiv.org/abs/2410.22854

28.11.2025 17:34

He will also present an entropy-respecting forward–backward learning scheme that mitigates the inherent ill-posedness of stochastic learning problems.

Join us for what promises to be a very insightful session!

14.11.2025 14:03

In this talk, Arthur Bizzi will introduce Neural Kolmogorov Equations, a deterministic and parallelizable framework for learning continuous-time stochastic processes using Forward and Backward Kolmogorov Equations.

14.11.2025 14:03

Abstract:
Learning differential equations becomes substantially more challenging in the presence of stochasticity, as Neural SDEs typically require expensive, sequential integration during training.

14.11.2025 14:03

📢 Upcoming Talk at Our Lab

We’re excited to host Arthur Bizzi from EPFL for a research talk next week!

Title: Towards Neural Kolmogorov Equations: Parallelizable SDE Learning with Neural PDEs

🗓 Date: November 19
⏰ Time: 16:00 CET
📍 Galileo Sala, CHT @iitalk.bsky.social

14.11.2025 14:03

Excited to share our group’s latest work at #AISTATS2025! 🎓
Tackling concentration in dependent data settings with empirical Bernstein bounds for Hilbert space-valued processes.
📍 Catch the poster tomorrow!

See the original tweet for details!

02.05.2025 18:36

DeltaProduct is here! Achieve better state tracking through highly parallel execution. Explore more! 🚀

09.04.2025 10:11
Slow dynamical modes from static averages In recent times, efforts are being made at describing the evolution of a complex system not through long trajectories, but via the study of probability distribution evolution. This more collective app...

[P11] (submitted to The Journal of Chemical Physics)
chemrxiv.org/engage/chemr...

Kooplearn library:
kooplearn.readthedocs.io/latest/

For the longer version of the thread, you can take a look at this blog post:
vladi-iit.github.io/posts/2024-1...

15.01.2025 14:34
Learning Dynamical Systems via Koopman Operator Regression in Reproducing Kernel Hilbert Spaces We study a class of dynamical systems modelled as Markov chains that admit an invariant distribution via the corresponding transfer, or Koopman, operator. While data-driven algorithms to reconstruct s...

Publications:
[P1] NeurIPS 2022
arxiv.org/abs/2205.14027

[P2] NeurIPS 2023
arxiv.org/abs/2302.02004

[P3] ICML 2024
arxiv.org/abs/2312.13426

[P4] NeurIPS 2023
arxiv.org/abs/2306.04520

[P5] ICLR 2024
arxiv.org/abs/2307.09912

[P6] NeurIPS 2024
arxiv.org/abs/2405.12940

15.01.2025 14:34

14/ Looking ahead, we’re excited to tackle new challenges:
• Learning from partial observations
• Modeling non-time-homogeneous dynamics
• Expanding applications in neuroscience, genetics, and climate modeling

Stay tuned for groundbreaking updates from our team! 🌍

15.01.2025 14:34

🙏 Collaborations with the Dynamic Legged Systems group led by Claudio Semini and the Atomistic Simulations group led by Michele Parrinello enriched our research, resulting in impactful works like [P9, P10] and [P7, P11].

15.01.2025 14:34

12/ This journey wouldn’t have been possible without the inspiring collaborations that shaped our work.

🌟 Special thanks to Karim Lounici from École Polytechnique, whose insights were a major driving force behind many projects.

15.01.2025 14:34
Predicting the quantiles for opening/closing of the Chignolin protein in the next simulation step

11/ One of our most exciting results:
[P8] NeurIPS 2024 proposed Neural Conditional Probability (NCP) to efficiently learn conditional distributions. It simplifies uncertainty quantification and guarantees accuracy for nonlinear, high-dimensional data.

15.01.2025 14:34

10/ [P7] NeurIPS 2024 developed methods to discover slow dynamical modes in systems like molecular simulations. This is transformative for studying rare events and costly data acquisition scenarios in atomistic systems.

15.01.2025 14:34

9/ Addressing continuous dynamics:
[P6] NeurIPS 2024 introduced a physics-informed framework for learning Infinitesimal Generators (IG) of stochastic systems, ensuring robust spectral estimation.

15.01.2025 14:34

8/ 🌟 Representation learning takes center stage in:
[P5] ICLR 2024
We combined neural networks with operator theory via Deep Projection Networks (DPNets). This approach enhances robustness, scalability, and interpretability for dynamical systems.

15.01.2025 14:34
Free energy surface of Chignolin protein folding

7/ 📈 Scaling up:
[P4] NeurIPS 2023 introduced a Nyström sketching-based method to reduce computational costs from cubic to almost linear without sacrificing accuracy. Validated on massive datasets like molecular dynamics, see figure.
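
To give a feel for where the savings come from, here is a minimal sketch of Nyström sketching in the simpler setting of kernel ridge regression. It is not the estimator from [P4]; the function names, the Gaussian kernel, and the toy data are all assumptions made for illustration. The point is that only an n x m and an m x m kernel block are ever formed, with m much smaller than n.

```python
import numpy as np

def gaussian_kernel(A, B, lengthscale=1.0):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    sq_dists = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * lengthscale**2))

def nystrom_krr_fit(X, y, m, reg=1e-3, seed=0):
    """Kernel ridge regression restricted to m randomly chosen landmark points."""
    rng = np.random.default_rng(seed)
    landmarks = X[rng.choice(len(X), size=m, replace=False)]
    K_nm = gaussian_kernel(X, landmarks)          # n x m, never the full n x n matrix
    K_mm = gaussian_kernel(landmarks, landmarks)  # m x m
    # Normal equations of the sketched problem: O(n m^2 + m^3) instead of O(n^3).
    A = K_nm.T @ K_nm + len(X) * reg * K_mm
    alpha = np.linalg.lstsq(A, K_nm.T @ y, rcond=None)[0]
    return landmarks, alpha

def nystrom_krr_predict(landmarks, alpha, X_new):
    """Evaluate the fitted function at new points using only the landmarks."""
    return gaussian_kernel(X_new, landmarks) @ alpha

# Toy data (assumed for the example): noisy samples of sin(x).
rng = np.random.default_rng(1)
X = rng.uniform(-3.0, 3.0, size=(500, 1))
y = np.sin(X[:, 0]) + 0.05 * rng.standard_normal(500)
landmarks, alpha = nystrom_krr_fit(X, y, m=30)
X_test = np.linspace(-2.5, 2.5, 50)[:, None]
mse = np.mean((nystrom_krr_predict(landmarks, alpha, X_test) - np.sin(X_test[:, 0])) ** 2)
```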

15.01.2025 14:34
Effects of metric distortion in learning eigenvalues (left) and stabilization of forecasting (right) for Ornstein-Uhlenbeck process

6/ [P3] ICML 2024 addressed a critical issue in TO-based modeling: reliable long-term predictions.
Our Deflate-Learn-Inflate (DLI) paradigm ensures uniform error bounds, even for infinite time horizons. This method stabilized predictions in real-world tasks; see the figure.

15.01.2025 14:34

5/ [P2] NeurIPS 2023 advanced TOs with theoretical guarantees for spectral decomposition, for which finite-sample guarantees were previously lacking. We developed sharp learning rates, enabling accurate, reliable models for long-term system behavior.

15.01.2025 14:34
Koopman Operator Regression Pipeline

4/ 🔑 The journey began with:
[P1] NeurIPS 2022
We introduced the first ML formulation for learning TO, which led to the development of the open-source Kooplearn library. This step laid the groundwork for exploring the theoretical limits of operator learning from finite data.

15.01.2025 14:34

3/ TOs describe system evolution over finite time intervals, while IGs capture instantaneous rates of change. Their spectral decomposition is key for identifying dominant modes and understanding long-term behavior in complex or stochastic systems.
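
A minimal sketch of the idea, assuming the simplest possible case: a finite-state Markov chain, where the transfer operator is just the transition matrix. This is not the RKHS setting of the papers in this thread and does not use Kooplearn; every function name and the toy matrix `P_true` are assumptions. The eigenvalues close to 1 correspond to the slow modes, and each one yields an implied relaxation timescale of -1 / ln(eigenvalue) steps.

```python
import numpy as np

def simulate_chain(P, n_steps, seed=0):
    """Sample a trajectory of a finite-state Markov chain with transition matrix P."""
    rng = np.random.default_rng(seed)
    states = np.empty(n_steps, dtype=int)
    states[0] = 0
    for t in range(1, n_steps):
        states[t] = rng.choice(len(P), p=P[states[t - 1]])
    return states

def estimate_transfer_matrix(states, n_states):
    """Estimate the one-step transfer operator by counting observed transitions."""
    counts = np.zeros((n_states, n_states))
    np.add.at(counts, (states[:-1], states[1:]), 1.0)
    return counts / counts.sum(axis=1, keepdims=True)

# Toy three-state chain (assumed for the example); its exact eigenvalues are 1, 0.95, 0.85.
P_true = np.array([[0.95, 0.05, 0.00],
                   [0.05, 0.90, 0.05],
                   [0.00, 0.05, 0.95]])
states = simulate_chain(P_true, n_steps=20000)
P_hat = estimate_transfer_matrix(states, n_states=3)

# Spectral decomposition: the eigenvalue 1 is the stationary mode, the rest decay over time.
eigvals = np.sort(np.linalg.eigvals(P_hat).real)[::-1]
timescales = -1.0 / np.log(eigvals[1:])  # in units of simulation steps
```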

15.01.2025 14:34

2/ 🌐 Our work revolves around Markov/Transfer Operators (TO) and their Infinitesimal Generators (IG): tools that allow us to model complex dynamical systems by understanding their evolution in higher-dimensional spaces. Here’s why this matters.

15.01.2025 14:34

1/ 🚀 Over the past two years, our team, CSML, at IIT, has made significant strides in the data-driven modeling of dynamical systems. Curious about how we use advanced operator-based techniques to tackle real-world challenges? Let’s dive in! 🧵👇

15.01.2025 14:34

An inspiring dive into understanding dynamical processes through 'The Operator Way.' A fascinating approach made accessible for everyone. Check it out! 👇👀

15.01.2025 10:31
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues Linear Recurrent Neural Networks (LRNNs) such as Mamba, RWKV, GLA, mLSTM, and DeltaNet have emerged as efficient alternatives to Transformers in large language modeling, offering linear scaling with…

Excited to present
"Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues"
at the M3L workshop at #NeurIPS
https://buff.ly/3BlcD4y

If interested, you can attend the presentation on the 14th at 15:00, stop by the afternoon poster session, or DM me to discuss :)
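
For intuition about the title, here is a toy sketch (not the paper's architecture or experiments) of why the sign of the state-transition eigenvalue matters for a simple state-tracking task, parity. It assumes a scalar linear recurrence h_t = a(x_t) * h_{t-1}; the helper `linear_rnn_scan` and the two transition rules are hypothetical names chosen for the example.

```python
import numpy as np

def linear_rnn_scan(x, a_of_x, h0=1.0):
    """Run the scalar linear recurrence h_t = a(x_t) * h_{t-1} and return every state."""
    h = h0
    out = []
    for x_t in x:
        h = a_of_x(x_t) * h
        out.append(h)
    return np.array(out)

bits = np.array([1, 0, 1, 1, 0, 1])

# Transition values in {-1, +1}: every 1 in the input flips the sign of the state.
signed = linear_rnn_scan(bits, lambda b: 1.0 - 2.0 * b)
parity = ((1 - signed) / 2).astype(int)  # +1 means even so far, -1 means odd so far

# Transition values restricted to [0, 1]: the state can shrink but never change sign,
# so this particular recurrence cannot encode the even/odd flip in its sign.
nonneg = linear_rnn_scan(bits, lambda b: 1.0 - 0.5 * b)
```

The running parity read off from `signed` matches `np.cumsum(bits) % 2`, while `nonneg` stays strictly positive throughout.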

10.12.2024 22:52

In his book “The Nature of Statistical Learning Theory” V. Vapnik wrote:
“When solving a given problem, try to avoid a more general problem as an intermediate step”

12.12.2024 17:19

Join us at our posters and talks to connect, share ideas, and explore collaborations. 🚀✨

10.12.2024 02:38

🔬 Fine-tuning Foundation Models for Molecular Dynamics: A Data-Efficient Approach with Random Features
✍️ @pienovelli.bsky.social, L. Bonati, P. Buigues, G. Meanti, L. Rosasco, M. Pontil | 📅 ML4PS Workshop, Dec 15.

10.12.2024 02:38

🔗 Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
✍️ R. Grazzi, J. Siems, J. Franke, A. Zela, F. Hutter, M. Pontil
📃 https://arxiv.org/abs/2411.12537 | 📅 Oral @ M3L workshop, Dec 14, 15:00 - 15:15.

10.12.2024 02:38