Almost 5 years in the making... "Hyperparameter Optimization in Machine Learning" is finally out!
We designed this monograph to be self-contained, covering grid, random, and quasi-random search; Bayesian and multi-fidelity optimization; gradient-based methods; and meta-learning.
arxiv.org/abs/2410.22854
17.12.2025 09:54
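For a feel of the simplest method on that list, here is a minimal random-search sketch; `train_and_score` and the `space` dictionary are hypothetical stand-ins for illustration, not code from the monograph.

```python
import random

# A minimal random-search sketch (one of the baselines the monograph covers).
# `train_and_score` is a hypothetical user-supplied function returning a
# validation score for a hyperparameter configuration.

def random_search(space, train_and_score, n_trials=50, seed=0):
    rng = random.Random(seed)
    best_cfg, best_score = None, float("-inf")
    for _ in range(n_trials):
        # Sample one value per hyperparameter from its search distribution.
        cfg = {name: sample(rng) for name, sample in space.items()}
        score = train_and_score(cfg)
        if score > best_score:
            best_cfg, best_score = cfg, score
    return best_cfg, best_score

# Example search space: log-uniform learning rate, uniform dropout.
space = {
    "lr": lambda r: 10 ** r.uniform(-5, -1),
    "dropout": lambda r: r.uniform(0.0, 0.5),
}
```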
OpenReview might have leaked names, but it won't leak the best hyperparameters, unfortunately!
Tired of the drama? Solve your HPO problems before the ICML deadline with this new monograph by our own Luca Franceschi & Massimiliano Pontil (& colleagues).
arxiv.org/abs/2410.22854
28.11.2025 17:34
He will also present an entropy-respecting forward-backward learning scheme that mitigates the inherent ill-posedness of stochastic learning problems.
Join us for what promises to be a very insightful session!
14.11.2025 14:03
In this talk, Arthur Bizzi will introduce Neural Kolmogorov Equations, a deterministic and parallelizable framework for learning continuous-time stochastic processes using Forward and Backward Kolmogorov Equations.
14.11.2025 14:03
Abstract:
Learning differential equations becomes substantially more challenging in the presence of stochasticity, as Neural SDEs typically require expensive, sequential integration during training.
14.11.2025 14:03
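For context, these are the standard one-dimensional forms of the two equations the framework builds on (textbook notation, not necessarily the paper's):

```latex
% For the SDE dX_t = \mu(x)\,dt + \sigma(x)\,dW_t:
\begin{align*}
  &\partial_t u + \mu(x)\,\partial_x u
    + \tfrac{1}{2}\sigma^2(x)\,\partial_x^2 u = 0
    && \text{(backward Kolmogorov)} \\
  &\partial_t p = -\partial_x\big(\mu(x)\,p\big)
    + \tfrac{1}{2}\partial_x^2\big(\sigma^2(x)\,p\big)
    && \text{(forward Kolmogorov / Fokker--Planck)}
\end{align*}
```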
Upcoming Talk at Our Lab
We're excited to host Arthur Bizzi from EPFL for a research talk next week!
Title: Towards Neural Kolmogorov Equations: Parallelizable SDE Learning with Neural PDEs
Date: November 19
Time: 16:00 CET
Location: Galileo Sala, CHT @iitalk.bsky.social
14.11.2025 14:03
Excited to share our group's latest work at #AISTATS2025!
Tackling concentration in dependent data settings with empirical Bernstein bounds for Hilbert space-valued processes.
Catch the poster tomorrow!
See the original tweet for details!
02.05.2025 18:36
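For reference, the classical scalar empirical Bernstein bound (Maurer & Pontil, 2009) reads as follows; the poster extends bounds of this flavor to dependent, Hilbert space-valued processes.

```latex
% For i.i.d. Z_1, \dots, Z_n in [0, 1], with probability at least 1 - \delta:
\[
  \mathbb{E}[Z] - \frac{1}{n}\sum_{i=1}^{n} Z_i
  \le \sqrt{\frac{2 V_n \ln(2/\delta)}{n}}
    + \frac{7 \ln(2/\delta)}{3(n-1)},
\]
% where V_n is the sample variance of Z_1, \dots, Z_n.
```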
DeltaProduct is here! Achieve better state tracking through highly parallel execution. Explore more!
09.04.2025 10:11
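A rough sketch of the mechanism, under the assumption that DeltaProduct chains several delta-rule micro-steps per token so the effective transition is a product of generalized Householder matrices (a hypothetical minimal form, not the paper's code):

```python
import numpy as np

def delta_step(S, k, v, beta):
    """One delta-rule update: S <- S (I - beta k k^T) + beta v k^T.
    With beta in (0, 2), the Householder-like factor has eigenvalue
    1 - beta in (-1, 1) along k, i.e. it can be negative."""
    k = k / np.linalg.norm(k)
    return S @ (np.eye(len(k)) - beta * np.outer(k, k)) + beta * np.outer(v, k)

def deltaproduct_step(S, keys, values, betas):
    """Several micro-steps per token: the state transition becomes a
    product of Householder-like matrices, richer than a single one."""
    for k, v, b in zip(keys, values, betas):
        S = delta_step(S, k, v, b)
    return S

# Toy usage with random keys/values.
d, rng = 4, np.random.default_rng(0)
S = deltaproduct_step(np.zeros((d, d)),
                      rng.normal(size=(2, d)), rng.normal(size=(2, d)),
                      betas=[1.5, 0.8])
```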
14/ Looking ahead, we're excited to tackle new challenges:
• Learning from partial observations
• Modeling non-time-homogeneous dynamics
• Expanding applications in neuroscience, genetics, and climate modeling
Stay tuned for groundbreaking updates from our team!
15.01.2025 14:34
13/ Collaborations with the Dynamic Legged Systems group led by Claudio Semini and the Atomistic Simulations group led by Michele Parrinello enriched our research, resulting in impactful works like [P9, P10] and [P7, P11].
15.01.2025 14:34
12/ This journey wouldn't have been possible without the inspiring collaborations that shaped our work.
Special thanks to Karim Lounici from École Polytechnique, whose insights were a major driving force behind many projects.
15.01.2025 14:34
Figure: Predicting the quantiles for opening/closing of the Chignolin protein in the next simulation step
11/ One of our most exciting results:
[P8] NeurIPS 2024 proposed Neural Conditional Probability (NCP) to efficiently learn conditional distributions. It simplifies uncertainty quantification and guarantees accuracy for nonlinear, high-dimensional data.
15.01.2025 14:34
10/ [P7] NeurIPS 2024 developed methods to discover slow dynamical modes in systems like molecular simulations. This is transformative for studying rare events and costly data acquisition scenarios in atomistic systems.
15.01.2025 14:34
9/ Addressing continuous dynamics:
[P6] NeurIPS 2024 introduced a physics-informed framework for learning Infinitesimal Generators (IG) of stochastic systems, ensuring robust spectral estimation.
15.01.2025 14:34
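For context, the infinitesimal generator of a diffusion has the standard form below (general textbook notation, not [P6]'s specific formulation):

```latex
% For dX_t = \mu(X_t)\,dt + \sigma(X_t)\,dW_t:
\[
  (\mathcal{L} f)(x) = \mu(x) \cdot \nabla f(x)
    + \tfrac{1}{2}\operatorname{Tr}\!\big(\sigma(x)\sigma(x)^{\top}\,
      \nabla^2 f(x)\big).
\]
```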
8/ Representation learning takes center stage in:
[P5] ICLR 2024
We combined neural networks with operator theory via Deep Projection Networks (DPNets). This approach enhances robustness, scalability, and interpretability for dynamical systems.
15.01.2025 14:34
Figure: Free energy surface of Chignolin protein folding
7/ Scaling up:
[P4] NeurIPS 2023 introduced a Nyström sketching-based method to reduce computational costs from cubic to almost linear without sacrificing accuracy. Validated on massive datasets like molecular dynamics; see the figure.
15.01.2025 14:34
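A generic Nyström sketch of the underlying idea, assuming an RBF kernel and uniform landmark sampling (illustrative only, not the paper's estimator):

```python
import numpy as np

# Approximate an n x n kernel matrix from m << n landmark columns,
# K ~= K_nm @ pinv(K_mm) @ K_nm.T, cutting the cubic cost of working
# with the full matrix down to roughly O(n m^2).

def rbf(A, B, gamma=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def nystrom_factors(X, kernel, m, seed=0):
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=m, replace=False)    # landmark points
    K_nm = kernel(X, X[idx])                           # n x m block
    K_mm_inv = np.linalg.pinv(kernel(X[idx], X[idx]))  # m x m core
    return K_nm, K_mm_inv        # K ~= K_nm @ K_mm_inv @ K_nm.T

X = np.random.default_rng(1).normal(size=(500, 3))
K_nm, K_mm_inv = nystrom_factors(X, rbf, m=50)
```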
Figure: Effects of metric distortion in learning eigenvalues (left) and stabilization of forecasting (right) for the Ornstein-Uhlenbeck process
6/ [P3] ICML 2024 addressed a critical issue in TO-based modeling: reliable long-term predictions.
Our Deflate-Learn-Inflate (DLI) paradigm ensures uniform error bounds, even for infinite time horizons. This method stabilized predictions in real-world tasks; see the figure.
15.01.2025 14:34
5/ [P2] NeurIPS 2023 advanced TOs with theoretical guarantees for spectral decomposition, which previously lacked finite-sample guarantees. We developed sharp learning rates, enabling accurate, reliable models for long-term system behavior.
15.01.2025 14:34
Figure: Koopman Operator Regression Pipeline
4/ The journey began with:
[P1] NeurIPS 2022
We introduced the first ML formulation for learning TOs, which led to the development of the open-source Kooplearn library. This step laid the groundwork for exploring the theoretical limits of operator learning from finite data.
15.01.2025 14:34
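To make the idea concrete, here is a generic EDMD-style estimate of a transfer operator from snapshot pairs; this is a sketch of the general technique, not Kooplearn's actual API.

```python
import numpy as np

def features(z):
    # Degree-2 polynomial dictionary for a 1-D state.
    return np.column_stack([np.ones_like(z), z, z ** 2])

def edmd(X, Y):
    # Least-squares K such that features(Y) ~= features(X) @ K;
    # the eigenvalues of K approximate the Koopman spectrum.
    return np.linalg.lstsq(features(X), features(Y), rcond=None)[0]

# Example: snapshot pairs from a noisy linear map x_{t+1} = 0.9 x_t + noise.
rng = np.random.default_rng(0)
X = rng.normal(size=1000)
Y = 0.9 * X + 0.05 * rng.normal(size=1000)
eigvals = np.linalg.eigvals(edmd(X, Y))
```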
3/ TOs describe system evolution over finite time intervals, while IGs capture instantaneous rates of change. Their spectral decomposition is key for identifying dominant modes and understanding long-term behavior in complex or stochastic systems.
15.01.2025 14:34
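For reference, the two objects in textbook notation (not specific to the thread's papers): the transfer operator over a time step t, and its generator.

```latex
\[
  (\mathcal{A}_t f)(x) = \mathbb{E}\big[\,f(X_t) \mid X_0 = x\,\big],
  \qquad
  \mathcal{L} f = \lim_{t \downarrow 0} \frac{\mathcal{A}_t f - f}{t}.
\]
```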
2/ Our work revolves around Markov/Transfer Operators (TOs) and their Infinitesimal Generators (IGs): tools that allow us to model complex dynamical systems by understanding their evolution in higher-dimensional spaces. Here's why this matters.
15.01.2025 14:34
1/ Over the past two years, our team CSML at IIT has made significant strides in the data-driven modeling of dynamical systems. Curious about how we use advanced operator-based techniques to tackle real-world challenges? Let's dive in!
15.01.2025 14:34
An inspiring dive into understanding dynamical processes through 'The Operator Way': a fascinating approach made accessible for everyone. Check it out!
15.01.2025 10:31
Excited to present
"Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues"
at the M3L workshop at #NeurIPS
https://buff.ly/3BlcD4y
If interested, you can attend the presentation on the 14th at 15:00, stop by the afternoon poster session, or DM me to discuss :)
10.12.2024 22:52
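A toy illustration of the core point, in hypothetical minimal code: once the input-dependent transition of a linear RNN may be negative, even a one-dimensional state can track parity, which transitions confined to [0, 1] cannot do.

```python
def parity(bits):
    h = 1.0
    for b in bits:
        a = -1.0 if b == 1 else 1.0  # transition eigenvalue in [-1, 1]
        h = a * h                    # state flips sign on every 1-bit
    return 0 if h > 0 else 1         # parity of the bit string

assert parity([1, 0, 1, 1]) == 1  # odd number of ones
assert parity([1, 1, 0]) == 0     # even number of ones
```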
In his book "The Nature of Statistical Learning Theory", V. Vapnik wrote:
"When solving a given problem, try to avoid a more general problem as an intermediate step."
12.12.2024 17:19
Join us at our posters and talks to connect, share ideas, and explore collaborations.
10.12.2024 02:38
Fine-tuning Foundation Models for Molecular Dynamics: A Data-Efficient Approach with Random Features
Authors: @pienovelli.bsky.social, L. Bonati, P. Buigues, G. Meanti, L. Rosasco, M. Pontil
ML4PS Workshop, Dec 15.
10.12.2024 02:38
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues
Authors: R. Grazzi, J. Siems, J. Franke, A. Zela, F. Hutter, M. Pontil
https://arxiv.org/abs/2411.12537
Oral @ M3L workshop, Dec 14, 15:00 - 15:15.
10.12.2024 02:38