SDPO enables RL agents to learn from rich feedback (i.e., not only whether an attempt failed, but why it failed, such as error messages). Even without such rich feedback, SDPO can reflect on past attempts and outperform GRPO. SDPO also accelerates solution discovery at test time!
30.01.2026 07:17 β π 6 π 1 π¬ 0 π 0
Excited to be back at AI House Davos! Looking forward to connecting and engaging in inspiring discussions, panels and roundtables about trustworthy AI, AI infrastructure and much more!
@eth-ai-center.bsky.social @ethz.ch @csateth.bsky.social
19.01.2026 11:42 β π 4 π 0 π¬ 0 π 0
ICML reaffirms its support to the community and standards of conduct:
- We do not tolerate harassment or other improper conduct;
- Academic integrity is paramount;
- We redouble our support to peer review, with more incentives for reviewers & financial support for OpenReview
icml.cc/public/blog#...
16.12.2025 19:43 β π 12 π 4 π¬ 0 π 1
Excited to attend @euripsconf.bsky.social and the @ellis.eu UnConference in Copenhagen this week!
02.12.2025 07:20 β π 37 π 0 π¬ 0 π 0
On my way to Montreal for COLM. Let me know if youβre also coming! Iβd be very happy to catch up!
We present our poster at #1013 in the Wednesday morning session.
Joint work with the amazing Ryo Bertolissi, @idoh.bsky.social, @arkrause.bsky.social.
06.10.2025 10:52 β π 11 π 1 π¬ 0 π 0
Here's the detailed technical report with many more details: github.com/swiss-ai/ape...
02.09.2025 20:42 β π 11 π 0 π¬ 0 π 0
Apertus: a fully open, transparent, multilingual language model
EPFL, ETH Zurich and the Swiss National Supercomputing Centre (CSCS) released Apertus today, Switzerlandβs first large-scale, open, multilingual language model β a milestone in generative AI for trans...
EPFL, ETH Zurich, and CSCS today released Apertus, Switzerland's first large-scale, multilingual language model (LLM). As a fully open LLM, it serves as a building block for developers and organizations to create their own applications.
ethz.ch/en/news-and-...
02.09.2025 09:07 β π 29 π 19 π¬ 1 π 2
Clinical notes are messy, inconsistent, and unstructuredβyet they hold some of the most valuable signals in real-world clinical practice.
Join us today at ICML at the Foundation Models for Structured Data workshop to see how we can make sense of these notes!
π West Ballroom D
18.07.2025 16:25 β π 10 π 2 π¬ 2 π 0
In our ICML paper, we study fine-tuning a generalist policy for multiple tasks. We ask, provided a pre-trained policy, how can we maximize multi-task performance with a minimal number of additional demonstrations?
π We are presenting a possible solution on Wed, 11am to 1.30pm at B2-B3 W-609!
14.07.2025 19:35 β π 11 π 4 π¬ 1 π 0
ETH ZΓΌrich students gain AI skills using the βAlpsβ supercomputer: www.cscs.ch/science/comp... #students #AI #supercomputer #wearealps
17.06.2025 09:39 β π 2 π 2 π¬ 0 π 0
β¨ Very excited to share that our work "Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs" will be presented at ICLR! β¨
ποΈ Wednesday, April 23rd, 7:00β9:30 p.m. PDT
π Hall 3 + Hall 2B #257
Joint work with my fantastic collaborators Sascha Bongni,
@idoh.bsky.social, @arkrause.bsky.social
21.04.2025 14:37 β π 16 π 1 π¬ 1 π 0
What is the place of exploration in today's AI landscape and in which settings can exploration algorithms address current open challenges?
Join us to discuss this at our exciting workshop at @icmlconf.bsky.social 2025: EXAIT!
exait-workshop.github.io
#ICML2025
17.04.2025 05:53 β π 10 π 3 π¬ 1 π 0
We've released our lecture notes for the course Probabilistic AI at ETH Zurich, covering uncertainty in ML and its importance for sequential decision making. Thanks a lot to @jonhue.bsky.social for his amazing effort and to everyone who contributed! We hope this resource is useful to you!
17.02.2025 07:19 β π 61 π 10 π¬ 1 π 0
Schematic illustration of a scalar-valued residual deep GP with L hidden layers. The last layer is a scalar-valued GP on the manifold. If it is not present, the model is manifold-valued. If it is replaced with a Gaussian vector field (GVF), the model is a vector field on the manifold.
Excited to share our ICLR 2025 oral "Residual Deep Gaussian Processes on Manifolds"!
With @vabor112.bsky.social & @arkrause.bsky.social, we introduce manifold-to-manifold GPs that can be composed together, generalising deep GPs to manifolds. Applications include wind prediction & Bayes opt! 1/n
13.02.2025 16:45 β π 38 π 9 π¬ 1 π 2
Reflecting back on an inspiring week at www.aihousedavos.com β such a vibrant environment to discuss, with AI thought leaders across academia, industry and the public sector, challenges around safe and responsible AI, and harnessing AI for sustainable development!
24.01.2025 09:20 β π 22 π 0 π¬ 0 π 0
π¨ New reinforcement learning algorithms π¨
Excited to announce MaxInfoRL, a class of model-free RL algorithms that solves complex continuous control tasks (including vision-based!) by steering exploration towards informative transitions.
Details in the thread π
17.12.2024 17:46 β π 18 π 2 π¬ 1 π 1
Weβre presenting our work βWhen to Sense and Control? A Time-adaptive Approach for Continuous-Time RLβ today at NeurIPS. Come join us in West at poster #6604 from 16:30-19:30!
Joint work with my fantastic collaborators Bhavya Sukhija, Yarden As, Florian DΓΆrfler, @arkrause.bsky.social
13.12.2024 23:35 β π 4 π 3 π¬ 1 π 0
Tomorrow Iβll be presenting our recent work on improving LLMs via local transductive learning in the FITML workshop at NeurIPS.
Join us for our β¨oralβ¨ at 10:30am in east exhibition hall A.
Joint work with my fantastic collaborators Sascha Bongni, @idoh.bsky.social, @arkrause.bsky.social
13.12.2024 18:32 β π 5 π 4 π¬ 1 π 0
Weβre presenting our work βTransductive Active Learning: Theory and Applicationsβ now at NeurIPS. Come join us in East at poster #4924!
Joint work with my fantastic collaborators Bhavya Sukhija, Lenart Treven, Yarden As, @arkrause.bsky.social
11.12.2024 19:53 β π 5 π 2 π¬ 1 π 0
Looking forward to attending!
22.11.2024 06:51 β π 11 π 0 π¬ 1 π 0
Assistant Professor (Tenure Track) of Computer Science β Responsible Artificial Intelligence
π£ We have a tenure-track faculty opening in Responsible AI at @ethzurich.bsky.social :
ethz.ch/en/the-eth-z.... Deadline Nov 30 for full consideration. ETH Zurich is a vibrant environment for AI research with the ETH AI Center etc. Please help spread the word!
20.11.2024 08:31 β π 79 π 23 π¬ 2 π 0
https://ai.ethz.ch/education/phd-and-postdoc-programs.html
π£ Last call for the Ph.D. and Postdoc Fellowships at the ETH AI Center -- Deadline Nov 19 '24 t.co/aYI5tWXUWK @ethzurich.bsky.social
18.11.2024 10:52 β π 21 π 9 π¬ 0 π 0
Happy to join this platform and talk about ML & AI research (of the blue sky nature and otherwise...)
18.11.2024 10:47 β π 49 π 1 π¬ 3 π 0
Researcher on MDPs and RL. Retired prof. #orms #rl
Assistant Professor / Faculty Fellow @nyudatascience.bsky.social studying cognition in mind & brain with neural nets, Bayes, and other tools (eringrant.github.io).
elsewhere: sigmoid.social/@eringrant, twitter.com/ermgrant @ermgrant
@Harvard Professor & Director Ctr for Computation & Society
@HCRCS
@GoogleDeepMind
Principal Scientist & Director for AI for Social Good
#AIforSocialGood #AIforSocialImpact #AIforhealth #AIforConservation
Chief Models Officer @ Stealth Startup; Inria & MVA - Ex: Llama @AIatMeta & Gemini and BYOL @GoogleDeepMind
Official Bluesky page of the Computer Science Department at ETH Zurich. Collected media and news from and about the department.
The world's leading venue for collaborative research in theoretical computer science. Follow us at http://YouTube.com/SimonsInstitute.
Machine Learning Scientist @ ETH Zurich, Active Learning, Sequence Design, GenAI
Associate Professor, National University of Singapore. Working in information theory, machine learning, and statistics.
Gemini Post-Training @ Google DeepMind
Previously:Β ETH Zurich, Cambridge, CERN
alizeepace.com
CMU postdoc, previously MIT PhD. Causality, pragmatism, representation learning, and AI for biology / science more broadly. Proud rat dad.
Assistant Professor at Stanford
Machine learning, algorithm design, econ-CS
https://vitercik.github.io/
PhD student @ ETH AI Center
Previously Student Researcher @ DeepMind
Interests include: RL, Game Theory, Market Design, Alignment, Post-training
Welcome to ETH AI Center! We are ethz.ch/en 's central hub leading the way towards trustworthy, accessible and inclusive #artificialintelligence
ai.ethz.ch
Hon. Associate Professor UCL CS | Ex-Dir. Research AI for Good & Head of Element AI London Office | Ex-DeepMind. He/Him | https://cornebise.com