Martin Jaggi mjaggi - Bluesky Statics

There has been some online discussion of prompt watermarks in ICML submissions.

tl;dr:
- Yes, this is one of the *conference*'s (several) scientific integrity measures
- Yes, it's not infalliable (but it still helps)
- No, your paper won't be desk rejected as a result 1/4

14.02.2026 19:52 — 👍 11 🔁 4 💬 1 📌 0

More like every week.

13.02.2026 16:41 — 👍 32 🔁 3 💬 1 📌 1

Open models continue to pace closed models on a 9 month lag.

04.02.2026 17:48 — 👍 65 🔁 8 💬 3 📌 3

A factor of 10 billion since 2010 😮

A couple of eye-opening slides form @sloeschcke.bsky.social's presentation at today’s @belongielab.org meeting (1/2)

30.01.2026 13:58 — 👍 184 🔁 39 💬 22 📌 0

🙀

28.01.2026 08:35 — 👍 0 🔁 0 💬 0 📌 1

The #ICML2026 abstract deadline has passed! We're at 33540 active abstracts (and dropping). How many will make it over the finish line? 🏁

24.01.2026 16:51 — 👍 16 🔁 2 💬 1 📌 4

New blog post (on a shiny new ICML blog!): What's New in #ICML2026 Peer Review

Some highlights:
- Policies to combat thinly sliced contributions
- Cascading desk rejections for peer-review abuse
- Reviewer reciprocity
- New ways to support authors and reviewers

Post: blog.icml.cc/2026/01/08/w...

08.01.2026 17:26 — 👍 23 🔁 8 💬 1 📌 0

Autonomous excavator constructs a six-metre-high dry-stone wall

A multidisciplinary team of ETH Zurich researchers developed a method of using an autonomous excavator to construct a dry-stone wall that is six metres high and sixty-five metres long.

30.11.2023 15:28 — 👍 1 🔁 2 💬 0 📌 1

We updated the plots we use to measure the open model ecosystem at
interconnects, to guide The ATOM Project, and to understand what's happening.

We have ~8 plots to summarize what's happening.
First, the high level picture showing China's growing adoption lead.

07.01.2026 15:27 — 👍 25 🔁 7 💬 1 📌 1

Royal Society B - Wikipedia

no. is this the same as how en.wikipedia.org/wiki/Royal_S... was created back then?

11.12.2025 20:35 — 👍 2 🔁 0 💬 1 📌 0

Announcing the ICML 2026 policy for LLMs in reviewing! Reviewers and authors both pick either conservative or permissive LLM use, and will be matched accordingly. Importantly: authors on papers who choose conservative must obey the conservative policy as reviewers.

11.12.2025 15:59 — 👍 23 🔁 10 💬 2 📌 11

what about Apertus? (seems they missed to add us in that ranking)

01.12.2025 21:36 — 👍 1 🔁 0 💬 0 📌 0

Experimental Git branch to support Apertus in the browser with Transformers.js

👀 I am working on something pretty cool..

Hopefully, it will soon be possible to try #Apertus 🇨🇭 directly in your browser, powered by Transformers.js 🎉

21.11.2025 20:08 — 👍 9 🔁 1 💬 1 📌 0

The threshold for consistent English/query understanding is now 3M parameters.

26.11.2025 09:21 — 👍 58 🔁 2 💬 3 📌 2

thanks for the lausanne visit and sharing these super cool results!

12.11.2025 22:31 — 👍 3 🔁 0 💬 0 📌 0

Breaking: we release a fully synthetic generalist dataset for pretraining, SYNTH and two new SOTA reasoning models exclusively trained on it. Despite having seen only 200 billion tokens, Baguettotron is currently best-in-class in its size range. pleias.fr/blog/blogsyn...

10.11.2025 17:30 — 👍 183 🔁 33 💬 3 📌 18

🎉 ICML 2026 Call for Papers (& Position Papers) is here! 🎉

📅 Key Dates
Abstract deadline: Jan 23, 2026 AOE
Paper deadline: Jan 28, 2026 AOE

A few key changes this year:
- Attendance for authors of accepted papers is optional
- Originally submitted version of accepted papers will be made public
...

07.11.2025 14:42 — 👍 14 🔁 8 💬 1 📌 3

so open-weights models are much happier than closed ones i guess, cause they live on in the long run, did i get that right?

05.11.2025 12:32 — 👍 2 🔁 0 💬 0 📌 0

Reasoning with Sampling: Your Base Model is Smarter Than You Think Frontier reasoning models have exhibited incredible capabilities across a wide array of disciplines, driven by posttraining large language models (LLMs) with reinforcement learning (RL). However, desp...

this seems to become a trend already: arxiv.org/abs/2510.14901

17.10.2025 19:43 — 👍 3 🔁 0 💬 0 📌 0

Base Models Know How to Reason, Thinking Models Learn When Why do thinking language models like DeepSeek R1 outperform their base counterparts? Despite consistent performance gains, it remains unclear to what extent thinking models learn entirely new reasonin...

91% of reasoning does not need RL 🤯 arxiv.org/abs/2510.07364

14.10.2025 21:53 — 👍 8 🔁 0 💬 1 📌 0

Gemini 2.5 Computer Use can solve Google’s own CAPTCHAs Google just introduced a new Gemini 2.5 Computer Use model, specially designed to help operate a GUI interface by interacting with visible elements using a virtual mouse and keyboard. I …

I just tried the official demo for the new Gemini 2.5 Computer Use model and it started by navigating to Google, solving Google's own CAPTCHA and then running a search! https://simonwillison.net/2025/Oct/7/gemini-25-computer-use-captchas/

07.10.2025 21:19 — 👍 9 🔁 4 💬 2 📌 0

apertus also! (september release, same mission but multilingual)

05.10.2025 22:13 — 👍 1 🔁 0 💬 0 📌 0

GitHub - swiss-ai/apertus-finetuning-recipes Contribute to swiss-ai/apertus-finetuning-recipes development by creating an account on GitHub.

cool idea. let’s us know how it goes! btw maybe these can be useful github.com/swiss-ai/ape...
or, since today, also unsloth and llamacpp

03.10.2025 17:53 — 👍 1 🔁 0 💬 1 📌 0

Faculty Positions in Computer & Communication Sciences – Learning Sciences The School of Computer and Communication Sciences (IC) at EPFL invites applications for tenure-track faculty positions in learning sciences and educational technologies, with a focus on computational ...

on the engineering track it renews yearly usually, but permanent is possible after some experience & paperwork. on the academic track see e.g. here www.epfl.ch/about/workin...

26.09.2025 19:34 — 👍 1 🔁 0 💬 0 📌 0

Apertus LLM - a swiss-ai Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages

Link to the first version of the Apertus open-data open-weights LLM - multilingual in >1000 languages, and compliant ethical AI huggingface.co/collections/...

25.09.2025 21:08 — 👍 1 🔁 0 💬 0 📌 0

Several open positions at EPFL Lausanne and ETH Zurich and, as part of the Swiss AI Initiative. We cover the entire stack of foundation model training. And we're open to international applicants of course (no H-1B required ;))

25.09.2025 21:08 — 👍 2 🔁 0 💬 1 📌 0

AI Research Engineers - Swiss AI Initiative AI Research Engineers - Swiss AI Initiative

We're hiring again for AI research engineering roles: Join the team behind the Apertus LLM, if you share our passion to work on impactful AI that's truly open.

careers.epfl.ch/job/Lausanne...

25.09.2025 21:08 — 👍 5 🔁 4 💬 2 📌 0

1/🚨 New preprint

How do #LLMs’ inner features change as they train? Using #crosscoders + a new causal metric, we map when features appear, strengthen, or fade across checkpoints—opening a new lens on training dynamics beyond loss curves & benchmarks.

#interpretability

25.09.2025 14:02 — 👍 15 🔁 6 💬 2 📌 0

Schweizer Sprachmodell Apertus: So sieht EU-konforme, transparente KI aus Vielsprachigkeit, Transparenz, Respekt vor geistigem Eigentum: Das offene große Sprachmodell aus Schweizer KI-Schmieden verinnerlicht europäische Werte.

Schweizer Sprachmodell Apertus: So sieht EU-konforme, transparente KI aus
https://www.heise.de/hintergrund/Schweizer-Sprachmodell-Apertus-So-sieht-EU-konforme-transparente-KI-aus-10638501.html?utm_source=flipboard&utm_medium=activitypub

Gepostet in Nachrichten @nachrichten-heiseonline

24.09.2025 15:30 — 👍 1 🔁 1 💬 0 📌 0

funktioniert schon seit letzter woche im neusten LM Studio (mit MLX) huggingface.co/models?searc...

GGUF kommt auch bald die tage

18.09.2025 22:07 — 👍 0 🔁 0 💬 1 📌 0

Posts by Martin Jaggi (@mjaggi.bsky.social)