Are you interested in Open-Endedness and AI for Science? π§ͺ
I'm hiring a Student Researcher at Google DeepMind for a 6-month role. Join us to work on building agents capable of novel scientific discoveries! π¬
Reach out if this sounds like you, and apply here π
docs.google.com/forms/d/e/1F...
11.11.2025 11:47 β π 4 π 0 π¬ 1 π 0
StochasTok: Improving Fine-Grained Subword Understanding in LLMs
Subword-level understanding is integral to numerous tasks, including understanding multi-digit numbers, spelling mistakes, abbreviations, rhyming, and wordplay. Despite this, current large language mo...
π Paper: arxiv.org/abs/2506.01687
π» Code: github.com/anyasims/sto...
A massive π to my incredible co-authors: Anya Sims, Thom Foster, @klarakaleb.bsky.social, Tuan-Duy H. Nguyen, Joseph Lee, @jfoerst.bsky.social, @yeewhye.bsky.social!
[8/8]
11.06.2025 12:09 β π 1 π 0 π¬ 0 π 0
The significant gains from this minimal change are super exciting, and we see huge potential for larger models and more complex tasks like coding, scientific reasoning, and beyond! We invite you to explore the paper and code!
[7/]
11.06.2025 12:09 β π 0 π 0 π¬ 1 π 0
More major advantages! π
COST-EFFECTIVE: StochasTok allows enhanced subword skills to be seamlessly 'retrofitted' into existing pretrained models - thus avoiding costly pretraining!
ENHANCED ROBUSTNESS: Improves resilience to alternative tokenizations! (see examples)
[6/]
11.06.2025 12:09 β π 0 π 0 π¬ 1 π 0
Empirically, we find:
LANGUAGE: As hoped, StochasTok unlocks language manipulation ability! (see task examples below)
MATH: Furthermore, StochasTok dramatically changes multi-digit addition, enabling grokking and even generalization to UNSEEN TOKENIZERS!π€―
[5/]
11.06.2025 12:09 β π 0 π 0 π¬ 1 π 0
Practically, StochasTok is:
β
Computationally lightweightπͺΆ
β
A simple dataset preprocessing step β No training loop or inference time changes required!π οΈ
β
Compatible with ANY base tokenizer β Allows us to retrofit pretrained models!π°
β
Robust to hyperparameter choice!π₯
[4/]
11.06.2025 12:09 β π 1 π 0 π¬ 1 π 0
The underlying StochasTok algorithm is extremely simple!
1οΈβ£ Simply tokenize text with ANY base tokenizer,
2οΈβ£ Then, stochastically split some of those tokens into equivalent token pairs.
Thatβs basically it! Repeat step 2 for the desired granularity.
[3/]
11.06.2025 12:09 β π 0 π 0 π¬ 1 π 0
π€The problem: Standard tokenization gives distinct token IDs for each token - making it unnecessarily hard to learn, e.g., βbookβ=3092 and βcookβ=171691 differ by a single letter.
πThe solution: Allow LLMs to naturally 'see inside' tokens via alternative tokenizations!
[2/]
11.06.2025 12:09 β π 0 π 0 π¬ 1 π 0
πIntroducing βStochasTok: Improving Fine-Grained Subword Understanding in LLMsβ!π
LLMs are incredible but still struggle disproportionately with subword tasks, e.g., for character counts, wordplay, multi-digit numbers, fixing typos⦠Enter StochasTok, led by Anya Sims!
[1/]
11.06.2025 12:09 β π 4 π 2 π¬ 1 π 1
Β© CBC/Radio-Canada 2025. All rights reserved.
It was an honor to be on Quirks and Quarks (the
CBC science show) with @cong-ml.bsky.social talking about The AI Scientist and the impact of AI on science.
Science is being transformed by the AI revolution
cbc.ca/listen/live-...
14.02.2025 22:26 β π 8 π 2 π¬ 1 π 0
Introducing Automated Capability Discovery!
ACD automatically identifies surprising new capabilities and failure modes in foundation models, via "self-exploration" (models exploring their own abilities).
Led by @cong-ml.bsky.social & @shengranhu.bsky.social
π¬π€π§ π [1/9]
12.02.2025 06:59 β π 19 π 3 π¬ 1 π 0
It's an honor that The AI Scientist is #1 on this list!
www.linkedin.com/feed/update/...
Congrats @chris-lu.bsky.social @cong-ml.bsky.social @RobertTLange @hardmaru.bsky.social @jfoerst.bsky.social
08.01.2025 18:50 β π 23 π 3 π¬ 0 π 0
Lots of interest in ADAS! Thanks everyone, and congrats
Shengran Hu and @cong-ml.bsky.social! πππ
16.12.2024 18:19 β π 10 π 3 π¬ 0 π 0
Honored to receive this award for ADAS!!
16.12.2024 21:33 β π 5 π 0 π¬ 0 π 0
Our in-progress work Quality-Diversity Self-Play (w/ @cong-ml.bsky.social and @jeffclune.com) will have a poster presentation at #NeurIPS2024 workshops (@IMOLNeurIPS2024 Sunday West meeting room 217 - 219 and OpenworldAgents Sunday East Meeting Room 1-3, Foyer). Please come visit us!
14.12.2024 18:59 β π 9 π 1 π¬ 0 π 1
Our work Automated Design of Agentic Systems (w/
Shengran Hu & @cong-ml.bsky.social) will have β¨two oralsβ¨ @ #NeurIPS2024 workshops (LanGame Sat 10:20, OWA Sun 4:50). Please come visit usπ
We would also love to chat about open-endedness, LLM agents, etc. Come by if you want to meet!
10.12.2024 21:49 β π 12 π 2 π¬ 0 π 0
Interested in robust model-based offline RL algorithms? Come check out Anya Sims presenting our new paper investigating the edge of reach problem in offline MBRL!
πEast Exhibit Hall A-C #4603
#NeurIPS2024
12.12.2024 00:34 β π 1 π 0 π¬ 0 π 0
The RL (and some non-RL folks) starter pack is almost full. Pretty clear that the academic move here has succeeded
go.bsky.app/3WPHcHg
18.11.2024 20:30 β π 104 π 32 π¬ 12 π 3
Now that @jeffclune.bsky.social and @joelbot3000.bsky.social are here, time for an Open-Endedness starter pack.
go.bsky.app/MdVxrtD
20.11.2024 07:08 β π 105 π 32 π¬ 16 π 5
Mathematician at UCLA. My primary social media account is https://mathstodon.xyz/@tao . I also have a blog at https://terrytao.wordpress.com/ and a home page at https://www.math.ucla.edu/~tao/
PhD candidate at UCSD. Prev: NVIDIA, Meta AI, UC Berkeley, DTU. I like robots π€, plants πͺ΄, books π, and they/them pronouns π³οΈβπ
https://www.nicklashansen.com
Research director @Inria, Head of @flowersInria
lab, prev. @MSFTResearch @SonyCSLParis
Artificial intelligence, cognitive sciences, sciences of curiosity, language, self-organization, autotelic agents, education, AI and society
http://www.pyoudeyer.com
Co-Founder & CEO, Sakana AI π β @sakanaai.bsky.social
https://sakana.ai/careers
Sakana AI is an AI R&D company based in Tokyo, Japan. πΌπ§
https://sakana.ai/careers
Research @ OpenAI, Prev PhD at Oxford University
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
RS at Nvidia focussing on autonomous vehicles, simulation and RL. Opinions my own and do not represent those of my employer, Nvidia.
Waitress turned Congresswoman for the Bronx and Queens. Grassroots elected, small-dollar supported. A better world is possible.
ocasiocortez.com
Machine Teacher. Research Scientist at Phaidra. PhD from TU Delft. Previously JP Morgan, Huawei, Unity.
https://www.suau.io/
Research Engineer @ Google DeepMind
AI & Transportation | MIT Associate Professor
Interests: AI for good, sociotechnical systems, machine learning, optimization, reinforcement learning, public policy, gov tech, open science.
Science is messy and beautiful.
http://www.wucathy.com
Ph.D. Student studying AI & decision making at Mila / McGill University. Currently at FAIR @ Meta. Previously Google DeepMind & Google Brain.
https://brosa.ca
Assistant Professor at the University of Alberta. Amii Fellow, Canada CIFAR AI chair. Machine learning researcher. All things reinforcement learning.
π Edmonton, Canada π¨π¦
π https://webdocs.cs.ualberta.ca/~machado/
ποΈ Joined November, 2024
The nonprofit organization behind the Python programming language. For help with Python code: http://python.org/about/help/
On Mastodon: @ThePSF@fosstodon.org
www.lukaschaefer.com
Researcher @msftresearch.bsky.social; working on autonomous agents in video games; PhD Univ of Edinburgh ; Ex Huawei Noahβs Ark Lab, Dematic; Young researcher HLF 2022