's Avatar

@kylelwiggers.bsky.social

Ai2 Comms Lead | kylew@allenai.org | Pronouns: he/him

3,590 Followers  |  60 Following  |  785 Posts  |  Joined: 12.10.2024
Posts Following

Posts by (@kylelwiggers.bsky.social)

Post image

We analyzed 250K+ queries & 430K+ clickstream interactions from Asta, our AI-powered research assistantβ€”and today we're releasing the full dataset. How do researchers actually use AI science tools? Here's what we found. 🧡

27.02.2026 17:56 β€” πŸ‘ 23    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1
Post image

Can AI predict what scientists will do nextβ€”not just one piece, but the whole research process? PreScience is our new model eval for forecasting how science unfolds end-to-end, from how research teams form to a paper's eventual impact. Built with UChicago, supported by NSF.

25.02.2026 16:59 β€” πŸ‘ 5    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1

We've released a Chrome extension for Astaβ€”a faster way to go from finding a paper to asking questions about it while you read. 🧡

18.02.2026 18:37 β€” πŸ‘ 12    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Post image

Data mixing – determining how much web text, code, math, etc., you need for LM development – is a first-order lever on model quality. Introducing Olmix: a framework for configuring mixing methods at the start of dev & efficiently updating as data changes throughout. 🧡

13.02.2026 16:34 β€” πŸ‘ 22    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1
Post image

Knowing which questions to ask is often the hardest part of science. Today we're releasing AutoDiscovery in AstaLabs, an AI system that starts with your data and generates its own hypotheses. πŸ§ͺ

12.02.2026 16:06 β€” πŸ‘ 10    πŸ” 6    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Introducing MolmoSpaces, a large-scale, fully open platform + benchmark for embodied AI research. πŸ€–

230k+ indoor scenes, 130k+ object models, & 42M annotated robotic graspsβ€”all in one ecosystem.

11.02.2026 19:47 β€” πŸ‘ 11    πŸ” 4    πŸ’¬ 1    πŸ“Œ 2
Post image

LLMs often generate step-by-step instructions, from real-world tasks (how do I file taxes?) to plans for AI agents. Improving this is hard: outputs can sound fluent for steps that don't work, and current datasets cover few domains.

How2Everything evals/trains for this at scale. 🧡

10.02.2026 16:53 β€” πŸ‘ 21    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Post image

Since launching Open Coding Agents, it's been exciting to see how quickly the community has adopted them. Today we're releasing SERA-14B – a new 14B-parameter coding model – plus a major refresh of our open training datasets. 🧡

03.02.2026 17:39 β€” πŸ‘ 12    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Introducing Theorizer: Turning thousands of papers into scientific laws πŸ“šβž‘οΈπŸ“œ

Most automated discovery systems focus on experimentation. Theorizer tackles the other half of science: theory buildingβ€”compressing scattered findings into structured, testable claims. 🧡

28.01.2026 18:37 β€” πŸ‘ 34    πŸ” 8    πŸ’¬ 1    πŸ“Œ 5

Here's just one of the cool apps you can vibe-code with SERA, our new agentic coding model! I was lucky enough to get my hands on it early and it's quite capable via Claude Code. Give it a go today!

27.01.2026 20:29 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Post image

Introducing Ai2 Open Coding Agentsβ€”starting with SERA, our first-ever coding models. Fast, accessible agents (8B–32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, & it works with Claude Code out of the box. 🧡

27.01.2026 16:12 β€” πŸ‘ 129    πŸ” 23    πŸ’¬ 1    πŸ“Œ 7
Post image

Introducing HiRO-ACE: an AI framework that makes highly detailed climate simulations dramatically more accessible. It generates decades of high-resolution precipitation data for any region in a day on a single GPUβ€”no supercomputing cluster required. 🧡

21.01.2026 19:34 β€” πŸ‘ 33    πŸ” 8    πŸ’¬ 1    πŸ“Œ 3
Post image Post image Post image

Last year Molmo set SOTA on image benchmarks + pioneered image pointing. Millions of downloads later, Molmo 2 brings Molmo’s grounded multimodal capabilities to video πŸŽ₯β€”and leads many open models on challenging industry video benchmarks. 🧡

16.12.2025 16:51 β€” πŸ‘ 14    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

Introducing Bolmo, a new family of byte-level language models built by "byteifying" our open Olmo 3β€”and to our knowledge, the first fully open byte-level LM to match or surpass SOTA subword models across a wide range of tasks. 🧡

15.12.2025 17:19 β€” πŸ‘ 75    πŸ” 15    πŸ’¬ 1    πŸ“Œ 4
Post image Post image

Olmo 3.1 is here. We extended our strongest RL run and scaled our instruct recipe to 32Bβ€”releasing Olmo 3.1 Think 32B & Olmo 3.1 Instruct 32B, our most capable models yet. 🧡

12.12.2025 17:14 β€” πŸ‘ 14    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Video thumbnail

Update: DataVoyager, which we launched in Preview early this fall, is now available in Asta. πŸŽ‰
You can upload real datasets, ask complex research questions in natural language, & get back reproducible answers + visualizations. πŸ”πŸ“Š

08.12.2025 20:47 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Post image

Olmo 3 is now available through @hf.co Inference Providers, thanks to Public AI! πŸŽ‰
This means you can run our fully open 7B and 32B models β€” including Think and Instruct variants β€” via serverless API with no infrastructure to manage.

28.11.2025 16:50 β€” πŸ‘ 10    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Our Olmo 3 models are now available via API on
@openrouter.bsky.social. Try Olmo 3-Instruct (7B) for chat & tool use, and our reasoning models Olmo-3 Think (7B & 32B) for more complex problems.

22.11.2025 01:58 β€” πŸ‘ 24    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Post image

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flowβ€”not just the final weights, but the entire training journey.
Best fully open 32B reasoning model & best 32B base model. 🧡

20.11.2025 14:37 β€” πŸ‘ 68    πŸ” 17    πŸ’¬ 1    πŸ“Œ 2
Video thumbnail

Today we’re releasing Deep Research Tulu (DR Tulu)β€”the first fully open, end-to-end recipe for long-form deep research, plus an 8B agent you can use right away. Train agents that plan, search, synthesize, & cite across sources, making expert research more accessible. πŸ§­πŸ“š

18.11.2025 15:31 β€” πŸ‘ 48    πŸ” 14    πŸ’¬ 1    πŸ“Œ 3
Video thumbnail

Introducing OlmoEarth 🌍, state-of-the-art AI foundation models paired with ready-to-use open infrastructure to turn Earth data into clear, up-to-date insights within hoursβ€”not years.

04.11.2025 14:52 β€” πŸ‘ 34    πŸ” 5    πŸ’¬ 3    πŸ“Œ 3
Post image

Our fully open Olmo models enable rigorous, reproducible scienceβ€”from unlearning to clinical NLP, math learning, & fresher knowledge. Here’s how the research community has leveraged Olmo to make the entire AI ecosystem better + more transparent for all. 🧡

24.10.2025 18:36 β€” πŸ‘ 17    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1
Post image

We’re updating olmOCR, our model for turning PDFs & scans into clean text with support for tables, equations, handwriting, & more. olmOCR 2 uses synthetic data + unit tests as verifiable rewards to reach state-of-the-art performance on challenging documents. 🧡

22.10.2025 16:09 β€” πŸ‘ 37    πŸ” 6    πŸ’¬ 1    πŸ“Œ 3

πŸ“Š Today we're releasing data showing which scientific papers our AI research tool Asta cites most frequently. Think of it as creating citation counts for the AI eraβ€”tracking which research is actually powering AI answers across thousands of queries. 🧡

08.10.2025 18:26 β€” πŸ‘ 8    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Video thumbnail

Introducing Asta DataVoyagerβ€”our new AI capability in Asta that turns structured data into transparent, reproducible insights. Built for scientists, grounded in open, inspectable workflows. 🧡

01.10.2025 13:02 β€” πŸ‘ 18    πŸ” 5    πŸ’¬ 1    πŸ“Œ 2
Video thumbnail

"We check in more open-source [AI] in the world than just anybody, its just one other company, Ai2"

Jensen Huang on Nvidia's open models/datasets

28.09.2025 01:18 β€” πŸ‘ 24    πŸ” 2    πŸ’¬ 3    πŸ“Œ 1
Video thumbnail

πŸŽ™οΈ Say hello to OLMoASRβ€”our fully open, from-scratch speech-to-text (STT) model. Trained on a curated audio-text set, it boosts zero-shot ASR and now powers STT in the Ai2 Playground. πŸ‘‡

28.08.2025 16:13 β€” πŸ‘ 19    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1

Today we’re releasing agent-baselines, a suite of 22 classes of AI agents for scienceβ€”including 9 open-source research-tuned agents like our state-of-the-art, benchmark-leading Asta v0. πŸš€πŸ”¬
Part of our Asta ecosystem to advance scientific AI. πŸ‘‡

26.08.2025 19:45 β€” πŸ‘ 11    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0