Want strong LLM reasoning without breaking the bank? We explored just how cost-effectively RL can enhance reasoning using LoRA!
[1/9] Introducing Tina: A family of tiny reasoning models with strong performance at low cost, providing an accessible testbed for RL reasoning. 🧵
23.04.2025 17:10
Diving deep into LLM reasoning?
From OpenAI's o-series to DeepSeek R1, from post-training to test-time compute, we break it down into structured spreadsheets. 🧵
19.02.2025 18:01
Added! (bsky.app/profile/will...)
12.02.2025 08:10
Our paper also contains an in-depth discussion on safety when releasing metagenomic models.
Looking for collaborators to build on this with us. Please reach out!
metagene.ai
07.01.2025 20:58
We leverage the ecosystem of modern LLM tooling (in tokenization, model architecture, training, infra, etc.) for performance and extensibility. METAGENE-1 is standardized & easy to use.
Hugging Face: huggingface.co/metagene-ai
Github: github.com/metagene-ai
07.01.2025 20:58
A subset of results on our Genomic Embedding Benchmark and Pathogen Detection Benchmark.
METAGENE-1 shows state-of-the-art results on pathogen detection, metagenomic embedding, and other genomic tasks.
We also release new benchmarks for genomic detection and embedding (e.g., Gene-MTEB, based on MTEB for LLMs).
See our paper for details: arxiv.org/abs/2501.02045
07.01.2025 20:58
Overview of the metagenomic data collection and sequencing pipeline for model pretraining.
Our data pipeline is: human microbiome > wastewater > metagenomic sequences > tokens > training data.
Wastewater provides a rich source of data from tens of thousands of species across the human-adjacent microbiome. In total we pretrain on over 1.5T base pairs of DNA/RNA.
07.01.2025 20:58
Overview of METAGENE-1 and applications.
Metagenomic sequencing of wastewater produces vast amounts of data that can capture public health trends at a societal scale. Our goal is to train a model on this data to help in large-scale wastewater monitoring & detection of novel bio threats.
07.01.2025 20:58
Metagenomic Foundation Model for Pandemic Monitoring
Excited to release METAGENE-1, a 7B parameter metagenomic foundation model, built to aid in pathogen detection & pandemic monitoring. Pretrained on 1.5 trillion base pairs of DNA/RNA sequenced from wastewater.
A collab w/ USC, PrimeIntellect, & the Nucleic Acid Observatory.
metagene.ai
07.01.2025 20:58
Entropy is one of those formulas that many of us learn, swallow whole, and even use regularly without really understanding.
(E.g., where does that "log" come from? Are there other possible formulas?)
Yet there's an intuitive & almost inevitable way to arrive at this expression.
09.12.2024 22:44
Added!
09.12.2024 08:40
Added! (bsky.app/profile/will...)
07.12.2024 05:12
Added! (bsky.app/profile/will...)
06.12.2024 10:36
hi everyone!! let's try this optimal transport again
05.12.2024 12:58
YouTube video by WEHImovies
DNA Break Repair by Homologous Recombination (2024) Drew Berry wehi.tv
Delighted to publish my new molecular animation:
DNA Break Repair by Homologous Recombination
youtu.be/Xe-83tBcxhs
04.12.2024 00:07
Added! (bsky.app/profile/will...)
03.12.2024 22:20
Added!
02.12.2024 17:46
Added! (bsky.app/profile/will...)
02.12.2024 01:15
Added!
02.12.2024 00:35
Added! (bsky.app/profile/will...)
01.12.2024 11:13
Added!
01.12.2024 11:12
Added!
30.11.2024 09:39
Added!
30.11.2024 06:05
Added! (bsky.app/profile/will...)
28.11.2024 21:41
Added!
28.11.2024 09:14
Added!
28.11.2024 09:14
Anne Gagneux, Ségolène Martin, @quentinbertrand.bsky.social, Remi Emonet and I wrote a tutorial blog post on flow matching: dl.heeere.com/conditional-... with lots of illustrations and intuition!
We got this idea after their cool work on improving Plug and Play with FM: arxiv.org/abs/2410.02423
27.11.2024 09:00
Added!
28.11.2024 01:16
Added! (bsky.app/profile/will...)
27.11.2024 02:21
Added! (bsky.app/profile/will...)
26.11.2024 21:56
Breakthrough AI to solve the world's biggest problems.
► Join us: http://allenai.org/careers
► Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm
Chief Models Officer @ Stealth Startup; Inria & MVA - Ex: Llama @AIatMeta & Gemini and BYOL @GoogleDeepMind
Columbia CS professor. Head of Research at a16z crypto. Research on algorithms, game theory, mechanism design, blockchains/web3. Author of Algorithms Illuminated, Twenty Lectures on Algorithmic Game Theory, and Beyond the Worst-Case Analysis of Algorithms.
CS PhD student at USC. Interested in Generative AI, AI for science
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
CEO of FutureHouse, building an AI Scientist
Senior Staff Research Scientist @Google DeepMind, previously Stats Prof @Oxford Uni - interested in Computational Statistics, Generative Modeling, Monte Carlo methods, Optimal Transport.
Digital Geometer, Associate Professor of Computer Science & Robotics at Carnegie Mellon University. There are four lights.
https://www.cs.cmu.edu/~kmcrane/
Faculty at the Max Planck Institute for Software Systems, working at the intersection of ML, language models, and cognitive neuroscience. Yogurt snob. https://mtoneva.com/
Ph.D. student at UW-Madison. Working on automating foundation model guided science. Previously at CMU, UCSD, Fresno City College.
https://nick11roberts.science
Artist, Designer and Boat Builder
Experience the art of wooden boat building
Joined Bluesky in Dec 2024
Associate Prof. of Databases @ Carnegie Mellon.
Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI
The AI Accelerator Company. https://discord.gg/nousresearch
I work at Sakana AI: @sakanaai.bsky.social
https://sakana.ai/careers
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/