Ollie Liu's Avatar

Ollie Liu

@oliu-io.bsky.social

https://ollieliu.com/; oliver irl phd'ing in ml@usc; prev. ml@cmu, msr multimodal foundation models, ai4sci, decision making

1,515 Followers  |  326 Following  |  13 Posts  |  Joined: 12.11.2024  |  1.7483

Latest posts by oliu-io.bsky.social on Bluesky

Thanks to my amazing collaborators: @samsja19.bsky.social , Johannes Hagemann, @shangshang-wang.bsky.social , Jason Wiemels, Jeff Kaufman, and @willieneis.bsky.social
Special shout out to the Nucleic Acid Observatory for the sequencing data, and @PrimeIntellect for compute support.

06.01.2025 17:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We’re sharing METAGENE-1’s:
πŸ“„Paper: metagene.ai/metagene-1-p...
🌐Website: metagene.ai
πŸ€—Model weights: huggingface.co/metagene-ai
🧡7/

06.01.2025 17:04 β€” πŸ‘ 5    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

πŸ›‘Tailored for detection, not design. We scoped METAGENE-1 to minimize risks while maximizing potential for public health and biosurveillance. Responsible open-sourcing matters. With open weights, we aim to drive progress in interpretability and safe genomics research.
🧡6/

06.01.2025 17:04 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ“ˆMETAGENE-1 achieves state-of-the-art results in:
- Pathogen detection
- Genomic embedding benchmarks
- Generalization to multi-species tasks
It already shows promise in public health and biosurveillance, and we are collaborating with experts to unlock its full impact.
🧡5/

06.01.2025 17:04 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1
Post image

The METAGENE-1 model is 7B parameter Llama-style transformer πŸ¦™, pretrained and optimized for anomaly detection, embedding, and multi-species genomics. Fully compatible with πŸ€—Hugging Face (huggingface.co/metagene-ai) – ready to use like any of your favorite LLMs!
🧡4/

06.01.2025 17:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ“ŠThe data behind METAGENE-1:
- Brand-new dataset collected with experts from Southern California & Missouri
- 1.5 trillion base pairs from diverse wastewater samples
- Short reads (100–300 BPs), deep sequencing at scale
- Byte-Pair Encoding customized for genomic sequences
🧡3/

06.01.2025 17:04 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Why is METAGENE-1 special? πŸ€”We trained it on wastewater metagenomics, capturing the human-adjacent microbiome across the US for the past 12 months. This unlocks powerful capabilities for early pathogen detection and microbial ecosystems understanding. 🌱🦠
🌐Website: metagene.ai
🧡2/

06.01.2025 17:04 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Introducing METAGENE-1🧬, an open-source 7B-parameter metagenomics foundation model pretrained on 1.5 trillion base pairs. Built for pandemic monitoring, pathogen detection, and biosurveillance, with SOTA results across many genomics tasks.
🧡1/

06.01.2025 17:04 β€” πŸ‘ 27    πŸ” 6    πŸ’¬ 2    πŸ“Œ 0
Post image

Landed at Vancouver to attend #NeurIPS :-) Excited to chat about multimodal models, AI4Science, decision making, and more!

10.12.2024 00:28 β€” πŸ‘ 15    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs.

SmolVLM can be fine-tuned on a Google collab and be run on a laptop! Or process millions of documents with a consumer GPU!

26.11.2024 15:57 β€” πŸ‘ 104    πŸ” 22    πŸ’¬ 4    πŸ“Œ 4

πŸ‘‹ nlp@usc student. thanks!

25.11.2024 03:56 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

tfw you realize that this isn't an alt twitter for academic posting but an alt insta for cute doggos.

this is doodle, our border collie pup that often used as adversarial attacks for image classification models (they classify him as corgi :-)

18.11.2024 14:44 β€” πŸ‘ 13    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

yes please if there's still space left :-P

18.11.2024 05:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

our border collie pup doodle absolutely wants nothing from that plate of banana :-P

18.11.2024 04:24 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@oliu-io is following 20 prominent accounts