Yunha Hwang's Avatar

Yunha Hwang

@microyunha.bsky.social

Building genomic intelligence @ Tatta Bio

1,277 Followers  |  1,116 Following  |  36 Posts  |  Joined: 04.12.2023  |  1.7583

Latest posts by microyunha.bsky.social on Bluesky

This. Is. So. Cool. 🀯

05.11.2025 23:51 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Hi Roland, our servers are in the US, we explicitly state in our docs that we do not train models on private data, and the data is private to you only - unless intentionally made public (for publication/data sharing purposes)!

30.10.2025 01:36 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

thanks for the feedback! We are working on making more of the platform exportable as figures😊

29.10.2025 12:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thank you for the shoutout!

28.10.2025 18:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Released today from Tatta Bio: SeqHub! A place to explore, annotate, and share sequence data with functional insights.Β 

Over 1,000 scientists worldwide have already used SeqHub to annotate more than 550,000 proteins, uncovering new insights and accelerating discovery.

28.10.2025 15:03 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0

Annotations are mapped using embedding-based search, making it faster than most alignment-based search. HMM prediction speed-up comes from some optimization and parallelization :)

28.10.2025 16:33 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thank you! and PaperBLAST team deserves a shoutout for the sequence-paper linkages

28.10.2025 16:31 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@ancornman1.bsky.social @sokrypton.org @pgirguis.bsky.social @alexbateman1.bsky.social @simrouxvirus.bsky.social @apcamargo.bsky.social

28.10.2025 13:47 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
SeqHub SeqHub is a platform for exploring, annotating, and sharing biological sequences.

Currently, SeqHub is optimized for microbial protein and genome analysis. As we expand beyond microbial data, we'd love your feedback to help shape what comes next. I'm deeply grateful to our team at Tatta Bio, and to our collaborators and funders, for making this vision a reality. πŸ”— seqhub.org

28.10.2025 13:47 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 4    πŸ“Œ 1
Video thumbnail

We're thrilled to announce SeqHub, an AI-enabled platform for biological sequence analysis. SeqHub brings together sequence search, genome annotation, and data sharing in one place.

28.10.2025 13:47 β€” πŸ‘ 49    πŸ” 20    πŸ’¬ 3    πŸ“Œ 2

Ready to explore New Lineages of Life with @jgi.doe.gov ? 🧬🦠

Registration for our 2025 NeLLi Symposium is now open. For the first time in collaboration with @unlv.edu

Mark the date: November 6-7 in Las Vegas, NV

25.08.2025 21:39 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Preview
Gaia β€” Tatta Bio

We are building this infrastructure for the scientific community, and we invite feedback and collaboration from researchers at every stage. We are grateful to
the Moore Foundation for their generous support in making this project possible. Stay tuned for more updates!

www.tatta.bio/gaia

02.06.2025 16:23 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Today's sequence data infrastructure is set up for failure in the age of AI. Building an open and collaborative sequence platform for both Human and AI scientists.

At Tatta Bio, we have been thinking deeply about the sequence-to-function problem. We believe that before AI can power functional prediction, we first need to rethink how we curate, manage, and share sequence data. Here, we share our initial ideas on what we are building next:

02.06.2025 16:23 β€” πŸ‘ 8    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Preview
Assemblies of long-read metagenomes suffer from diverse errors Genomes from metagenomes have revolutionised our understanding of microbial diversity, ecology, and evolution, propelling advances in basic science, biomedicine, and biotechnology. Assembly algorithms...

I am very happy (and anxious) to share with you our most recent work in which we evaluated four of the most popular long-read assemblers,

www.biorxiv.org/content/10.1...

and tell you just a little bit about it in the following 🧡

28.04.2025 08:07 β€” πŸ‘ 134    πŸ” 71    πŸ’¬ 5    πŸ“Œ 7

I am so grateful for all the support I received from my mentors, colleagues and collaborators over the years: @pgirguis.bsky.social, @sokrypton.org, @simrouxvirus.bsky.social, @alexjprobst.bsky.social, @annedekas.bsky.social

28.04.2025 14:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It’s been an incredible journey building Tatta Bio with @ancornman1.bsky.social to advance AI infrastructure for biology, and I will continue to further our mission as chief scientist.

28.04.2025 13:47 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

My lab will couple ML and high throughput experimentation to harness the remarkable functional diversity of microbial genomes. If you are excited about the intersection of AI and microbiology, please get in touch!

28.04.2025 13:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

It’s official! πŸŽ‰ I’m thrilled to announce that I will be joining MIT as an assistant professor in a shared appointment between Biology, EECS and Schwarzman College of Computing this fall.

28.04.2025 13:47 β€” πŸ‘ 66    πŸ” 3    πŸ’¬ 9    πŸ“Œ 0
Preview
Job Board | Notion Overview

Tatta Bio is growing! We are hiring *two positions* in Business Development and Software Engineering to lead the development of AI-enabled scientific software for open science and biological sequence interpretation. Please check out the job postings at www.tatta.bio/careers and share widely!

24.03.2025 16:29 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Our thoughts too! (stay tunedπŸ‘€) πŸ˜‰

18.12.2024 22:37 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

As we improve Gaia Agent, we want to hear your feedback on the agent predictions. If you have suggestions on how we can increase its capabilities, please reach out! This was a major collaborative effort with @cong-ml.bsky.social , @joshuakravitz.com @nishantjha.org @ancornman1.bsky.social @Tatta Bio

17.12.2024 13:38 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Gaia Agent: Context-Aware Functional Insights at Scale β€” Tatta Bio An AI biologist discovers previously uncharacterized systems in the Mtb genome.

We tested Gaia Agent's capabilities with hypothetical genes in Mycobeterium tuberculosis. In our blog, We detail our in silico validation of Gaia Agent-predicted membrane transporter and lanthipeptide biosynthesis loci that were uncharacterized despite decades of Mtb research. Read more:

17.12.2024 13:38 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Like a human biologist, Gaia Agent considers sequence, structure and genomic context to *think* about functions of novel genes, drastically accelerating our ability to predict functions of billions of unannotated proteins across the tree of life.

17.12.2024 13:38 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Can LLM agents discover novel protein functions? Introducing Gaia Agent 🌎 πŸ€–: an AI biologist capable of reasoning across genomic contexts to predict functions of proteins! Gaia Agent is now integrated with Gaia Search at gaia.tatta.bio

17.12.2024 13:38 β€” πŸ‘ 38    πŸ” 13    πŸ’¬ 2    πŸ“Œ 1
Post image

If you are at #NeurIPS2024 don't miss @ancornman1.bsky.social's talk on OMG/gLM2 at 9AM! @workshopmlsb.bsky.social East meeting room 11,12

15.12.2024 16:21 β€” πŸ‘ 12    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Preview
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling Biological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraini...

Excited to be at #NeurIPS this week. @ancornman1.bsky.social will give a spotlight talk at the @workshopmlsb.bsky.social on gLM2/OMG! Please reach out if you want to chat about gLM2/OMG/Gaia and our latest projectsπŸ˜‡

www.biorxiv.org/content/10.1...

10.12.2024 16:01 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Preview
MIBiG 4.0: advancing biosynthetic gene cluster curation through global collaboration Abstract. Specialized or secondary metabolites are small molecules of biological origin, often showing potent biological activities with applications in ag

Are you working on natural products? We’ve just released version 4.0 of the MIBiG data standard and repository! It now includes 3059 biosynthetic gene clusters, thanks to the combined efforts of 288 expert contributors. A thread: (1/8) academic.oup.com/nar/advance-...

10.12.2024 08:05 β€” πŸ‘ 92    πŸ” 53    πŸ’¬ 4    πŸ“Œ 12
overview of results for PLAID!

overview of results for PLAID!

1/🧬 Excited to share PLAID, our new approach for co-generating sequence and all-atom protein structures by sampling from the latent space of ESMFold. This requires only sequences during training, which unlocks more data and annotations:

bit.ly/plaid-proteins
🧡

06.12.2024 17:44 β€” πŸ‘ 121    πŸ” 37    πŸ’¬ 1    πŸ“Œ 3

you can search for eukaryotic sequences too, and you might find interesting homology to microbial proteins! (the current database you search against is microbial)

23.11.2024 23:39 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Our Big Fantastic Virus Database (BFVD) is now published NAR! It contains protein structure predictions of major viral clades, enhanced by petabase-scale homology search and it's explorable on the web.
🌐 bfvd.foldseek.com
πŸ’Ύ bfvd.steineggerlab.workers.dev
πŸ“„ academic.oup.com/nar/advance-...

23.11.2024 21:12 β€” πŸ‘ 339    πŸ” 127    πŸ’¬ 6    πŸ“Œ 5

@microyunha is following 20 prominent accounts