Dave Burke's Avatar

Dave Burke

@daveyburke.bsky.social

CTO at Arc Institute | Google Advisor (Android) ๐Ÿ‡ฎ๐Ÿ‡ช + ๐Ÿ‡บ๐Ÿ‡ฒ

99 Followers  |  23 Following  |  18 Posts  |  Joined: 23.11.2024  |  1.9855

Latest posts by daveyburke.bsky.social on Bluesky

Post image

Zach believes we're actually making a company to sell this thing. He has a business card (he's the CEO and I'm his CTO obvs). Even made a badge. For now, we're open sourcing the base model :). Python code and build instructions here: github.com/daveyburke/Z.... Enjoy!

23.03.2025 05:42 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

My 7 yr old has been dreaming up an AI teddy bear. So we made him! Zaby is a clever, pedagogical & funny teddy that loves talking math. Powered by Gemini Flash & Google speech recognition/synthesis. His mouth moves in sync with the speech envelope. Smarter than the average bear!

23.03.2025 05:42 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
GTC March 2025 Keynote with NVIDIA CEO Jensen Huang
YouTube video by NVIDIA GTC March 2025 Keynote with NVIDIA CEO Jensen Huang

"They help us unravel... the language of life" - Arc Institute's Evo 2 model featured during NVIDIA's GTC 2025 keynote - m.youtube.com/watch?v=_waP...

18.03.2025 22:28 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Virtual Cell Atlas | Arc Institute Arc Institute is a independent nonprofit research organization headquartered in Palo Alto, California.

Excited to announce the Arc Virtual Cell Atlas, a collection of high quality, curated, open datasets, incorporating scBaseCamp and Tahoe-100M from @vevo_ai. We hope this can be the beginning of an "ImageNet moment" for virtual cell modeling. Available at arcinstitute.org/tools/virtua...

25.02.2025 14:39 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Manuscript | Arc Institute Arc Institute is a independent nonprofit research organization headquartered in Palo Alto, California.

In the future, we can use this mechanism to steer DNA generation, for example make a prokaryotic sequence have more eukaryotic features, or increase the presence of alpha helices. You can read more in the Evo 2 preprint here: arcinstitute.org/manuscripts/...

19.02.2025 16:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It shows genomic concepts in a reference genome such as coding sequences, alpha helices, tRNAs, etc. The tool overlays corresponding features that activate when Evo 2 detects such concepts. Whatโ€™s amazing is Evo learned all this from genomes in nature without any supervision!

19.02.2025 16:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Together with @GoodfireAI we built a visualizer that lets you explore the concepts learned by Evo 2. Try it here: arcinstitute.org/tools/evo/ev...

19.02.2025 16:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We applied sparse autoencoders to Evo 2, our new DNA model, to show it autonomously learns a breadth of biological features, including exonโ€“intron boundaries, transcription factor binding sites, protein structural elements, and prophage genomic regions

19.02.2025 16:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

A common critique of large AI models is that they are black boxes. The recent field of mechanistic interpretability aims to โ€œlook insideโ€ the AI black box

19.02.2025 16:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This is one of many applications of this work. Evolution has learned to read and write DNA over millions of years, and Evo 2 aims to learn from this knowledge. The AI model serves as a foundation for understanding the language of life across all domainsโ€”from bacteria to humans

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This particular variant was initially reported as a variant of unknown significance (VUS). Years later, oncologists learned it was a driver of breast and ovarian cancers. In the Evo paper, we show state of the art performance on classifying BRCA1 variants of unknown significance

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

If I take a known deleterious mutation c.5095C>T that changes just the 5095th nucleotide in exon 17 from C to T, the negative log likelihood increases from 0.96 to 0.99 indicating the model is less confident. Evo recognizes that this mutation causes a loss of function of the gene

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Evo Designer can also score DNA sequences, i.e. how likely the sequence is in nature. Hereโ€™s an example of a section of the BRCA1 - certain mutations in this gene are known to increase the risk of breast & ovarian cancer

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Prompt with a sequence or species and the model will generate new DNA. Select sections of generated DNA to visualize the corresponding proteins, or use BLAST to find similar sequences in nature

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Evo 2: DNA Foundation Model | Arc Institute Arc Institute is a independent nonprofit research organization headquartered in Palo Alto, California.

We built a new interactive user interface for generation and scoring called Evo Designer arcinstitute.org/tools/evo/ev...

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Manuscript | Arc Institute Arc Institute is a independent nonprofit research organization headquartered in Palo Alto, California.

Evo 2 uses a new hybrid architecture called StripedHyena 2 enabling a long context window of 1M nucleotides with a model size of 40B parameters, trained on 2048 H100 GPUs. Preprint can be found at arcinstitute.org/manuscripts/... and includes links to source code

19.02.2025 16:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Introducing Evo 2 from Arc Institute - an AI that can model and design the genetic code for all domains of life. Itโ€™s one of the largest-scale truly open source AI models for biology (and in fact more generally - most โ€œopen sourceโ€ large language models are only โ€œopen weightsโ€)

19.02.2025 16:05 โ€” ๐Ÿ‘ 8    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2
1^3+2^3+3^3+4^3+5^3+6^3+7^3+8^3+9^3 - Google Search

Happy \sum_{n=1}^{9}n^3
www.google.com/search?q=1%5...

01.01.2025 06:44 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@daveyburke is following 20 prominent accounts