Jacob Schreiber's Avatar

Jacob Schreiber

@jmschreiber91.bsky.social

Studying genomics, machine learning, and fruit. My code is like our genomes -- most of it is junk. Assistant Professor UMass Chan Previously IMP Vienna, Stanford Genetics, UW CSE.

6,604 Followers  |  1,405 Following  |  817 Posts  |  Joined: 17.11.2023  |  1.925

Latest posts by jmschreiber91.bsky.social on Bluesky

The @impvienna.bsky.social is a unique place where talented researchers are doing amazing science. I'm glad I had an opportunity to spend some time there, and am looking forward to the next excuse to visit Vienna!

25.11.2025 15:16 β€” πŸ‘ 12    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

That sounds too biologically important for me to be involved.

20.11.2025 19:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

My first @umasschan.bsky.social/@impvienna.bsky.social affiliated paper is up!

tomtom-lite is a re-implementation of tomtom targeting the ML age of genomics. Fast annotations ("what is this motif?") and simple large-scale discovery of motifs.

Check it out!

academic.oup.com/bioinformati...

20.11.2025 14:02 β€” πŸ‘ 35    πŸ” 14    πŸ’¬ 1    πŸ“Œ 0

Why is it a problem to translate C++ to changng standards when you can just use Agenic AI?

31.10.2025 19:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Tune in for a great MIA talk on ML for regulatory genomics by @jmschreiber91.bsky.social and Gregory Andrews now! πŸ§ͺ
broad.io/mia

(Will also be available online on our YouTube playlist later: www.youtube.com/watch?v=sMTO...)

29.10.2025 13:00 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
gReLU: a comprehensive framework for DNA sequence modeling and design - Nature Methods gReLU advances deep-learning-based modeling and analysis of DNA sequences with comprehensive toolsets and versatile applications.

I'm happy to share that our gReLU package is now published in Nature Methods!

www.nature.com/articles/s41...

15.10.2025 21:21 β€” πŸ‘ 20    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0

they told the flight attendants to sit down for the second half of the flight (an hour) because it was entirely turbulence

13.10.2025 21:43 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

had to land in a nor'eastern storm in boston and i'm ready to move back to the west coast forever

13.10.2025 21:37 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Now that I'm settled in at @umasschan.bsky.social, I'm hiring at all levels: grad students, post-docs, and software engineers/bioinformaticians!

The goal of my lab is to understand the regulatory role of every nucleotide in our genomes and how this changes across every cell in our bodies.

07.10.2025 15:24 β€” πŸ‘ 43    πŸ” 26    πŸ’¬ 6    πŸ“Œ 1

I figure I can spend the time until I get tenure on answering the question, and then the time after tenure arguing about what "regulatory" means

07.10.2025 15:37 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

If you're interested, please reach out with your CV and which topics you'd be interested in working on!

07.10.2025 15:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

- Genomics Software Ecosystem: A major obstacle to our goal is the lack of simple+scalable software that everyone can use. Come build this with me. Training a lightweight deep learning model and using it for design/interpretability/VE prediction should be no more challenging than mapping reads.

07.10.2025 15:28 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

- Foundation Models: As someone involved in ML, I am legally required to be working on this topic.

07.10.2025 15:26 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Programmatic design and editing of cis-regulatory elements The development of modern genome editing tools has enabled researchers to make such edits with high precision but has left unsolved the problem of designing these edits. As a solution, we propose Ledi...

We have an array of ML-based projects for going after this, focusing on the following topics:

- DNA Design ( 🧬 ) We have shown that Ledidi (www.biorxiv.org/content/10.1...) can precisely design DNA, and now it's time to push the boundaries in several directions w/ some very cool collaborations.

07.10.2025 15:26 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Now that I'm settled in at @umasschan.bsky.social, I'm hiring at all levels: grad students, post-docs, and software engineers/bioinformaticians!

The goal of my lab is to understand the regulatory role of every nucleotide in our genomes and how this changes across every cell in our bodies.

07.10.2025 15:24 β€” πŸ‘ 43    πŸ” 26    πŸ’¬ 6    πŸ“Œ 1

It was suggested that the audience may not appreciate/understand :(

03.10.2025 06:22 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

is it a good idea to wear a "join, or die!" hat to a big talk in europe? please say yes

30.09.2025 18:15 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

the greatest productivity hack is having a grant deadline. there's so much other stuff you can do when you're supposed to be working on a grant.

22.09.2025 14:05 β€” πŸ‘ 15    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

I was delighted to have the unexpected opportunity to give a keynote at MLCB 2025 in NYC last week. I used it to explain how I view deep learning models in genomics not as "uninterpretable black boxes" but as indispensable tools for understanding genomics + designing the next gen of synthetic DNA.

19.09.2025 13:59 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

for some reason i thought being a professor would involve more mentoring and research and less filling out disclosures concerning whether plants and seeds were used in my computational study

09.09.2025 19:24 β€” πŸ‘ 10    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

stocking up the new apartment with essentials

03.09.2025 14:24 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In the genomics community, we have focused pretty heavily on achieving state-of-the-art predictive performance.

While undoubtedly important, how we *use* these models after training is potentially even more important.

tangermeme v1.0.0 is out now. Hope you find it useful!

27.08.2025 16:20 β€” πŸ‘ 45    πŸ” 14    πŸ’¬ 1    πŸ“Œ 0

For some reason, hitting "comment" on GitHub is significantly more responsive than a month ago and it freaks me out. Surely there are some important calculations that need to be done before letting my thoughts into the wild?

27.08.2025 21:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thanks! Let me know if you want me to stop in virtually, we can try to figure out a time.

27.08.2025 17:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Hope you find tangermeme helpful in your work! Please reach out if you have any comments + questions.

27.08.2025 16:44 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Because everything is automatic, we can probe models.

What motifs are driving model predictions? Calculate attributions, call + annotate seqlets, and count the annotations!

BPNet is relying on MYC, whereas Beluga is relying on many more TFs. Easy comparison now.

27.08.2025 16:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Frequently, people manually annotate seqlets and draw bars or boxes around these high-attribution characters themselves. This is not really a problem, but it's just slow and does not scale genome-wide.

In the above picture, everything is automatically done.

27.08.2025 16:41 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

People *talk* about seqlets a lot but tangermeme is the first package for complete functionality.

Here is a complete example of using tangermeme for attributions, seqlet calling + annotation, and plotting, to visualize what five models think of the same locus

27.08.2025 16:40 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Expanding past these implementations, tangermeme has a large focus on automatic seqlet calling and usage. Seqlets are short contiguous spans of high-attribution characters that usually correspond to the binding of a TF.

27.08.2025 16:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

By considering attributions you can see how variants disrupt or change usage of motifs. Maybe you'll even find that a variant causes alternative binding by inducing a new motif or slightly changing competition! That would be challenging to see from the predictions alone.

27.08.2025 16:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@jmschreiber91 is following 20 prominent accounts