Martin Steinegger πŸ‡ΊπŸ‡¦'s Avatar

Martin Steinegger πŸ‡ΊπŸ‡¦

@martinsteinegger.bsky.social

Developing data intensive computational methods β€’ PI @ Seoul National University πŸ‡°πŸ‡· β€’ #FirstGen β€’ he/him β€’ HauptschΓΌler

4,283 Followers  |  532 Following  |  215 Posts  |  Joined: 30.06.2023
Posts Following

Posts by Martin Steinegger πŸ‡ΊπŸ‡¦ (@martinsteinegger.bsky.social)

I’ve never performed gene-targeted assembly myself, so I can’t speak from direct experience. I do remember that SPAdes had a gene-reference-based assembly mode that used HMMER.

01.03.2026 05:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Can we simulate realistic evolutionary trajectories and β€œreplay the tape of life”? In this work, we propose a flexible, generalizable deep learning framework for modeling how the entire protein sequence evolves over time while capturing complex interactions across sites. 1/n
doi.org/10.64898/202...

21.02.2026 17:13 β€” πŸ‘ 83    πŸ” 35    πŸ’¬ 3    πŸ“Œ 1

I am happy Marv finally made it!

20.02.2026 04:26 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Congratulations! But how did you get the Marv logo into figure 1? We needed to remove the logos πŸ₯²

19.02.2026 22:52 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
AlphaFold Database welcomes community datasets Latest AlphaFold Database update adds high-value datasets for microbial and viral proteins, generated by specialist communities

The AlphaFold Database, jointly developed with Google DeepMind, now contains datasets from specialist communities.

This includes rich datasets for microbes, viruses & parasites associated with tropical diseases.

Explore the new datasets in the AlphaFold Database.

www.ebi.ac.uk/about/news/u...

17.02.2026 13:40 β€” πŸ‘ 58    πŸ” 25    πŸ’¬ 1    πŸ“Œ 6
Post image

Introducing The Structural History of Eukarya (SHE): The first proteome-scale phylogeny constructed entirely from 3D structure.
We computed 300 trillion alignments across 1,542 species to map the tree of life. πŸ§΅πŸ‘‡ (1/5)

07.02.2026 08:50 β€” πŸ‘ 84    πŸ” 40    πŸ’¬ 2    πŸ“Œ 0
Post image

:D

30.01.2026 09:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Multiple protein structure alignment at scale with FoldMason Protein structure is conserved beyond sequence, making multiple structural alignment (MSTA) essential for analyzing distantly related proteins. Computational prediction methods have vastly extended ou...

FoldMason is out now in @science.org. It generates accurate multiple structure alignments for thousands of protein structures in seconds. Great work by Cameron L. M. Gilchrist and @milot.bsky.social.
πŸ“„ www.science.org/doi/10.1126/...
🌐 search.foldseek.com/foldmason
πŸ’Ύ github.com/steineggerla...

30.01.2026 06:11 β€” πŸ‘ 300    πŸ” 147    πŸ’¬ 4    πŸ“Œ 3
AmpliPhy improves gene trees by adding homologs without affecting alignments In phylogenomics, gene tree reconstruction depends on multiple sequence alignment (MSA) and tree inference, and ongoing work continues to improve inference quality. Denser taxon sampling has been associated with improved gene tree inference, suggesting that adding homologs could be a practical route to higher accuracy as sequence databases continue to expand. However, adding sequences can influence multiple steps of typical inference pipelines, and little is known on its specific effect on the multiple sequence alignment, tree reconstruction, and rooting steps. We performed a large-scale empirical benchmark to quantify how homolog enrichment affects alignment and phylogenetic inference. Using an enrichment-impoverishment design and a measure of tree accuracy based on taxonomic congruence, we found that enrichment consistently improves tree inference quality, while effects on alignment quality are marginal. We show that this improvement is associated with accurate root placement on enriched trees when sensitive homolog search is accompanied. Notably, much of the benefit can be retained with relatively compact alignments produced by sequence addition. Building on these observations, we provide a tool, AmpliPhy, which efficiently improves phylogenetic reconstruction of protein families through homolog enrichment. The AmpliPhy open-source pipeline software is available at https://github.com/DessimozLab/ampliphy. ### Competing Interest Statement The authors have declared no competing interest. Swiss National Science Foundation, https://ror.org/00yjd3n13, 216623, 10005715

Can ever-increasing sequence databases improve phylogenetic reconstruction of a gene family? Our new preprint introduces AmpliPhy, a pipeline that automates homolog enrichment to improve gene tree inference, built on a robust phylogenomic benchmark scheme. 🧡1/n
πŸ“ƒ doi.org/10.64898/2026.01.26.701724

28.01.2026 06:10 β€” πŸ‘ 25    πŸ” 14    πŸ’¬ 1    πŸ“Œ 0

Sorry to hear that :( looks like such a cool paper.

27.01.2026 04:47 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Very good point. We do not show these values in the front end. I flagged this. We will add them.

25.01.2026 06:56 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In the complex report, results are sorted first by TM-score normalized to the query complex length, then by the target complex TM-score. So their should be a visibile ordering.

24.01.2026 08:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We don’t combine per-chain bit scores into a single complex score. A proper bit score would need to account for the full complex superposition. However, we consider the per chain e-values. For now, use pair-chain E-values + complex TM-score for filtering is what we do.

24.01.2026 07:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Milot’s venture into establishing his own lab is incredibly excitinge. I highly recommend to join Milot on his mission to advance molecular biology through open-source bioinformatics.

21.01.2026 03:37 β€” πŸ‘ 36    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Preview
Mirdita Lab - Laboratory for Computational Biology & Molecular Machine Learning Mirdita Lab builds scalable bioinformatics methods.

My time in @martinsteinegger.bsky.social's group is ending, but I’m staying in Korea to build a lab at Sungkyunkwan University School of Medicine. If you or someone you know is interested in molecular machine learning and open-source bioinformatics, please reach out. I am hiring!
mirdita.org

20.01.2026 11:07 β€” πŸ‘ 104    πŸ” 55    πŸ’¬ 7    πŸ“Œ 1
Preview
The 2026 Nucleic Acids Research database issue and the online molecular biology database collection Abstract. The 2026 Nucleic Acids Research database issue has 182 papers from across biology and neighbouring fields. Eighty-four of these papers describe n

The new 2026 NAR Nucleic Acids Research Database Issue is now out! πŸŽ‰

Take a look at the Editorial here academic.oup.com/nar/article/...
and browse the whole Issue here
academic.oup.com/nar/issue/54...

18.01.2026 11:37 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

I am deeply saddened by this news. Peer was a brilliant scientist, and his passing is a tremendous loss, not only for our field, but for science as a whole.

17.01.2026 02:49 β€” πŸ‘ 18    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Structome-AlignViewer: On Confidence Assessment in Structure-Aware Alignments Abstract. Protein structure-based comparison provides a framework for uncovering deep evolutionary relationships that can escape conventional sequence-base

An interesting way to look at encoded structural characters to build alignments and trees downstream. This takes structural phylogenetics one step further in the post-AlphaFold era.

#Evolution #Science #StrPhy

On Confidence Assessment in Structure-Aware Alignments

doi.org/10.1093/gbe/...

14.01.2026 11:07 β€” πŸ‘ 18    πŸ” 10    πŸ’¬ 0    πŸ“Œ 0

Phold's manuscript is now available @narjournal.bsky.social thanks to @susiegriggo.bsky.social @npbhavya.bsky.social @vijinim.bsky.social @linsalrob.bsky.social @martinsteinegger.bsky.social @milot.bsky.social @eunbelivable.bsky.social & others not on bsky #phagesky academic.oup.com/nar/article/...

14.01.2026 05:10 β€” πŸ‘ 82    πŸ” 44    πŸ’¬ 1    πŸ“Œ 1

Did you write them?

05.01.2026 21:24 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A Vtuber is explaining the ProteinTTT paper. I’ve wondered when this trend would hit science.

03.01.2026 23:26 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
γ€θ«–ζ–‡η°‘ε˜θ§£θͺ¬γ€‘One protein is all you need簑単解θͺ¬πŸ’¨ #shorts #vtuberζΊ–ε‚™δΈ­ #ζΎͺ乃ゆい
YouTube video by Miono Yui ch. | ζΎͺ乃ゆい γ€θ«–ζ–‡η°‘ε˜θ§£θͺ¬γ€‘One protein is all you need簑単解θͺ¬πŸ’¨ #shorts #vtuberζΊ–ε‚™δΈ­ #ζΎͺ乃ゆい

Amazing summary of our recent ProteinTTT paper youtube.com/shorts/XWueh...

03.01.2026 13:25 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Group Leader - Generative Biology and AI Do you want to help us improve human health and understand life on Earth? Make your mark by shaping the future to enable or deliver life-changing science to solve some of humanity’s greatest challenge...

Looking to start your lab in generative biology / AI?
Come join us at the @sangerinstitute.bsky.social
Sanger is core-funded so you can generate data at scale to train the next generation of models and understanding. Design/Engineering/Chemistry/Proteins/Pathways!
pls RT
tinyurl.com/GenGenFaculty

01.01.2026 12:08 β€” πŸ‘ 33    πŸ” 33    πŸ’¬ 0    πŸ“Œ 0

I hope they fix the system soon. For now, I won’t review for them.

30.12.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Got three emails on Dec 24th, 25th, and Saturday the 27th asking me to review a manuscript… only to be removed as reviewer today. When I complained, the response was: β€œOur offices are closed December 24-January 1.”
Thank you @plos.org Computational Biology.

30.12.2025 14:02 β€” πŸ‘ 24    πŸ” 0    πŸ’¬ 4    πŸ“Œ 0

Hey you just broke too many grant proposals.

26.12.2025 12:23 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"..based on a common wavefront design that can be adapted to support a variety of dynamic programming algorithms: local, global, and semi-global alignment of genomic and protein sequences with a variety of commonly used scoring schemes" from
@martinsteinegger.bsky.social andco

20.12.2025 11:05 β€” πŸ‘ 14    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

New preprint🚨
Imagine (re)designing a protein via inverse folding. AF2 predicts the designed sequence to a structure with pLDDT 94 & you get 1.8 Γ… RMSD to the input. Perfect design?
What if I told u that the structure has 4 solvent-exposed Trp and 3 Pro where a Gly should be?

Why to be waryπŸ§΅πŸ‘‡

16.12.2025 15:15 β€” πŸ‘ 58    πŸ” 22    πŸ’¬ 4    πŸ“Œ 1
Release Heading into the sunset Β· tseemann/prokka The future This is probably the last release of Prokka. I won't be making any code changes except bug fixes. I will update the databases occasionally. I strongly recommend you use Bakta by @oschwen...

πŸ’Ύ Prokka 1.15.6 is released!

This is the last major release of Prokka. But don't be sad, because @oschwengers.bsky.social already has an excellent replacement called Bakta you can migrate to.
#bioinformatics #microbiology #genomics

github.com/tseemann/pro...

15.12.2025 21:09 β€” πŸ‘ 117    πŸ” 60    πŸ’¬ 3    πŸ“Œ 2

Congratulations, Julia Mahamid!

Julia Mahamid receives the @dfg.de Leibniz Prize for her work on structural cell biology!

#LeibnizPreis

11.12.2025 10:12 β€” πŸ‘ 102    πŸ” 20    πŸ’¬ 1    πŸ“Œ 3