
Milot Mirdita

@milot.bsky.social

Open source #bioinformatics at Sungkyunkwan University 🇰🇷 | former Steinegger Lab @ SNU, Söding Lab @ MPI-NAT | http://mstdn.science/@milotmirdita

2,422 Followers  |  804 Following  |  62 Posts  |  Joined: 11.07.2023

Latest posts by milot.bsky.social on Bluesky


Introducing The Structural History of Eukarya (SHE): The first proteome-scale phylogeny constructed entirely from 3D structure.
We computed 300 trillion alignments across 1,542 species to map the tree of life. 🧵👇 (1/5)

07.02.2026 08:50 - 👍 83    🔁 40    💬 2    📌 0
Compbio Asia

Please spread the word:

We invite applications to a two-week Computational Biology workshop in Singapore, June 14-27.

This NSF-funded workshop brings together 16-20 US grad students with international peers.
Apply by March 21: compbioasia.net
🧵 Details below:

05.02.2026 17:22 - 👍 1    🔁 9    💬 2    📌 0

Distance-Restraint-Guided Diffusion Models for Sampling Protein Conformational Changes and Ligand Dissociation Pathways
Tatsuki Hori, Yoshitaka Moriwaki, Ryuichiro Ishitani
www.biorxiv.org/content/10.6...
Our new preprint is out.

02.02.2026 07:52 - 👍 6    🔁 2    💬 0    📌 0
Multiple protein structure alignment at scale with FoldMason
Protein structure is conserved beyond sequence, making multiple structural alignment (MSTA) essential for analyzing distantly related proteins. Computational prediction methods have vastly extended ou...

FoldMason is out now in @science.org. It generates accurate multiple structure alignments for thousands of protein structures in seconds. Great work by Cameron L. M. Gilchrist and @milot.bsky.social.
📄 www.science.org/doi/10.1126/...
🌐 search.foldseek.com/foldmason
💾 github.com/steineggerla...

30.01.2026 06:11 - 👍 297    🔁 147    💬 4    📌 3
AmpliPhy improves gene trees by adding homologs without affecting alignments
In phylogenomics, gene tree reconstruction depends on multiple sequence alignment (MSA) and tree inference, and ongoing work continues to improve inference quality. Denser taxon sampling has been associated with improved gene tree inference, suggesting that adding homologs could be a practical route to higher accuracy as sequence databases continue to expand. However, adding sequences can influence multiple steps of typical inference pipelines, and little is known on its specific effect on the multiple sequence alignment, tree reconstruction, and rooting steps. We performed a large-scale empirical benchmark to quantify how homolog enrichment affects alignment and phylogenetic inference. Using an enrichment-impoverishment design and a measure of tree accuracy based on taxonomic congruence, we found that enrichment consistently improves tree inference quality, while effects on alignment quality are marginal. We show that this improvement is associated with accurate root placement on enriched trees when sensitive homolog search is accompanied. Notably, much of the benefit can be retained with relatively compact alignments produced by sequence addition. Building on these observations, we provide a tool, AmpliPhy, which efficiently improves phylogenetic reconstruction of protein families through homolog enrichment. The AmpliPhy open-source pipeline software is available at https://github.com/DessimozLab/ampliphy.
Competing Interest Statement: The authors have declared no competing interest.
Swiss National Science Foundation, https://ror.org/00yjd3n13, 216623, 10005715

Can ever-increasing sequence databases improve phylogenetic reconstruction of a gene family? Our new preprint introduces AmpliPhy, a pipeline that automates homolog enrichment to improve gene tree inference, built on a robust phylogenomic benchmark scheme. 🧵1/n
📃 doi.org/10.64898/2026.01.26.701724

28.01.2026 06:10 - 👍 25    🔁 14    💬 1    📌 0

Milot’s venture into establishing his own lab is incredibly exciting. I highly recommend joining Milot on his mission to advance molecular biology through open-source bioinformatics.

21.01.2026 03:37 - 👍 36    🔁 3    💬 0    📌 0
Mirdita Lab - Laboratory for Computational Biology & Molecular Machine Learning
Mirdita Lab builds scalable bioinformatics methods.

My time in @martinsteinegger.bsky.social's group is ending, but I’m staying in Korea to build a lab at Sungkyunkwan University School of Medicine. If you or someone you know is interested in molecular machine learning and open-source bioinformatics, please reach out. I am hiring!
mirdita.org

20.01.2026 11:07 - 👍 105    🔁 54    💬 7    📌 1
In remembrance of Peer Bork | EMBL
EMBL and its community are deeply saddened by the death of Peer Bork, the organisation’s Interim Director General.

This is very sad news

'It is with great sadness that EMBL announces that Interim Director General Professor Peer Bork passed away from natural causes on 16 January 2026.'

www.embl.org/news/embl-an...

16.01.2026 18:06 - 👍 30    🔁 10    💬 3    📌 2

Phold's manuscript is now available @narjournal.bsky.social thanks to @susiegriggo.bsky.social @npbhavya.bsky.social @vijinim.bsky.social @linsalrob.bsky.social @martinsteinegger.bsky.social @milot.bsky.social @eunbelivable.bsky.social & others not on bsky #phagesky academic.oup.com/nar/article/...

14.01.2026 05:10 - 👍 82    🔁 44    💬 1    📌 1

Happy to share that our work on HLp, a bacterial histone from Leptospira perolatii, is now published in Nature Communications 🎉

In this study, we show that HLp forms stable tetramers that wrap ~60 bp of DNA, revealing a distinct histone–DNA organization in bacteria.

www.nature.com/articles/s41...

13.12.2025 08:09 - 👍 54    🔁 16    💬 2    📌 1
PDBe: enhanced structural data exploration to facilitate discovery
Abstract. Protein Data Bank in Europe (PDBe) is a founding member of the worldwide Protein Data Bank (wwPDB), delivering open access to experimentally dete

From Sameer Velankar & colleagues in @narjournal.bsky.social #NARDatabaseIssue | PDBe: enhanced structural data exploration to facilitate discovery | #Bioinformatics #Database #OpenScience #Proteomics #PDB 🧬 🖥️🧪🔓
⬇️
academic.oup.com/nar/advance-...

11.12.2025 15:13 - 👍 7    🔁 3    💬 0    📌 0

Today marks one year since the Dec. 3, 2024 martial law declaration that rocked South Korea and still reverberates today. What’s on my mind today is the grit of South Koreans who rushed to the National Assembly that night, in freezing weather, to demand a return to democratic government.

03.12.2025 03:09 - 👍 2169    🔁 549    💬 24    📌 21

We are deeply saddened to learn of the passing of Amos Bairoch. His vision and leadership helped build the foundations of today’s bioinformatics community. From the creation of essential biological databases to decades of mentorship, his influence can be felt across research groups worldwide.

02.12.2025 17:00 - 👍 4    🔁 2    💬 1    📌 1

LoL-align: sensitive and fast probabilistic protein structure alignment https://www.biorxiv.org/content/10.1101/2025.11.24.690091v1

26.11.2025 02:46 - 👍 12    🔁 7    💬 0    📌 0
AlphaFold Protein Structure Database 2025: a redesigned interface and updated structural coverage
Abstract. The AlphaFold Protein Structure Database (AFDB; https://alphafold.ebi.ac.uk), developed by EMBL–EBI and Google DeepMind, provides open access to

From Sameer Velankar & colleagues in @narjournal.bsky.social #NARDatabaseIssue | #AlphaFold #Protein #Structure #Database 2025: a redesigned interface and updated structural coverage | #Bioinformatics #Proteomics #OpenScience #AFDB 🧪🔓 CC/ @ebi.embl.org
⬇️
academic.oup.com/nar/advance-...

24.11.2025 00:56 - 👍 30    🔁 14    💬 0    📌 0

A few py2Dmol updates 🧬

py2dmol.solab.org
Integration with AlphaFoldDB (will auto fetch results). Drag and drop results from AF3-server or ColabFold for interactive experience! (1/4)

19.11.2025 08:15 - 👍 103    🔁 31    💬 1    📌 0

Congrats Spyro!

15.11.2025 07:10 - 👍 1    🔁 0    💬 1    📌 0

Guess the news is officially out! Extremely excited to announce that I will be starting my own laboratory at Institut Pasteur @pasteur.fr this coming spring!

Slight change to my office window view from Tokyo Tower🗼 to the Tour Eiffel. 🇫🇷

15.11.2025 06:42 - 👍 110    🔁 11    💬 28    📌 0

I want to spell this out in case the implications aren't clear:

This means all public tools/webapps of GISAID data (all the ones you've been used to seeing thru the pandemic, as far as we can tell) are prohibited.

The file allowed this. Cut that - cut off all tools the public & others were using.

07.11.2025 14:41 - 👍 258    🔁 136    💬 2    📌 8

OpenFold3-preview (OF3p) is out: a sneak peek of our AF3-based structure prediction model. Our aim for OF3 is full AF3-parity for every modality. We now believe we have a clear path towards this goal and are releasing OF3p to enable building in the OF3 ecosystem. More👇

28.10.2025 18:30 - 👍 125    🔁 42    💬 1    📌 3
GitHub - bbuchfink/diamond: Accelerated BLAST compatible local sequence aligner.

DIAMOND v2.1.15 now supports all taxonomy features for BLAST databases, and support for using BLAST databases has also been added to the Bioconda version github.com/bbuchfink/di...

28.10.2025 16:45 - 👍 16    🔁 5    💬 0    📌 0
Predicting protein complexes in biosynthetic gene clusters
Biosynthetic gene clusters (BGCs) are contiguous genomic regions that encode diverse, non-homologous proteins required for the production of specific natural products. Their genetic diversity underlie...

Our new preprint is out. Our group performed a comprehensive protein–protein complex prediction within 2,437 biosynthetic gene clusters. We predicted a total of 487,828 complexes for known BGCs, identifying 15,438 heteromeric interactions with an ipTM ≥ 0.6. (2/3)
www.biorxiv.org/content/10.1...

28.10.2025 05:58 - 👍 25    🔁 5    💬 1    📌 2

Working on the protein-hunter-chai google colab notebook. 😈

@yehlincho.bsky.social

28.10.2025 03:34 - 👍 33    🔁 5    💬 0    📌 0

Excited to release BoltzGen which brings SOTA folding performance to binder design! The best part of this project is collaborating with a broad network of leading wetlabs that test BoltzGen at an unprecedented scale, showing success on many novel targets and pushing the model to its limits!

26.10.2025 22:40 - 👍 103    🔁 41    💬 3    📌 5

We train machine learning models on millions of proteins. But when it comes to making predictions, do we need them to understand all proteins at once? Often, we need an accurate model for the specific protein we are studying or designing. We address this with ProteinTTT arxiv.org/abs/2411.02109 1/🧵

23.10.2025 13:08 - 👍 68    🔁 25    💬 2    📌 0

End-to-end protein design in the browser through evedesign. Generate and interactively explore designs in 2D/3D and export them as codon-optimized DNA. The underlying open source framework (released soon) is built to easily add new methods, more on that soon.
🌐 evedesign.bio

22.10.2025 14:30 - 👍 93    🔁 29    💬 2    📌 1

Announcing our new protein design server evedesign.bio:
• End-to-end protein design for everyone!
• Analyze your generated library interactively and on 3D structures
• Export codon-optimized DNA sequences for experimental testing.

22.10.2025 14:17 - 👍 7    🔁 2    💬 1    📌 0

We (@sobuelow.bsky.social) developed AF-CALVADOS to integrate AlphaFold and CALVADOS to simulate flexible multidomain proteins at scale

See preprint for:
- Ensembles of >12000 full-length human proteins
- Analysis of IDRs in >1500 TFs

📜 doi.org/10.1101/2025...
💾 github.com/KULL-Centre/...

20.10.2025 11:26 - 👍 92    🔁 37    💬 1    📌 1

Does it have any Foldseek hits? Just search against all databases in the webserver (with the profile/iterative search ideally).

17.10.2025 08:46 - 👍 0    🔁 0    💬 1    📌 0

One man's trash is another's treasure :)

17.10.2025 08:02 - 👍 0    🔁 0    💬 1    📌 0
