Franziska did a spectacularly successful PhD in my group and to a large extend set up the lab when we first started. I'm extremely happy for her (though not the least bit surprised) that she won these very important awards for her PhD. Watch this one, great things are certain to come.
26.06.2025 14:24 β π 5 π 1 π¬ 0 π 0
That is an alignment scoring matrix, similar to Blosum for amino acids. Alignment and phylogenetic substitution matrices have different purposes (one scores co-occurences within a column, the other the likelihood of substitution across a branch).
12.06.2025 08:21 β π 1 π 0 π¬ 1 π 0
This matrix comes with the same simplifying assumptions we make about amino acid evolution, so there is that.
It will probably work for ABC transporters, but shoot me an email if you need help.
11.06.2025 19:38 β π 0 π 0 π¬ 1 π 0
Exactly the opposite. This is an evolutionary model like WAG.
11.06.2025 19:10 β π 0 π 0 π¬ 1 π 0
Oh and Sriram should be a co-corresponding author on this with me, but the journal messed this up after the proofs. I hope this will be corrected when the final version is out. He really drove this project.
11.06.2025 14:01 β π 4 π 0 π¬ 0 π 0
Finally, I also want to point out the contribution of other groups in this field, in particular Nick Matzke and Christophe Dessimoz, who developed similar ideas to ours.
11.06.2025 14:01 β π 1 π 0 π¬ 1 π 0
We are very excited about the possibilities this opens up, especially with respect to very deep history. We have number of very exciting things cooking that Iβm dying to show to people when they are ready.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
We are hoping that this matrix will eventually be integrated into IQtree directly, but for now you can just download it and use it as custom substitution model.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
Our paper also discusses what we think this new method is good for, and what it may not be good for, as well as how it differs from other structural phylogenetics methods that other clever people have already developed.
11.06.2025 14:01 β π 0 π 0 π¬ 2 π 0
We then use this new model to revisit the old question about the root using the structures of universal paralogs. Thankfully, the answer is that the root indeed lies where everyone always thought it would. But now we finally have certainty.
11.06.2025 14:01 β π 1 π 0 π¬ 1 π 0
We inferred such a model from 3Di alignments of 1000s of protein families, so that you don't have to. The matrix is available with the paper, please feel free to use it immediately. You can use it easily with IQtree.
11.06.2025 14:01 β π 2 π 0 π¬ 1 π 0
The key missing ingredient for this is a substitution model that describes how one 3Di character evolves into another. This is what our paper now provides.
11.06.2025 14:01 β π 3 π 0 π¬ 1 π 0
This alphabet has 20 letters, and therefore integrates well with existing tree inference software (to which a 3di alignment looks like a regular old protein alignment). This means we can use this to infer trees.
11.06.2025 14:01 β π 4 π 1 π¬ 1 π 0
The key development is the 3Di structural alphbet, developed by @martinsteinegger.bsky.social et al. 3Di allows a structural 'sequence' to be derived from a protein structure and was developed for remote homology searches.
11.06.2025 14:01 β π 4 π 0 π¬ 1 π 0
The big problem was that we needed a model for the evolution of protein structures, such that we could unlock the power of maximum likelihood phylogenetics.
11.06.2025 14:01 β π 3 π 0 π¬ 1 π 0
The idea is that instead of sequences, we could use structures to infer phylogenies. This idea is not ours, and itβs not new either. But for a long time there just wasnβt a good way to easily do this.
11.06.2025 14:01 β π 1 π 0 π¬ 1 π 0
Hereβs where our paper comes in. The central idea is that structure evolves more slowly than proteins. You can recognize structural homology over much greater phylogenetic distances than sequence homology.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
This had long raised suspicions that the root may be wrong, or at least not certain, because our phylogenetic models struggle to find signal in these very diverged sequences. Other important problems, like the origin of photosynthesis suffer from the same issue.
11.06.2025 14:01 β π 1 π 0 π¬ 1 π 0
There are a handful of such paralogs and thatβs where the inference about the root of the tree of life came from. But itβs a messy business. The sequences of these universal paralogs have in most cases diverged almost beyond recognition.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
The answer is so-called universal paralogs. These are genes that duplicated prior to LUCA, such that organisms now usually have two copies of them. If we throw both paralogs on the same tree, they root each other.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
You might not know how this was actually decided. There is no outgroup for life, so how was that tree even rooted?
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
This is common and affects things you might think we know for sure. For example, I am sure you have seen the three (now two-) domain tree of life, that places the root (and therefore LUCA) on the branch between bacteria and Archaea.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
For example, you might be interested in the relationship between two proteins that you suspect are related. But their lineages separated so long ago that their amino acid sequences no longer have any recognizable sequence similarity.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
This is especially relevant the deeper you go. Phylogenetics operates on synapomorphies β shared states of some character that indicate common descent. This signal can disappear simply too much evolution has occurred.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
Such trees are hard to infer for many reasons. But one big problem is that protein sequences sometimes have just accumulated too many changes to still contain enough information to resolve the things we care about.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
Deep phylogenetics is important. What genes did LUCA have? Where do eukaryotes come from? How did the major metabolic pathways evolve? All questions whose answers depend on phylogenetic trees that cover vast time-scales.
11.06.2025 14:01 β π 0 π 0 π¬ 1 π 0
A general substitution matrix for structural phylogenetics.
Abstract. Sequence-based maximum likelihood (ML) phylogenetics is a widely used method for inferring evolutionary relationships, which has illuminated the
New paper from the lab from Sriram Garg in my group. We introduce a general substitution matrix for structural phylogenetics. I think this is a big deal, so read on below if you think deep history is important. academic.oup.com/mbe/advance-...
11.06.2025 14:01 β π 96 π 52 π¬ 3 π 2
busy day!
30.05.2025 07:15 β π 1 π 0 π¬ 1 π 0
Evolutionary biologist; Lewis-Sigler Scholar @ Princeton. Formerly @ Harvard.
PhD student at the @memprotlabuv.bsky.social - University of Valencia
Especially interested in the biogenesis of viral proteins.
We study social evolution in viruses.
Assistant Professor @zoology.ubc.ca & @ubcbiodiversity.bsky.social
asherleeks.com
Philosopher of science, occasional scientist. Phil of:
ev biol, probability, modeling, stat inference. Book: Evolution and the Machinery of Chance. Activity here as private citizen, not as rep of nor supported by employer. https://marshallalmostsurely.com
Biophysicist | genotype-phenotype maps, fitness landscapes and evolution | Independent Fellow leading a group at @barcelonacollaboratorium.com @crg.eu
PhD student in Structural Biologyπ§βπ¬π
CryoET on mitochondria in photosynthetic organisms π¦
Engel Lab, Biozentrum Basel
PhD student rewiring E. coli for synthetic C1 assimilation
PhD. Bioinformatic. Evolutionary biologist. Kinetoplastid research.
Bioinformatics Enthusiast. I like transposons. PhD student at IMBA, part of Vienna BioCenter
PhD candidate in molecular evolution @UChicago
manyuanlonglab.uchicago.edu
Seeking postdoc opportunities!
Associate professor at ETH Zurich, studying the cellular consequences of genetic variation. Affiliated with the Swiss Institute of Bioinformatics and a part of the LOOP Zurich.
Principal Investigator | Microbial Molecular Genomics Lab | https://www.rousset-lab.com |International Center for Infectiology Research (CIRI) in Lyon π«π· Studying the molecular basis of phage-bacteria interactions.
PI@MPIPZ, Plant Immunity, EvoMPMI, Proteomics, Signal transduction, Phosphorylation
https://www.mpipz.mpg.de/nakagami
https://www.mpipz.mpg.de/MS
Development, evolution, worms, gene name etymology. I log interesting papers, questions, and ideas.
Studying ancient ribozymes. Likes group II introns a little too much. PhD @mutschlerlab.bsky.social
Developing data intensive computational methods β’ PI @ Seoul National University π°π· β’ #FirstGen β’ he/him β’ HauptschΓΌler
Rubisco, protein engineering and photosynthesis at the Cambridge Biochemistry Department.
pryweslab.org
methanotroph ecology and evolution across ecosystems (marine, freshwater, rocks,...)
Senior Scientist in KΓΌsellab in Jena/Germany, she/her
omics-driven and constraint-based modeling of microbial community metabolism 𧬠post-doc in the Sunagawa Lab, Institute of Microbiology, ETH Zürich
A researcher interested in the origin and evolution of life, recently focusing on virus-like elements.
https://sites.google.com/site/sonakagawae/