's Avatar

@georghochberg.bsky.social

Evolutionary biochemist and protein complex enthusiast at the University of Marburg.

212 Followers  |  76 Following  |  48 Posts  |  Joined: 05.12.2024  |  1.8067

Latest posts by georghochberg.bsky.social on Bluesky

Preview
Origin of chaperone dependence and assembly complexity in Rubisco biogenesis Molecular chaperones assist with the folding and assembly of protein clients. Consequently, they are essential to diverse cellular functions. In most aerobic photosynthetic organisms such as B-cyanoba...

A new preprint from the lab. I won't do a tweetorial until this is peer reviewed, but I think it's a banger, led by
@JedNzy. It's about Rubisco and what chaperones are really for. He's on the market, get him while you can. www.biorxiv.org/content/10.1...

24.09.2025 19:53 β€” πŸ‘ 28    πŸ” 14    πŸ’¬ 0    πŸ“Œ 0
Preview
In situ structure of a bacterial flagellar motor at subnanometre resolution reveals adaptations for increased torque - Nature Microbiology In situ cryoEM and modelling of the Campylobacter jejuni flagellar motor characterize the periplasmic scaffold and the structural basis for increased torque generation.

The Beeby lab's magnum opus on the Campylobacter flagellum is out. This is a uniquely powerful flagellum and how it evolved out of simpler precursors will keep us busy for a while still. Check it out! www.nature.com/articles/s41...

01.07.2025 19:16 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Franziska did a spectacularly successful PhD in my group and to a large extend set up the lab when we first started. I'm extremely happy for her (though not the least bit surprised) that she won these very important awards for her PhD. Watch this one, great things are certain to come.

26.06.2025 14:24 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

That is an alignment scoring matrix, similar to Blosum for amino acids. Alignment and phylogenetic substitution matrices have different purposes (one scores co-occurences within a column, the other the likelihood of substitution across a branch).

12.06.2025 08:21 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This matrix comes with the same simplifying assumptions we make about amino acid evolution, so there is that.
It will probably work for ABC transporters, but shoot me an email if you need help.

11.06.2025 19:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Exactly the opposite. This is an evolutionary model like WAG.

11.06.2025 19:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Oh and Sriram should be a co-corresponding author on this with me, but the journal messed this up after the proofs. I hope this will be corrected when the final version is out. He really drove this project.

11.06.2025 14:01 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Finally, I also want to point out the contribution of other groups in this field, in particular Nick Matzke and Christophe Dessimoz, who developed similar ideas to ours.

11.06.2025 14:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We are very excited about the possibilities this opens up, especially with respect to very deep history. We have number of very exciting things cooking that I’m dying to show to people when they are ready.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We are hoping that this matrix will eventually be integrated into IQtree directly, but for now you can just download it and use it as custom substitution model.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Our paper also discusses what we think this new method is good for, and what it may not be good for, as well as how it differs from other structural phylogenetics methods that other clever people have already developed.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

We then use this new model to revisit the old question about the root using the structures of universal paralogs. Thankfully, the answer is that the root indeed lies where everyone always thought it would. But now we finally have certainty.

11.06.2025 14:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We inferred such a model from 3Di alignments of 1000s of protein families, so that you don't have to. The matrix is available with the paper, please feel free to use it immediately. You can use it easily with IQtree.

11.06.2025 14:01 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The key missing ingredient for this is a substitution model that describes how one 3Di character evolves into another. This is what our paper now provides.

11.06.2025 14:01 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This alphabet has 20 letters, and therefore integrates well with existing tree inference software (to which a 3di alignment looks like a regular old protein alignment). This means we can use this to infer trees.

11.06.2025 14:01 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

The key development is the 3Di structural alphbet, developed by @martinsteinegger.bsky.social et al. 3Di allows a structural 'sequence' to be derived from a protein structure and was developed for remote homology searches.

11.06.2025 14:01 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The big problem was that we needed a model for the evolution of protein structures, such that we could unlock the power of maximum likelihood phylogenetics.

11.06.2025 14:01 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The idea is that instead of sequences, we could use structures to infer phylogenies. This idea is not ours, and it’s not new either. But for a long time there just wasn’t a good way to easily do this.

11.06.2025 14:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Here’s where our paper comes in. The central idea is that structure evolves more slowly than proteins. You can recognize structural homology over much greater phylogenetic distances than sequence homology.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This had long raised suspicions that the root may be wrong, or at least not certain, because our phylogenetic models struggle to find signal in these very diverged sequences. Other important problems, like the origin of photosynthesis suffer from the same issue.

11.06.2025 14:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

There are a handful of such paralogs and that’s where the inference about the root of the tree of life came from. But it’s a messy business. The sequences of these universal paralogs have in most cases diverged almost beyond recognition.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The answer is so-called universal paralogs. These are genes that duplicated prior to LUCA, such that organisms now usually have two copies of them. If we throw both paralogs on the same tree, they root each other.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

You might not know how this was actually decided. There is no outgroup for life, so how was that tree even rooted?

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is common and affects things you might think we know for sure. For example, I am sure you have seen the three (now two-) domain tree of life, that places the root (and therefore LUCA) on the branch between bacteria and Archaea.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

For example, you might be interested in the relationship between two proteins that you suspect are related. But their lineages separated so long ago that their amino acid sequences no longer have any recognizable sequence similarity.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is especially relevant the deeper you go. Phylogenetics operates on synapomorphies – shared states of some character that indicate common descent. This signal can disappear simply too much evolution has occurred.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Such trees are hard to infer for many reasons. But one big problem is that protein sequences sometimes have just accumulated too many changes to still contain enough information to resolve the things we care about.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Deep phylogenetics is important. What genes did LUCA have? Where do eukaryotes come from? How did the major metabolic pathways evolve? All questions whose answers depend on phylogenetic trees that cover vast time-scales.

11.06.2025 14:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
A general substitution matrix for structural phylogenetics. Abstract. Sequence-based maximum likelihood (ML) phylogenetics is a widely used method for inferring evolutionary relationships, which has illuminated the

New paper from the lab from Sriram Garg in my group. We introduce a general substitution matrix for structural phylogenetics. I think this is a big deal, so read on below if you think deep history is important. academic.oup.com/mbe/advance-...

11.06.2025 14:01 β€” πŸ‘ 96    πŸ” 52    πŸ’¬ 3    πŸ“Œ 2

busy day!

30.05.2025 07:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@georghochberg is following 20 prominent accounts