James Lingford's Avatar

James Lingford

@jameslingford.bsky.social

PhD student in structural biology with @greening.bsky.social and @knottrna.bsky.social at Monash Uni. (he/him) Interested in hydrogenases, evolution, protein design. πŸ’» https://www.jameslingford.com/

162 Followers  |  231 Following  |  60 Posts  |  Joined: 10.01.2025  |  2.0886

Latest posts by jameslingford.bsky.social on Bluesky

Preview
Viro3D: a comprehensive database of virus protein structure predictions | Molecular Systems Biology imageimageViro3D provides proteome-level, high confidence AI-protein structure predictions for >4,400 viruses, allowing mapping of form and function across the human and animal virosphere. Viro3D i...

Viro3D paper is out! We predicted 85,000 protein structures from human & animal viruses. 1/5 🧡

πŸ“‘ Paper doi.org/10.1038/s443...
πŸ”­ Explore virosphere viro3d.cvr.gla.ac.uk

26.09.2025 14:06 β€” πŸ‘ 53    πŸ” 23    πŸ’¬ 1    πŸ“Œ 3

RFdiffusion3 is here: www.biorxiv.org/content/10.1...

19.09.2025 07:56 β€” πŸ‘ 2    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0

Switched from using pandas to @pola.rs and it has been a total life saver. Handling dataframes is way more intuitive with the method chaining syntax. Prettier terminal printing, better error messages, lightning fast, fewer headaches.

18.09.2025 02:21 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Recent advances in the inference of deep viral
evolutionary history journals.asm.org/doi/10.1128/...

A review on structural phylogenetics utility and limitations

02.09.2025 22:35 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Protein Structural Phylogenetics Abstract. Protein structural phylogenetics is an interdisciplinary branch of molecular evolution that (i) uses 3D structural data to trace evolutionary his

Fantastic review on methodological developments in protein structural phylogenetics
academic.oup.com/gbe/article/...

01.09.2025 23:15 β€” πŸ‘ 13    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Preview
Large protein databases reveal structural complementarity and functional locality - Nature Communications Researchers mapped the protein structure landscape, revealing structural complementarity across databases and functional clustering in specific regions. Their web tool helps explore this space, unlock...

Large protein databases reveal structural complementarity and functional locality
www.nature.com/articles/s41...

27.08.2025 07:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

RFdiffusion2 is now live!
github.com/RosettaCommo...

You can now design proteins, and in particular enzymes from just partially defined amino acid side chains, and without defining their sequence position or order!

22.08.2025 00:51 β€” πŸ‘ 39    πŸ” 18    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - lehner-lab/combinatorialcores: Source code for analyses and figure reproduction in "Genetics, energetics, and allostery in proteins with randomized cores and surfaces", Escobedo et. al Scienc... Source code for analyses and figure reproduction in "Genetics, energetics, and allostery in proteins with randomized cores and surfaces", Escobedo et. al Science 2025 - lehner-lab/combina...

This is a remarkable paper on protein evolution from the Lehner lab. What makes it especially cool is that all the code to reproduce all the analysis/plots is included in Jupyter notebooks up on Github
www.science.org/doi/10.1126/...
github.com/lehner-lab/c...

21.08.2025 00:52 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

β€’ a big concern I have with these LLMs is how they might end up changing our attitudes around productivity. The promise is instant results & faster publications. But the cost will end up being garbage results that are only caught when it's too late

13.08.2025 23:44 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Just to piggyback on this critique of LLMs for coding with some of my own critiques:

β€’ it's a bad idea to install a tool on your machine that has vast read+write permissions
β€’ code that works but is really outputting false info is a real danger and hard to catch. LLMs make that danger worse

13.08.2025 23:38 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Does sequence clustering confound AlphaFold2? Predicting multiple conformational states of proteins represents a significant open challenge in structural biology. Increasingly many methods have be…

Does sequence clustering confound AlphaFold2?
www.sciencedirect.com/science/arti... From @hkws.bsky.social

11.08.2025 02:11 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

MMseqs2 v18 is out
- SIMD FW/BW alignment (preprint soon!)
- Sub. Mat. Ξ» calculator by Eric Dawson
- Faster ARM SW by Alexander Nesterovskiy
- MSA-Pairformer’s proximity-based pairing for multimer prediction (www.biorxiv.org/content/10.1...; avail. in ColabFold API)
πŸ’Ύ github.com/soedinglab/M... & 🐍

05.08.2025 08:25 β€” πŸ‘ 62    πŸ” 17    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - chevrettelab/gator-gc Contribute to chevrettelab/gator-gc development by creating an account on GitHub.

Pretty amazing looking tool for analysing genetic neighbourhoods:
github.com/chevrettelab...

23.07.2025 04:37 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ’₯ Excited to introduce Bacformer 🦠 - the first foundation model for bacterial genomics. Bacformer represents genomes as sequences of ordered proteins, learning the β€œgrammar” of how genes are arranged, interact and evolve.

Preprint πŸ“: biorxiv.org/content/10.1...

🧡 1/n

21.07.2025 09:55 β€” πŸ‘ 91    πŸ” 59    πŸ’¬ 3    πŸ“Œ 2

Excited to share our work on WitChi! πŸ› οΈπŸ–₯️
We tested it on the GTDB r220 archaeal supermatrix (5,869 taxa & 10,101 cols) removing 55% of sites in <2h.

The phylogeny showed several interesting groupings with overall improved branch support:
#phylogenetics #ArchaeaSky #MSA #opensource #MEvoSky #MicroSky

20.07.2025 16:58 β€” πŸ‘ 30    πŸ” 11    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - zeqianli/tgv: Explore 5,000+ genomes in the terminal. Light, blazing fast πŸš€, vim-motion. Explore 5,000+ genomes in the terminal. Light, blazing fast πŸš€, vim-motion. - zeqianli/tgv

New tgv release: local cache!

tgv download hg38

Download UCSC reference genomes to a local sqlite db for much faster browsing. Awesome Rust tools (twobit, bigtools) made this simple.

github.com/zeqianli/tgv

19.07.2025 23:37 β€” πŸ‘ 30    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0
Logo for the Sandpiper website

Logo for the Sandpiper website

Out in @natbiotech.nature.com: Metagenome taxonomy profilers usually ignore unknown species. SingleM is an accurate profiler which doesn't, even detecting phyla with no MAGs. Profiles of 700,000 metagenomes at sandpiper.qut.edu.au. A 🧡

16.07.2025 21:59 β€” πŸ‘ 129    πŸ” 71    πŸ’¬ 7    πŸ“Œ 9
Video thumbnail

Got this setup working where I can now run .ipynb notebooks right from inside the terminal with a combination of neovim, quarto, kitty, and this neovim plugin called molten: github.com/benlubas/mol...
Never have to abandon my precious vim setup again

11.07.2025 07:29 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Inhibiting heme piracy by pathogenic Escherichia coli using de novo-designed proteins - Nature Communications Many pathogens encode transporters that extract heme directly from host proteins. In this study, the authors demonstrate the utility of de novo-designed proteins in understanding the mechanism behind ...

Excited to share our latest work using AI-designed proteins to block heme-piracy by E. coli. Published in @natcomms.nature.com. A team effort between my lab and the β€ͺβ€ͺ@knottrna.bsky.social‬ ‬lab, with experimental work led by the talented @danielrfox.bsky.social
www.nature.com/articles/s41...

09.07.2025 23:55 β€” πŸ‘ 40    πŸ” 19    πŸ’¬ 1    πŸ“Œ 3

Been eagerly awaiting this one. Amazing work

07.07.2025 08:45 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We have written up a tutorial on how to run BindCraft, how to prepare your input PDB, how to select hotspots, and various other tips and tricks to get the most out of binder design!

github.com/martinpacesa...

30.06.2025 19:45 β€” πŸ‘ 138    πŸ” 55    πŸ’¬ 4    πŸ“Œ 0
Post image

Some matplotlib work in progress

27.06.2025 07:13 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

1/27 We have a new paper out! Turns out that snowflake yeast have been hiding a secret from us - they've evolved a (very!) crude circulatory system. Not with blood vessels or a heart, but through spontaneous fluid flows powered by their metabolism. πŸ§ͺπŸ”¬

www.science.org/doi/full/10....

24.06.2025 16:52 β€” πŸ‘ 357    πŸ” 146    πŸ’¬ 14    πŸ“Œ 25
Post image

Closer... I think at this point the solution lies in manually making a list of the hex codes, but that's for another day.

24.06.2025 07:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Revisiting this topic now that I've forced myself to use PyMOL. Using this script to install the viridis family of colour palettes: github.com/smsaladi/pym... and running '"spectrum count, palette=magma, MODEL_NAME". The palette is not there still. They must modify the magma palette somehow

23.06.2025 22:55 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

How do we know for sure if we have the best AF prediction? We still need prior knowledge/expectations of what we're trying to predict with AF. And in the absence of that, I guess we can't really know unless we try every possible combination in the search space (which is not feasible).

20.06.2025 22:42 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Seen similar things where the PAEs of a multimer are poor until all the subunits in the correct stoichiometry are provided.

This worries me, because one could have a good prediction like "C", but miss out on the best prediction "D".

20.06.2025 22:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image Post image

Physics-based design of efficient Kemp eliminases

@lynnkamerlin.bsky.social

www.nature.com/articles/s41...

19.06.2025 21:18 β€” πŸ‘ 22    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0
Post image

Learning some Blender molecular nodes from @sarahjpiper.bsky.social @ccemmp-outreach.bsky.social

13.06.2025 04:03 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
A general substitution matrix for structural phylogenetics. Abstract. Sequence-based maximum likelihood (ML) phylogenetics is a widely used method for inferring evolutionary relationships, which has illuminated the

New paper from the lab from Sriram Garg in my group. We introduce a general substitution matrix for structural phylogenetics. I think this is a big deal, so read on below if you think deep history is important. academic.oup.com/mbe/advance-...

11.06.2025 14:01 β€” πŸ‘ 95    πŸ” 52    πŸ’¬ 3    πŸ“Œ 2

@jameslingford is following 20 prominent accounts