Steven Robbins's Avatar

Steven Robbins

@stevenjrobbins.bsky.social

Do my science @ace_uq studying coral reef microbiomes. Data wrangler, meta-omics and long-read wonk, clean energy enthusiast, Saganist zealot, collector of weird zoology facts, other nonsense.

3,435 Followers  |  496 Following  |  767 Posts  |  Joined: 20.11.2023  |  2.2979

Latest posts by stevenjrobbins.bsky.social on Bluesky

I get in a bit of a huff at the vocal experimentalist crowd that seem to want to discount genomics studies as a blanket cultural norm without individual consideration.

10.10.2025 16:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Like, try to tell me a whole class of bacteria all maintain that amount of co-localized machinery without ever using it. Itโ€™s possible, but incredibly unlikely. And in fact, SIP studies on DOM degradation were done after this that showed the same taxa we proposed to degrade DOM.

10.10.2025 16:42 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

But I agree, everything becomes much more interesting/defensible when you focus on genomes rather than individual genes. At least then I can say โ€œthis group is enriched in whole classes of CAZymes, co-localized on the genome in PULs, with transporters for the degrading products, etc.โ€

10.10.2025 16:27 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Which is a baffling thought process to me. Gene presence and phylogeny are like 99.9% correlated. Itโ€™s like, โ€œis the organism selected because of its genes or are the genes enriched because the organism is?โ€ To which I think, whatโ€™s the difference?

10.10.2025 16:23 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Actually though, the genes vs organisms thing has been brought up by reviewers of multiple papers in the form of โ€œhow do you know those genes arenโ€™t enriched simply because those organisms are enriched? Is it really because the genes are important?โ€

10.10.2025 16:20 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

But I agree. I think at its core, I just think a lot of genomics is treaty superficially and without a starting goal. We try to think of these things like in-silico experiments with expected outcomes based on current theory, rather than โ€œI wonder what gene go up/down?โ€

10.10.2025 16:17 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Thanks man! Iโ€™m actually proud of this one specifically. We got desk rejected 3 times from glam journals with a perfunctory โ€œitโ€™s only genomics, we donโ€™t careโ€ from editors not in the field. Our ISME reviewers in the field had things nice enough to say I wanted to frame the comments.

10.10.2025 16:14 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Finishing a complete giraffe genome from telomere to telomere with Verkko-Fillet High-quality genome assemblies are essential for understanding speciation, evolution, and for building reference genomes and pan-genomes. Despite major advances in long-read sequencing technologies an...

โ€œFinishing a complete giraffe genome from telomere to telomere with Verkko-Filletโ€ walks you through the process of finishing a complete T2T genome using a new Python toolkit developed by Juhyun Kim that makes Verkko output more accessible... www.biorxiv.org/content/10.1...

10.10.2025 15:26 โ€” ๐Ÿ‘ 7    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Wonโ€™t get any complaints from me on that. Lots of people out there with sequence data making stretched claims based on cursory analysis of a few isolated genes.

10.10.2025 15:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This was a great read. I feel similarly.

09.10.2025 16:52 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Also throws away a lot of potential to identify new functions in organism with atypical pathways because weโ€™re relying on whatโ€™s been seen before in lab rat microbes, but I donโ€™t think pure genomics should typically be the domain of proposing new pathways anyway. Leave that for wet lab experiments.

09.10.2025 16:01 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Then if you go to the KEGG page for each KO, it lists experimental evidence for function.

Still room for error, proteins could still have promiscuous function, but at least if we require that most pathways/module genes are present, with key genes, narrows down to a much more trustworthy set.

09.10.2025 15:57 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Thatโ€™s actually a great question. Itโ€™s why we required the โ€œkeyโ€ genes, and >70% of the pathway to be present, because many genes belong to multiple pathways. But for example, carbon fixation pathways tend to have key genes that arenโ€™t part of other pathways,so we required those to be present.

09.10.2025 15:53 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Accepted version of our article "Half of microbial eukaryote literature focuses on only twelve human parasites" now out in ISMEJ!

Awesome effort led by undergrad Joanna Lepper and with @hbrappap.bsky.social.

08.10.2025 16:49 โ€” ๐Ÿ‘ 23    ๐Ÿ” 9    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Perfect, thanks Florian! Should have just assumed Anvio would have something. Such a Swiss Army knife these days.

08.10.2025 15:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

hear*

07.10.2025 16:36 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Unfortunately, the power of EnrichM was the consideration of KOs into modules/pathways and doing the stats on the different DBs with standard outputs. Really wish it got published but Joel got an industry job. Maybe one can hack the backend DBs, but my guess is the formats have changed.

07.10.2025 16:01 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Thanks @jcamthrash.bsky.social and @acritschristoph.bsky.social! DRAM I know of, though it's slowwww from what I here--default to single threads for things that could be multi-threaded. Not sure if that's been fixed. There's also Metacerberus for faster annotation.

07.10.2025 15:59 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

I think that's why KEGG is so powerful, it ties its genes into machine-readable functional units so you're not looking at them in isolation. My opinion, anyway.

07.10.2025 15:56 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
A genomic view of the microbiome of coral reef demosponges AbstractSponges underpin the productivity of coral reefs, yet few of their microbial symbionts have been functionally characterised. Here we present an ana

Like here our criteria was that >70% of genes in a an enriched pathway had to be present, including the "key" genes required for proper functioning. One enriched gene says little, but we try to make it as unlikely that we're focusing on statistical accidents.

academic.oup.com/ismej/articl...

07.10.2025 15:55 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Yeah, EnrichM just had something like a Bonferoni correction applied to the p-values. And like any gene/based analysis like that, Iโ€™d only consider it valid if, say, >50% of genes in a module or pathway were enriched. We donโ€™t come up with big stories based on a single gene.

06.10.2025 20:24 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

In EnrichM, you could supply two sets of MAGs, it would annotate with KEGG/PFAM/CAZY, and give outputs showing which KOs and KEGG modules, PFAMS, and CAZY groups were statistically enriched in one set over another, giving a p-value for each KO/PFAM/Module and fold enrichment between the two groups.

06.10.2025 18:19 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Curious, if someone wanted to annotate a large set of MAGs with KEGG, PFAM, CAZY, and then do some statistical comparison, how would you do that these days?

Back in the day I loved EnrichM, but the backend databases for KEGG, etc, are woefully out of date now.

06.10.2025 17:18 โ€” ๐Ÿ‘ 9    ๐Ÿ” 2    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 0

To fill in a bit, the reason I think The Expanse is the GOAT is because itโ€™s got an epic story arc, super interesting world building, but itโ€™s not dense, has great character development. I get exhausted by sci-fi thatโ€™s trying to look smart with the sci but donโ€™t have characters I truly care about.

05.10.2025 18:34 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Bringing the uncultivated microbial majority of freshwater ecosystems into culture - Nature Communications A large fraction of aquatic bacteria remains uncultured. Here, the authors cultivated 627 strains of abundant freshwater bacteria from 14 European lakes, thus generating a collection that includes many previously uncultured, oligotrophic bacteria that may serve as model organisms.

โš ๏ธNew paper โžก๏ธ isolating >600 freshwater microbes using dilution-to-extinction and low-nutrient, lake-mimicking media.

๐ŸคฏExpands cultured diversity to ~70 % of detected genera and enables physiological studies of streamlined oligotrophs.

๐Ÿ”— www.nature.com/articles/s41...
#protistsonsky

05.10.2025 11:26 โ€” ๐Ÿ‘ 65    ๐Ÿ” 25    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

Team, I need a new book/series to read. Iโ€™m partial to Sci-Fi or fantasy, points for anything that โ€œchanged your lifeโ€ or got you super reved.

Books/authors I love:
The Expanse series (personal GOAT)
The Bobiverse
Anything Kurt Vonnegut or Neil Gaiman

Already simul-reading 4 non-fiction.

05.10.2025 18:14 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 0
Preview
Giant DNA viruses encode a hallmark translation initiation complex of eukaryotic life In contrast to living organisms, viruses were long thought to lack protein synthesis machinery and instead depend on host factors to translate viral transcripts. Here, we discover that giant DNA virus...

Historically, viruses were thought to primarily use host cell's translational machinery. New work from @harvardcellbio.bsky.social faculty Amy Lee reveals that a giant DNA virus encodes its own IF4F initiation complex, suggesting an unexpected evolutionary innovation. www.biorxiv.org/content/10.1...

02.10.2025 18:10 โ€” ๐Ÿ‘ 93    ๐Ÿ” 39    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 2
Post image Post image

Systematic comparative benchmarking of computational methods for the detection of transposable elements in long-read sequencing data www.biorxiv.org/content/10.1... ๐Ÿงฌ๐Ÿ–ฅ๏ธ๐Ÿงช

02.10.2025 15:55 โ€” ๐Ÿ‘ 10    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
Efficient sequence alignment against millions of prokaryotic genomes with LexicMap - Nature Biotechnology LexicMap uses a fixed set of probes to efficiently query gene sequences for fast and low-memory alignment.

Interested in aligning your long sequences or small genomes against huge reference databases containing millions of prokaryotic genomes ? A new tool has been released that can do this efficiently - LexicMap - www.nature.com/articles/s41...

01.10.2025 18:47 โ€” ๐Ÿ‘ 13    ๐Ÿ” 8    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Theyโ€™re coming yโ€™allโ€ฆthe megaplasmids are coming

(I say this in a slightly frustrated manner as someone who has been thoroughly unsuccessful in getting funding to study megaplasmids for the better part of 15 years)

01.10.2025 16:50 โ€” ๐Ÿ‘ 42    ๐Ÿ” 5    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 0

@stevenjrobbins is following 20 prominent accounts