Claus Wilke on Alphafold and the problem of protein folding in 2025
13.07.2025 20:41 β π 18 π 7 π¬ 0 π 0
Go 1.25 interactive tour
Fake clock, new GC, flight recorder and more.
Go 1.25 interactive tour
Go 1.25 is scheduled for release in August, so it's a good time to explore what's new.
#golang
antonz.org/go-1-25/
28.06.2025 03:15 β π 5 π 2 π¬ 0 π 0
Excited to launch our AlphaGenome API goo.gle/3ZPUeFX along with the preprint goo.gle/45AkUyc describing and evaluating our latest DNA sequence model powering the API. Looking forward to seeing how scientists use it! @googledeepmind
25.06.2025 14:29 β π 217 π 82 π¬ 5 π 10
A general substitution matrix for structural phylogenetics.
Abstract. Sequence-based maximum likelihood (ML) phylogenetics is a widely used method for inferring evolutionary relationships, which has illuminated the
New paper from the lab from Sriram Garg in my group. We introduce a general substitution matrix for structural phylogenetics. I think this is a big deal, so read on below if you think deep history is important. academic.oup.com/mbe/advance-...
11.06.2025 14:01 β π 93 π 52 π¬ 3 π 2
Powerful stuff from @juliosaezrod.bsky.social who found himself on the other end of the process - as a patient not a computational biology researcher - giving him insight into both research and patient perspectives. Huge credit to Julio for talking about his experiences here
20.06.2025 08:06 β π 30 π 10 π¬ 0 π 0
Michael Ashburner FRS was an influential figure in the fields of Drosophila genomics and early sequencing database initiatives such as @ebi.embl.org.
Read about their contributions across genetics and bioinformatics in the new biographical memoir: buff.ly/f01zNat
@geneticscam.bsky.socialβ¬
17.06.2025 10:44 β π 29 π 19 π¬ 3 π 0
Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment.
arxiv.org/abs/2506.12986
17.06.2025 01:48 β π 108 π 54 π¬ 0 π 1
Probabilistic Data Structures in Go: Building and Benchmarking a Bloom Filter
#golang
dev.to/umangsinha1...
14.06.2025 04:29 β π 2 π 1 π¬ 1 π 0
Figure 3. Overview of data sources for RNA modification detection model development
"Investigating RNA Dynamics from Single Molecule Transcriptomes"
by Sarath Chandra Janga & colleagues
"In this review, we examine . . . isoform detection, poly(A) tail length quantification, and mapping of RNA modifications."
FREE till July 24th with this link:
authors.elsevier.com/a/1lC%7EfcQb...
05.06.2025 13:37 β π 3 π 1 π¬ 0 π 0
Bit-Reproducible Phylogenetic Tree Inference under Varying Core-Counts via Reproducible Parallel Reduction Operators
Motivation: Phylogenetic trees describe the evolutionary history among biological species based on their genomic data. Maximum Likelihood (ML) based phylogenetic inference tools search for the tree and evolutionary model that best explain the observed genomic data. Given the independence of likelihood score calculations between different genomic sites, parallel computation is commonly deployed. This is followed by a parallel summation over the per-site scores to obtain the overall likelihood score of the tree. However, basic arithmetic operations on IEEE 754 floating-point numbers, such as addition and multiplication, inherently introduce rounding errors. Consequently, the order by which floating-point operations are executed affects the exact resulting likelihood value since these operations are not associative. Moreover, parallel reduction algorithms in numerical codes re-associate operations as a function of the core count and cluster network topology, inducing different round-off errors. These low-level deviations can cause heuristic searches to diverge and induce high-level result discrepancies (e.g., yield topologically distinct phylogenies). This effect has also been observed in multiple scientific fields, beyond phylogenetics. Results: We observe that varying the degree of parallelism results in diverging phylogenetic tree searches (high level results) for over 31 % out of 10 130 empirical datasets. More importantly, 8 % of these diverging datasets yield trees that are statistically significantly worse than the best known ML tree for the dataset (AU-test, p < 0.05). To alleviate this, we develop a variant of the widely used phylogenetic inference tool RAxML-NG, which does yield bit-reproducible results under varying core-counts, with a slowdown of only 0 to 12.7 % (median 0.8 %) on up to 768 cores. We further introduce the ReproRed reduction algorithm, which yields bit-identical results under varying core-counts, by maintaining a fixed operation order that is independent of the communication pattern. ReproRed is thus applicable to all associative reduction operations β in contrast to competitors, which are confined to summation. Our ReproRed reduction algorithm only exchanges the theoretical minimum number of messages, overlaps communication with computation, and utilizes fast base-cases for local reductions. ReproRed is able to all-reduce (via a subsequent broadcast) 4.1 Β· 106 operands across 48 to 768 cores in 19.7 to 48.61 Β΅s, thereby exhibiting a slowdown of 13 to 93 % over a non-reproducible all-reduce algorithm. ReproRed outperforms the state-of-the-art reproducible all-reduction algorithm ReproBLAS (offers summation only) beyond 10 000 elements per core. In summary, we re-assess non-reproducibility in parallel phylogenetic inference, present the first bit-reproducible parallel phylogenetic inference tool, as well as introduce a general algorithm and open-source code for conducting reproducible associative parallel reduction operations. ### Competing Interest Statement The authors have declared no competing interest. European Research Council, https://ror.org/0472cxd90, 882500 European Union, https://ror.org/019w4f821, 101087081
Check out our new preprint on reproducible parallel phylogenetic inference under varying core counts - it also includes a generic method for reproducible parallel associative reduction operations www.biorxiv.org/content/10.1...
05.06.2025 15:14 β π 9 π 4 π¬ 0 π 0
Short reads often miss complex isoform dynamics. A new approach published in @natbiotech.nature.com, miniQuant improves quantification by leveraging complementary strengths of long reads and short reads. #LongReads #LongReadTranscriptomics
05.06.2025 13:03 β π 3 π 2 π¬ 0 π 0
Industry friends, now is the time for MUCH more speaking out on behalf of academic colleagues under duress. Here are core open source methods that many of your products doubtlessly depend on either directly or indirectly (see en.wikipedia.org/wiki/HMMER) being abruptly defunded. Make noise.
29.05.2025 14:39 β π 76 π 50 π¬ 1 π 0
Genuinely... quite excited about this. I think I might have to install it just for old skool x new school kicks.
(plus - great for controlled cloud environments!)
27.05.2025 22:00 β π 40 π 15 π¬ 0 π 0
Conservation of regulatory elements with highly diverged sequences across large evolutionary distances
Nature Genetics - Combining functional genomic data from mouse and chicken with a synteny-based strategy identifies positionally conserved cis-regulatory elements in the absence of direct sequence...
How to find Evolutionary Conserved Enhancers in 2025? π£-π
Check out our paper - fresh off the press!!!
We find widespread functional conservation of enhancers in absence of sequence homology
Including: a bioinformatic tool to map sequence-diverged enhancers!
rdcu.be/enVDN
github.com/tobiaszehnde...
27.05.2025 12:19 β π 241 π 109 π¬ 7 π 9
Oxford Nanopore Tech Update LC2025 highlights
My highlights from the LC2025 announcements
Oxford Nanopore Tech Update LC2025. My full analysis of what this means for NGS and Multi-Omics, including the new Proteomics PoC. open.substack.com/pub/albertvi...
27.05.2025 06:45 β π 13 π 6 π¬ 0 π 0
Estimation of substitution and indel rates via k-mer statistics https://www.biorxiv.org/content/10.1101/2025.05.14.653858v1
18.05.2025 04:48 β π 10 π 7 π¬ 0 π 1
End-to-end simulation of nanopore sequencing signals with feed-forward transformers https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btae744/7930676 π§¬π₯οΈπ§ͺ https://github.com/ZKI-PH-ImageAnalysis/seq2squiggle
24.12.2024 08:21 β π 9 π 5 π¬ 0 π 1
This time-lapse captures 17 hours of axonal growth from a chicken dorsal root ganglion explant, visualized through the actin cytoskeleton using live confocal imaging.
I just submitted this video to the Nikon Small World in Motion competition. Today is the last day to upload yours! π
π§ͺ
30.04.2025 07:16 β π 1432 π 225 π¬ 68 π 26
Research scientist at @GoogleDeepMind passionate about AI, genomics and biology.
Population and evolutionary genetics @UCDavis. Posts, grammar, & spelling are my views only. He/him. #OA popgen book https://github.com/cooplab/popgen-notes/releases
http://theverge.com covers life in the future.
A bioinformatics postdoc at Victoria Popic's lab at Broad Institute.
Researcher in @BonsaiSeqBioinfo
Lille, France. Bioinformatics, data-structures for DNA/RNA.
https://x.com/golangch with over 30k followers
Go Development and AI Consulting: https://altafino.com
Clinical Microbiology #Bioinformatics @ Karolinska Hospital | PhD alumn http://pharmb.io | Systems thinking, Reproducibility, #SciWorkflows scipipe.org | Exploring ideas in genomics algorithms & more.
Married to Emebet. Dad. Sinner saved by grace.
Colorado RNA biologist & tRNA enthusiast exploring the wild frontiers of nanopore direct RNA sequencing at the intrepid venn diagram of northern blots & machine learning.
Mathematician, data privacy consultant
See also @johndcook.mathstodon.xyz.ap.brid.gy
Long-TREC: The Long-Reads Transcriptomics European Consortium. Next-generation transcriptome biology revealed by single-molecule sequencing technologies
An expedition to study coastal ecosystems and their response to the environment, from molecules to communities
Automatically tweets new posts from http://statmodeling.stat.columbia.edu
Please respond in the comment section of the blog.
Old posts spool at https://twitter.com/StatRetro
We accelerate research in the sciences and humanities with machine learning; part of the @ml4science.bsky.social cluster at @unituebingen.bsky.social
Web: mlcolab.org
Member profiles: @sethaxen.com @vstaros.bsky.social
Exploring the frontiers of science and math. Illuminating humanityβs quest to understand the universe. Capturing the wonder of discovery. For readers who want to know more.
Visit our website for more: www.quantabooks.org
Illuminating math and science. Supported by the Simons Foundation. 2022 Pulitzer Prize in Explanatory Reporting. www.quantamagazine.org