The workspace publishing has been such a hassle. So glad to see this out
18.09.2025 14:40
@noamteyssier.bsky.social
Bioinformatics Scientist at the Arc Institute. Working at the intersection of functional genomics, systems biology, and network dynamics. I also build rusty bioinformatics tools https://github.com/noamteyssier
Sounds great! Would be very interested in that and happy to help build one
17.09.2025 14:18
bsky.app/profile/noam...
Here was a benchmark I ran a while back comparing twobit and binseq on a single thread
2bit was built for genomes where there are very long contiguous N-blocks. The overhead of managing these blocks on fastq-style records (generally very short, with non-contiguous Ns), though, is massive and most of the time unnecessary.
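To illustrate the point about N-blocks, here is a minimal sketch of plain 2-bit packing for short reads. It is not the on-disk layout of 2bit or binseq; the function names (`encode_base`, `pack_2bit`, `unpack_2bit`) are hypothetical, and the choice to reject any record containing an N instead of tracking N-blocks is an assumption made for illustration.

```rust
/// Map a nucleotide to a 2-bit code. Returns None for N or any other byte.
fn encode_base(b: u8) -> Option<u64> {
    match b {
        b'A' | b'a' => Some(0),
        b'C' | b'c' => Some(1),
        b'G' | b'g' => Some(2),
        b'T' | b't' => Some(3),
        _ => None,
    }
}

/// Pack a sequence into u64 words, 32 bases per word.
/// Assumption for this sketch: records with non-ACGT bytes are rejected,
/// so no per-record N-block bookkeeping is needed.
fn pack_2bit(seq: &[u8]) -> Option<Vec<u64>> {
    let mut words = vec![0u64; (seq.len() + 31) / 32];
    for (i, &b) in seq.iter().enumerate() {
        let code = encode_base(b)?;
        words[i / 32] |= code << (2 * (i % 32));
    }
    Some(words)
}

/// Decode `len` bases back to uppercase ASCII.
fn unpack_2bit(words: &[u64], len: usize) -> Vec<u8> {
    (0..len)
        .map(|i| b"ACGT"[((words[i / 32] >> (2 * (i % 32))) & 0b11) as usize])
        .collect()
}
```

With this scheme a 150 bp read fits in five `u64` words, and the only cost of an N is one early return.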
15.09.2025 17:13
Paraseq 0.4 is out now! With double the throughput for processing paired-end input :)
github.com/noamteyssier...
Added a feature to bqtools yesterday for colored grep output. It also supports colored FASTX output. Already useful this morning as I troubleshoot some sequencing outputs!
04.09.2025 17:56
Excited that the paper presenting our mouse brain in vivo CRISPR screening platform is out today in @natneuro.nature.com!
Great team effort, led by Biswa Ramani and @ivlrose.bsky.social in the Kampmann lab.
www.nature.com/articles/s41...
Preprint alert!
We present K2Rmini, an ultra-fast, grep-like tool that extracts sequences of interest from FASTA/FASTQ files based on their k-mer content.
www.biorxiv.org/content/10.1...
A thread
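As a rough picture of what filtering by k-mer content means, here is a naive sketch: collect the k-mers of the sequences of interest into a set, then count how many k-mers of each record fall in that set. This is not K2Rmini's algorithm; the function names are hypothetical, and the sketch assumes `k > 0` and ignores reverse complements.

```rust
use std::collections::HashSet;

/// Collect every k-mer of the patterns of interest. Assumes k > 0.
fn kmer_set<'a>(patterns: &[&'a [u8]], k: usize) -> HashSet<&'a [u8]> {
    patterns.iter().copied().flat_map(|p| p.windows(k)).collect()
}

/// Count how many k-mers of `seq` occur in the pattern set.
/// A record would be kept when this count passes a user-chosen threshold.
fn count_hits(seq: &[u8], kmers: &HashSet<&[u8]>, k: usize) -> usize {
    seq.windows(k).filter(|w| kmers.contains(*w)).count()
}
```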
Writing in rust again after a long stretch of python is such a breath of fresh air.
26.06.2025 02:47
Are you going to have a remote component to this? Would love to watch some of these talks if I can
26.06.2025 01:28
Ah this is the way that I do it in paraseq! Doesn't work for fastq headers but works well for fasta
24.06.2025 20:04
Introducing Arc Institute's first virtual cell model: STATE
23.06.2025 17:28
Pretty cool little utility and blog post - fun to see the business/pleasure index for rust crates
boydkane.com/projects/cra...
Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment.
arxiv.org/abs/2506.12986
R.I.P your email inbox haha
16.06.2025 16:25
New preprint! Deacon is a versatile tool for filtering FASTA/FASTQ files and streams at hundreds of megabases per second using minimizers, built with rapid metagenomic host depletion in mind, but equally useful for search.
github.com/bede/deacon
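For readers unfamiliar with minimizers, a minimal sketch: slide a window over `w` consecutive k-mers and keep the start position of the smallest k-mer in each window. This is not Deacon's implementation. The sketch orders k-mers lexicographically and ignores reverse complements, whereas practical tools typically hash k-mers; the function name and the `k`/`w` parameters are assumptions for illustration.

```rust
/// Return the start positions of the (w, k)-minimizers of `seq`.
/// Ties are broken by taking the leftmost smallest k-mer in each window.
fn minimizers(seq: &[u8], k: usize, w: usize) -> Vec<usize> {
    if k == 0 || w == 0 || seq.len() < k + w - 1 {
        return Vec::new();
    }
    let kmers: Vec<&[u8]> = seq.windows(k).collect();
    let mut picked: Vec<usize> = Vec::new();
    for (start, window) in kmers.windows(w).enumerate() {
        // min_by_key returns the first minimum, i.e. the leftmost one.
        let (offset, _) = window
            .iter()
            .enumerate()
            .min_by_key(|&(_, kmer)| *kmer)
            .unwrap();
        let pos = start + offset;
        // Consecutive windows often share a minimizer; record it once.
        if picked.last() != Some(&pos) {
            picked.push(pos);
        }
    }
    picked
}
```

Two sequences that share a long enough stretch will share minimizers, so comparing these sparse positions stands in for comparing every k-mer.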
ish is a grep-like CLI tool that uses optimal alignment instead of exact matching.
It's record-type aware, supporting line, FASTA, and FASTQ records.
Built in Mojo as a proof of concept for bioinformatics.
🧵 1/5
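To show how alignment-based matching differs from exact grep, here is a small sketch of approximate substring search: the minimum edit distance between a pattern and any substring of a record (Sellers' variant of the edit-distance recurrence). This is a swapped-in, simpler technique written in Rust, not ish's Mojo implementation or its scoring scheme; the function name is hypothetical.

```rust
/// Minimum edit distance between `pattern` and any substring of `text`.
/// Zero means an exact occurrence; small values mean a near match.
fn best_match_distance(pattern: &[u8], text: &[u8]) -> usize {
    let m = pattern.len();
    // prev[i] = cost of aligning pattern[..i] against a substring that
    // ends at the previous text position.
    let mut prev: Vec<usize> = (0..=m).collect();
    let mut best = m; // cost against the empty substring
    for &t in text {
        // cur[0] stays 0: a match may start at any text position.
        let mut cur = vec![0usize; m + 1];
        for i in 1..=m {
            let substitute = prev[i - 1] + usize::from(pattern[i - 1] != t);
            let skip_text = prev[i] + 1;
            let skip_pattern = cur[i - 1] + 1;
            cur[i] = substitute.min(skip_text).min(skip_pattern);
        }
        best = best.min(cur[m]);
        prev = cur;
    }
    best
}
```

A grep-like tool built on this would report a record when the distance falls under a user-chosen threshold.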
A good workaround for defaults I use sometimes is Bon. Adds to compile times though which can be annoying
bon-rs.com
lol what expires in this? It's like pure metal
07.06.2025 14:59
Slides from my talk (with @kamilsjaron.bsky.social) on a history of k-mers in bioinformatics: rayan.chikhi.name/pdf/2025-kme...
03.06.2025 09:25
Love seeing audio stuff in rust. How'd you make the visualization?
28.05.2025 17:30
Excited to share insights from our recent paper: "Kaminari: a resource-frugal index for approximate colored k-mer queries". The study aims to efficiently identify documents containing a query string, focusing on DNA strings. www.biorxiv.org/content/10.1... 🧬 🖥️ 1/8
27.05.2025 12:06
One of the great success stories of change haha
23.05.2025 16:14
I think the best way to spur change is to make the new solution as easy as the old one. If it's an easy swap then I think people will try it out and convince themselves it's worth it.
Like swapping out std::collections::HashMap for hashbrown::HashMap.
But it's easier said than done
Oftentimes inertia is the biggest reason for lack of change. If things work as they are people are unlikely to change.
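A sketch of why the HashMap swap is such a low-friction change: if the map type goes through a single alias, switching implementations is a one-line edit. The alias name `Map` and the `count_bases` helper are hypothetical; the hashbrown line is shown only as a comment and assumes the crate has been added as a dependency.

```rust
// One alias for the whole crate. To try hashbrown instead, this line
// would become: type Map<K, V> = hashbrown::HashMap<K, V>;
type Map<K, V> = std::collections::HashMap<K, V>;

/// Count occurrences of each byte in a sequence.
/// Nothing below the alias needs to change when the map type is swapped.
fn count_bases(seq: &[u8]) -> Map<u8, usize> {
    let mut counts = Map::new();
    for &b in seq {
        *counts.entry(b).or_insert(0) += 1;
    }
    counts
}
```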
23.05.2025 15:47
Our Proseg paper is now out in Nature Methods!
www.nature.com/articles/s41...
We borrowed a sampling procedure from the cell simulation literature to infer cell boundaries that best explain the spatial distribution of transcripts.
The scanners are humming, the film is flowing.
The microfiche livestream is up, digitizing government docs in real time for Democracy's Library.
Perfect second-screen vibes: Preservation in progress.
Live M-F, 7:30am–3:30pm PT (except U.S. holidays)
➡️ www.youtube.com/live/aPg2V5R...
So yeah, this is why I keep going on about: do we have to sanitize user input or not? File formats where bad inputs are simply not representable are good, because it saves us from this 100x slowdown.
16.05.2025 17:51
One other option I'd be curious about is an unreachable!()
16.05.2025 15:10
I feel like -1 would lead to some smaller assembly footprint… but super curious what the diff is
16.05.2025 03:28