Kristoffer Sahlin's Avatar

Kristoffer Sahlin

@ksahlin.bsky.social

Associate Professor at the Department of Mathematics, Stockholm University, and a Scilifelab Fellow. Algorithms, Modeling, Transcriptomics, Genomics. Hobby runner 5000m 18:48 | 10k 37:40 | HM 1:27:43 | M 3:39:06

758 Followers  |  153 Following  |  29 Posts  |  Joined: 10.09.2023
Posts Following

Posts by Kristoffer Sahlin (@ksahlin.bsky.social)

DSB 2026 Venice - February 18-19 Workshop Data Structures in Bioinformatics

DSB Program is out !
Seems incredible (as ever)
dsb-meeting.github.io/DSB2026/

20.01.2026 15:35 β€” πŸ‘ 14    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Key Dates and Deadlines RECOMB-Seq 2026 Web Page

recomb-seq.github.io/seq2026/keyd...

11.01.2026 10:17 β€” πŸ‘ 2    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Preview
Multi-context seeds enable fast and high-accuracy read mapping A key step in sequence similarity search is to identify shared seeds between a query and a reference sequence. A well-known tradeoff is that longer seeds offer fast searches but reduce sensitivity in ...

... and the preprint has also been updated: www.biorxiv.org/content/10.1...

29.12.2025 05:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The commit benefited tremendously by minimap2’s practical algorithmic solutions for chaining and rescue -- big credits to @lh3lh3.bsky.social and his minimap2. Also to my students Nicolas Buchin and Ivan Tolstoganov, as well as Marcel Martin 4/4

29.12.2025 05:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Strobealign can now map long reads. Still POC, i.e., PAF only -- no supplementary chains or piecewise extension (yet). 3/4

29.12.2025 05:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Accuracy upgrades: chaining instead of NAMs, smarter local rescue of repetitive hits (minimap2-style), and improved multi-context seeding 2/4

29.12.2025 05:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Release Version 0.17.0 Β· ksahlin/strobealign Changes #504: Introduce collinear chaining as the new default mapping and alignment method, replacing NAMs. This reproduces the Minimap2 chaining algorithm providing better mapping accuracy. NAMs ...

Strobealign v0.17.0 is out! github.com/ksahlin/stro...

29.12.2025 05:49 β€” πŸ‘ 8    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

Thank you folks for your feedback on our survey about Hash functions in genomic sequence analysis. We've updated the paper and you can see the new version here: tinyurl.com/4kk9ccmt.

25.09.2025 13:21 β€” πŸ‘ 11    πŸ” 6    πŸ’¬ 0    πŸ“Œ 1

Preprint out for myloasm, our new nanopore / HiFi metagenome assembler!

Nanopore's getting accurate, but

1. Can this lead to better metagenome assemblies?
2. How, algorithmically, to leverage them?

with co-author Max Marin @mgmarin.bsky.social, supervised by Heng Li @lh3lh3.bsky.social

1 / N

07.09.2025 23:34 β€” πŸ‘ 114    πŸ” 80    πŸ’¬ 5    πŸ“Œ 5
Post image Post image

Congratulations to Rayan Chiki, (Institut Pasteur) head of the β€œSequence Bioinformatics” unit, for securing the ERC Proof of Concept 2025 for his project ENZYMINER! πŸ‘

β€ͺ@rayan.chiki.bsky.social

#Bioinformatics

24.07.2025 15:10 β€” πŸ‘ 60    πŸ” 13    πŸ’¬ 4    πŸ“Œ 2

Incredible! πŸ‘

25.07.2025 22:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

We have officially started #HitSeq track @hitseq.bsky.social at #ISMBECCB2025. Francisco de la Vega, introduces our first #keynote speaker Valentina Boeva @valboeva.bsky.social with her talk: "Learning variant effects on chromatin accessibility and 3D structure without matched Hi-C data"

23.07.2025 10:53 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Meet our amazing sponsor PacBio @pacbio.bsky.social for @hitseq.bsky.social track at #ISMBECCB2025 represented by Elizabeth Tseng with her talk "Bioinformatics analysis for long-read RNA sequencing: challenges and promises" #hitseq #iscb #sequencing #application #iverpool #uk

23.07.2025 16:00 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Dont miss any of our #LongTREC communications at #ISMBECCB2025. Download this flyer to make catching all the latest & hottest long-read transcriptomics research simple.

@anaconesa.bsky.social

21.07.2025 12:30 β€” πŸ‘ 5    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

@hitseq.bsky.social is kicking off with our first keynote @valboeva.bsky.social talking about "Learning variant effects on chromatin accessibility and 3D structure without matched Hi-C data". #ISMBECCB2025

23.07.2025 10:38 β€” πŸ‘ 8    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

πŸ“½οΈ Next in the LongTREC Series: Mahmud Sami Aydin!
Sami is a Doctoral Candidate at @stockholm-uni.bsky.social , working under the supervision of @ksahlin.bsky.social .In this video, Sami shares his research and his role in the broader LongTREC collaboration across Europe.
#AlgorithmDevelopment

13.07.2025 08:28 β€” πŸ‘ 6    πŸ” 4    πŸ’¬ 0    πŸ“Œ 1
Preview
OReO: optimizing read order for practical compression AbstractMotivation. Recent advances in high-throughput and third-generation sequencing technologies have created significant challenges in storing and mana

Paper alert!
We present Oreo a tools that reorder long reads datasets in a way to compress them efficiently with ANY universal compressor like gz, zstd, xz ...
TLDR: You can get state of the art compression WITHOUT a dedicated compressor/decompressor!
academic.oup.com/bioinformati...
A thread!

03.07.2025 10:52 β€” πŸ‘ 23    πŸ” 18    πŸ’¬ 1    πŸ“Œ 1

I worked with Thomas during a three months research visit during his PhD, and it resulted in a paper in NAR. I highly recommend him. doi.org/10.1093/nar/...

02.07.2025 11:48 β€” πŸ‘ 9    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0
Post image

Thomas Baudeau defended his thesis on Studying the properties of viral long reads mapping methods - congrats docteur Baudeau you'll be deeply missed in the team. I'm very glad I got the chance to work with you. Thomas is also on the lookout for a postdoc πŸ‘€

30.06.2025 16:35 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 0    πŸ“Œ 2

🧡1/n
Estimating mutation rates using k-mers is fastβ€”but what happens when repeats dominate the genome?

In a new preprint, Haonan Wu, Antonio Blanca, and myself propose a *repeat-aware* estimator that's accurate even in centromeres.

25.06.2025 13:19 β€” πŸ‘ 30    πŸ” 14    πŸ’¬ 1    πŸ“Œ 0
Post image

Hey yeast lovers. Do you like pangenomes?
O'Donnel et al. 2023 produced T2T assemblies of different strains, including phased haplotypes for yeast.

Here I selected 10 phased haplotypes and the S288C reference,
and looked for the MST28 / YAR033W gene reported to contain SVs such as indels.

πŸ‘‡πŸ»πŸ‘‡πŸ»

11.06.2025 14:46 β€” πŸ‘ 9    πŸ” 7    πŸ’¬ 2    πŸ“Œ 0

Congrats πŸ‘πŸ‘

09.05.2025 22:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

IMO it matters a lot as a 'first impression'

08.05.2025 13:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I did only very minor impl. contributions, but from my (non-expert) view, I like that (1) it installs easily (also on a MacBook) and (2) no header files. Felt much easier to get started with than, e.g., C++. I never truly learned good .h/.cpp practices, and I could never get OpenMP/g++ working well

08.05.2025 13:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
GitHub - aljpetri/isONclust3: De novo clustering of long transcript reads into genes De novo clustering of long transcript reads into genes - GitHub - aljpetri/isONclust3: De novo clustering of long transcript reads into genes

Also, it's in Rust! Tool available at github.com/aljpetri/isO...

08.05.2025 13:04 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

As for results, isONclust3 handles a 37M reads PacBio dataset from a revio machine in under 10h while other algorithms fail (>256Gb mem or >120h runtime). On the other datasets, isONclust3 has comparable or better accuracy than the other benchmarked tools.

08.05.2025 13:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The algorithm follows isONclust's algorithm in the general structure (greedy minimizer matching) but adds three key concepts: high confidence minimizers, on-the-fly cluster information update, and iterative (post-)cluster merging.

08.05.2025 13:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The motivation to develop this algorithm came from the inability of other algorithms to process recent large datasets (10-100M reads) from Revio or PromethION machines.

08.05.2025 13:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
De novo clustering of large long-read transcriptome datasets with isONclust3 AbstractMotivation. Long-read sequencing techniques can sequence transcripts from end to end, greatly improving our ability to study the transcription proc

@alexanderjpetri.bsky.social's isONclust3 algorithm is now published doi.org/10.1093/bioi.... isONclust3 performs de novo clustering of long-read cDNA sequencing data. A key step in reference-free transcriptome analysis.

08.05.2025 13:04 β€” πŸ‘ 11    πŸ” 6    πŸ’¬ 1    πŸ“Œ 0
Post image

@tolyan.bsky.social is our very last speaker, on randstrobes ( high sensitivity seeds ) and their evolution the multi context seeds

25.04.2025 07:39 β€” πŸ‘ 1    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0