Our preprint on our new metagenomic HiFi assembler Alice is out ๐ฅณ Based on a *new sketching method* (๐งต1/6)
๐ Preprint www.biorxiv.org/content/10.1...
๐ Github github.com/rolandfaure/...
@rayanchikhi.bsky.social
Our preprint on our new metagenomic HiFi assembler Alice is out ๐ฅณ Based on a *new sketching method* (๐งต1/6)
๐ Preprint www.biorxiv.org/content/10.1...
๐ Github github.com/rolandfaure/...
There's a typo on line 225, it is actually 50% identity, not 90%. But yeah, SRA is highly redundant :)
03.09.2025 09:31 โ ๐ 3 ๐ 0 ๐ฌ 2 ๐ 0@martinsteinegger.bsky.socialโฌ, @caleblareau.bsky.social, @pierrepeterlongo.bsky.social, @rnalab.bsky.social
03.09.2025 08:39 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0@tlemane.bsky.socialโฌ, @mmontonerin.bsky.social, @apcamargo.bsky.social, @mattlabguy.bsky.social, @sinamajidian.bsky.socialโฌ, @rfaure.bsky.socialโฌ, @jmouradesousa.bsky.socialโฌ, @epcrocha.bsky.socialโฌ, @david-koslicki.bsky.social, @pashadag.bsky.socialโฌ,
03.09.2025 08:39 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0Earthโs genetic diversity is a heritage of humanity. It has been an honour to explore this data with a team of dedicated scientists who shared our vision of making this data free and accessible to all ๐๐งฌโค๏ธ Thank you!
Updated preprint: doi.org/10.1101/2024...
This is a new frontier for biological discovery and AI training data. Logan expands the universe of known proteins, plasmids, AMR, P4 satellites, and the newly discovered Obelisk RNA elements.
03.09.2025 08:39 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0All Logan data is freely-available (cc0) right now. We show how Logan-Search (www.logan-search.org) can be used to uncover viral reactivation (HHV-6) in cell therapy products (TIL and CAR-T).
03.09.2025 08:39 โ ๐ 7 ๐ 1 ๐ฌ 2 ๐ 0Logan rapidly accesses the tapestry of Lifeโs genetic diversity and can help solve global issues.
To tackle the microplastic crisis, we searched Logan for new versions of the 213 known plastic-degrading enzymes. We identified 200+ million homologs ๐คฏ, including new high-efficiency enzymes ๐ฅค๐ฅ
Logan enables minute-scale k-mer search, and hour-scale deep homology protein alignment search, across 100+ Billion proteins.
www.logan-search.org
One year after our initial preprint, we're excited to post a major update to Logan.
At its heart, Logan is the assembly of 27 million samples (50 Pbp) using a 6-day cloud-compute peaking at 2.2M vCPUs. This compresses the SRA 140x compared to raw FASTQs.
github.com/IndexThePlan...
๐๐ฉโ๐ฌ For 15+ years biology has accumulated petabytes (million gigabytes) of๐งฌDNA sequencing data๐งฌ from the far reaches of our planet.๐ฆ ๐๐ต
Logan now democratizes efficient access to the worldโs most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
Congratulations to Rayan Chiki, (Institut Pasteur) head of the โSequence Bioinformaticsโ unit, for securing the ERC Proof of Concept 2025 for his project ENZYMINER! ๐
โช@rayan.chiki.bsky.social
#Bioinformatics
thanks Niema!
25.07.2025 21:04 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0thanks Ben!!
25.07.2025 21:04 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0merci Sophie!
25.07.2025 21:04 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0yes๐
03.06.2025 14:47 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Slides from my talk (with @kamilsjaron.bsky.social) on an history of k-mers in bioinformatics: rayan.chikhi.name/pdf/2025-kme...
03.06.2025 09:25 โ ๐ 44 ๐ 24 ๐ฌ 1 ๐ 2๐งฌ Excited to share our latest work, MUSET ๐ญ, a new tool for creating abundance unitig matrices from sequencing data. It was published yesterday in Oxford Bioinformatics if you want to have a look๐ :
academic.oup.com/bioinformati...
Let's break it down:
For more context: Logan is a collection of all public sequencing data (until end of 2023) assembled into contigs. It is freely hosted on the cloud, and contains hundreds of terabytes of valuable genomic data: github.com/IndexThePlan...
03.02.2025 17:17 โ ๐ 5 ๐ 1 ๐ฌ 0 ๐ 0We have updated all Logan contigs (now at version 1.1)! Contiguity has been much improved (2x) and a duplicated k-mers bug has been fixed. More information and changelog here: github.com/IndexThePlan...
03.02.2025 17:17 โ ๐ 24 ๐ 10 ๐ฌ 1 ๐ 0๐จ Keynotes at RECOMB-seq 2025! ๐จ
๐ Alicia Oshlack โ computational transcriptomics
@aliciao.bsky.social
๐ Rayan Chikhi โ sequencing data structures
@rayanchikhi.bsky.social
๐๏ธ Dates: April 24โ25, 2025
๐ Seoul, South Korea
recomb-seq.github.io/speakers/
Do you want to learn systematic ways in which you can revise your research papers? I've posted a short collection of 4 lectures youtube.com/playlist?lis... 1/n
20.12.2024 13:04 โ ๐ 23 ๐ 9 ๐ฌ 1 ๐ 0Ty Rob!
24.11.2024 09:04 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Our Big Fantastic Virus Database (BFVD) is now published NAR! It contains protein structure predictions of major viral clades, enhanced by petabase-scale homology search and it's explorable on the web.
๐ bfvd.foldseek.com
๐พ bfvd.steineggerlab.workers.dev
๐ academic.oup.com/nar/advance-...