"OpenZL is our answer to the tension between the performance of format-specific compressors and the maintenance simplicity of a single executable binary."
engineering.fb.com/2025/10/06/d...
@jbonfield.bsky.social
Walker, archer, and volunteer woodland warden by weekend, and bioinformatics software engineer and general geek by weekday. My favourite prime is 15551, my favourite colour is, obviously, octarine, and I love nothing more than being immersed in nature.
"OpenZL is our answer to the tension between the performance of format-specific compressors and the maintenance simplicity of a single executable binary."
engineering.fb.com/2025/10/06/d...
Delighted to finally announce a preprint describing the Q100 project! βA complete diploid human genome benchmark for personalized genomicsβ For which we finished HG002 to near-perfect accuracy: www.biorxiv.org/content/10.1... π§΅[1/14]
22.09.2025 17:01 β π 96 π 57 π¬ 4 π 4Note: OLD POST! (2023), but I just noticed it.
While it's nice to see comparisons, why compare an (at the time) 2 year old GATK against a 5 year old bcftools?
Since then both have come on a lot. It'd be interesting to see new independent comparisons. (Neither can hold up to deepvariant now.)
I noted in their presentation they said that samtools mpileup didn't work. I think they're a bit out of date. Bcftools mpileup --poly-mqual can handle the qualities in homopolymers, plus other newer -X profiles.
I haven't tuned it yet though for SBX, but think it'll be OK in general. (To try!)
By creating an Account with Academia.edu, you grant us a worldwide, irrevocable, non-exclusive, transferable license, permission, and consent for Academia.edu to use your Member Content and your personal information (including, but not limited to, your name, voice, signature, photograph, likeness, city, institutional affiliations, citations, mentions, publications, and areas of interest) in any manner, including for the purpose of advertising, selling, or soliciting the use or purchase of Academia.edu's Services.
I'm sorry, worldwide, irrevocable, non-exclusive, transferable permission to my voice and likeness? For what now? In any manner for any purpose???
This is in academia/.edu's new ToS, which you're prompted to agree to on login. Anyway I'll be jumping ship. You can find my stuff at hcommons.org.
Instagram is like facebork but even more annoying. I looked and I can't even find the equivalent post for you over there. It's just a hateful platform. Probably OK for doom scrolling on a phone, but that's about it.
16.09.2025 08:01 β π 2 π 0 π¬ 1 π 0We ought to update htslib.org with more precise recipes, especially for things like conda where we know A) people make mistakes, often and B) it's used A LOT. We may be able to point to something like biocontainers too (or roll our own, but I'd rather not).
It's rarely built from source it seems.
Even more prolific is looking at their WhatsApp number from the minimap2 fake site, and associated email. So so many fake sites. Scary
(See 447950904740 phone number, and emmawatsofficial54 partial email search results).
A google for the support phone number shows how many other phishing sites they have.
www.google.com/search?clien...
Most likely their "support" offering involves getting you to install some trojan.
minimap2.com is potentially a phishing site. Please don't use anything from that website.
github.com/lh3/minimap2...
Heads up: ignore samtools dot org, similarly minimap2 dot com and likely others. It's owned by a known phishing site and while the binaries they offer look valid currently (but note they may be serving us different binaries to others), that could change.
Ie: it's not us (Samtools team)! Be warned
Nothing like cold hard data. It's almost as if Brexit was a pack of lies? Who'd have believed it. ;-)
Of course the people that need to see this obviously won't as it'll be deemed "fake news". I really don't know how to fix that one.
The sun sets behind some trees over a meadow.
What's one way you can reconnect with nature this month?
Fresh out of ideas?
- Take a walk without your phone: notice 5 things around you
- Go on a picnic in a public park
- Learn the name of one local bird and see how often you can spot it
- Pick up some litter
#nature #rewild2gether #share
Nigel Farage looks uncomfortable as Jamie Raskin uses his opening statement to absolutely demolish him
03.09.2025 16:39 β π 21299 π 6880 π¬ 1331 π 1421The binary version changes are probably the biggest issue, with (IIRC) BCF 4.2 not being readable by bcftools and BCF 4.3 not being readable by GATK, as the minor version bump was a breaking change that made them incompatible.
I think it was necessary as some data was broken, but :-( :-( :(
FWIW if I ever get time to finish my bgzf2 (zstd) branch (github.com/jkbonfield/h...) it really shines with multi-sample VCF.
The line lengths are just too big for bgzf to do remotely well due to the 32Kb deflate window size.
BCF gets some things right, but it made many of the same mistakes that BAM did (being of the same era). It's too serial rather than block based, harming any sort of efficient processing and compression. In short, it's the binarisation of the text format that makes it poor.
08.08.2025 20:28 β π 1 π 0 π¬ 1 π 0A text format we can hack and play with allows for fast experimentation, but it shouldn't be the primary format. Not should we have binary guys which are essentially memory dumps from parsing the text. That's partially what killed BCF from adoption. All the pain with minimal gain!
07.08.2025 06:29 β π 3 π 0 π¬ 2 π 0I've not tried this before. Thanks
π§© Puzzle #735
π€ 22 guesses
β±οΈ 6m 43s
π alphaguess.com
Although he does also suitably demonstrate the total lack of PPE needed when scything. That makes it *so* much nicer (along with the noise) than strimmers.
Well, provided you're not scything nettles or brambles as then shoes / shin covering is handy!
Lol, I'm hiding my marriage from myself too apparently!
(Sorry dear. I didn't mean to be ghosting you these last few decades.)
Answering my own question following a lunchtime walk with Iain from @wildlifebcn.org , yes... They start off black.
Learn something new every day π
Preprint alert!
We present K2Rmini, an ultra-fast, grep-like tool that extracts sequences of interest from FASTA/FASTQ files based on their k-mer content.
www.biorxiv.org/content/10.1...
A thread
A black common lizard
Are juvenile common lizards normally black, or is this a melanistic one? It was tiny. Seen at #RSPB #Fowlmere.
27.06.2025 18:10 β π 1 π 0 π¬ 1 π 0There's also Kent - a Swedish group with a distinctly more pop feel. I bought both Agricantus and Kent albums while on work trips away. I liked to pick up random albums local(ish) to the area I'm in. About time I played them again (car has a CD player still).
www.youtube.com/watch?v=1A10...
Not Faroese, but that reminds me of some of Agricantus's music. I'm not sure they go in for "bangers", but perhaps www.youtube.com/watch?v=xbLx... is the closest I could find. I think it has a mix of African and south European languages.
02.06.2025 22:12 β π 2 π 0 π¬ 1 π 0Release 1.22 of HTSlib, SAMtools, and BCFtools is now available from GitHub. See htslib.org/download/ for links to tarballs and release notes. π§ͺ
#samtools #bcftools #htslib #bioinformatics
π’ HPRC Release 2 is here!
Now with phased genomes from 200+ individuals, a 5x increase from Release 1.
Explore sequencing data, assemblies, annotations & alignments in our interactive data explorer β¬οΈ:
humanpangenome.org/hprc-data-re...
Good news. I keep pushing for another htslib release still...
12.05.2025 05:32 β π 0 π 0 π¬ 0 π 0Could definitely start with lynx. Muntjac are a bigger problem to control as they don't herd and are therefore harder to track. They breed all year too I think. (Also not native)
05.05.2025 10:05 β π 0 π 0 π¬ 1 π 0