Wen-Wei Liao's Avatar

Wen-Wei Liao

@wwliao.bsky.social

Postdoctoral Associate at Yale University https://medicine.yale.edu/profile/wen-wei-liao

57 Followers  |  110 Following  |  1 Posts  |  Joined: 10.02.2024  |  2.2814

Latest posts by wwliao.bsky.social on Bluesky

I'm excited to share our pre-print about a new variant benchmarking tool we've been working on for the past few months!

Aardvark: Sifting through differences in a mound of variants
GitHub: github.com/PacificBiosc...

Some highlights in this thread:
1/N

06.10.2025 20:07 β€” πŸ‘ 29    πŸ” 15    πŸ’¬ 1    πŸ“Œ 0

πŸ¦’Long read giraffe is out!πŸ¦’
Mapping long reads to pangenome graphs is ~10x faster than with GraphAligner, with veeery slightly better mapping accuracy, short variant calling, and SV genotyping than GraphAligner or Minimap2

02.10.2025 06:28 β€” πŸ‘ 41    πŸ” 22    πŸ’¬ 1    πŸ“Œ 0
Post image

Do you know ~60% of human SVs fall in ~1% of GRCh38? See our new preprint: arxiv.org/abs/2509.23057 and the companion blog post on how we started this project and longdust: lh3.github.io/2025/09/29/o.... Work with Alvin Qin

30.09.2025 02:19 β€” πŸ‘ 75    πŸ” 27    πŸ’¬ 0    πŸ“Œ 0
Preview
A complete diploid human genome benchmark for personalized genomics Human genome resequencing typically involves mapping reads to a reference genome to call variants; however, this approach suffers from both technical and reference biases, leaving many duplicated and ...

Delighted to finally announce a preprint describing the Q100 project! β€œA complete diploid human genome benchmark for personalized genomics” For which we finished HG002 to near-perfect accuracy: www.biorxiv.org/content/10.1... 🧡[1/14]

22.09.2025 17:01 β€” πŸ‘ 96    πŸ” 57    πŸ’¬ 4    πŸ“Œ 4
Post image

I'm hiring a computational biologist interested in complex trait genetics using deep learning approaches. Reach out to me, if interested.

12.09.2025 19:00 β€” πŸ‘ 36    πŸ” 46    πŸ’¬ 1    πŸ“Œ 0
Preview
Phishing site : minimap2.com Β· Issue #1316 Β· lh3/minimap2 Not sure how to label this one, but I have come across a website minimap2.com which appears to be AI generated but is serving it's own copy of the Github repository. If you search the address or em...

minimap2.com is potentially a phishing site. Please don't use anything from that website.
github.com/lh3/minimap2...

09.09.2025 15:39 β€” πŸ‘ 26    πŸ” 27    πŸ’¬ 1    πŸ“Œ 2

Now published in GigaScience with minor improvements: academic.oup.com/gigascience/...

* Download: zenodo.org/records/1490...
* More info: github.com/lh3/panmask

04.09.2025 16:44 β€” πŸ‘ 30    πŸ” 10    πŸ’¬ 1    πŸ“Œ 1
Preview
GitHub - jltsiren/gbz-base: Prototype for an immutable pangenome graph in SQLite Prototype for an immutable pangenome graph in SQLite - jltsiren/gbz-base

GBZ-base has been a side project for me for a couple of years. It's basically a GBZ graph stored in SQLite instead of a custom file format. You can convert a GBZ graph to GBZ-base quickly and then extract subgraphs around nodes / reference positions on a laptop. 1/n

28.08.2025 00:49 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0
Post image

Interesting job listings πŸ‘€ astromech.com

www.sec.gov/Archives/edg...

15.08.2025 17:19 β€” πŸ‘ 1    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
Postdoctoral Scholar – Genomics, Aging, Somatic Mutation, Structural Variation, Evolution , Cancer – Integrative Biology University of California, Berkeley is hiring. Apply now!

The Sudmant lab at UC Berkeley is seeking a postdoc to work on a fully funded NIH project to understand differences in DNA repair and somatic mutation across the primate tree of life. Please spread widely to those who may be interested aprecruit.berkeley.edu/JPF05052

13.08.2025 04:22 β€” πŸ‘ 54    πŸ” 55    πŸ’¬ 0    πŸ“Œ 1
Preview
Higher eQTL power reveals signals that boost GWAS colocalization Expression quantitative trait locus (eQTL) studies in human cohorts typically detect at least one regulatory signal per gene, and have been proposed as a way to explain mechanisms of genetic liability...

Good discussion & study-design recommendations in this new preprint by @mikelove.bsky.social @klmohlke.bsky.social & colleagues:
Higher eQTL power reveals signals that boost GWAS colocalization πŸ§ͺ🧬

www.biorxiv.org/content/10.1...

06.08.2025 12:45 β€” πŸ‘ 23    πŸ” 9    πŸ’¬ 0    πŸ“Œ 0

This sounds true to me

05.08.2025 10:34 β€” πŸ‘ 33    πŸ” 7    πŸ’¬ 2    πŸ“Œ 0
Beeswarm plot of the prediction error across different methods of double perturbations showing that all methods (scGPT, scFoundation, UCE, scBERT, Geneformer, GEARS, and CPA) perform worse than the additive baseline.

Beeswarm plot of the prediction error across different methods of double perturbations showing that all methods (scGPT, scFoundation, UCE, scBERT, Geneformer, GEARS, and CPA) perform worse than the additive baseline.

Line plot of the true positive rate against the false discovery proportion showing that none of the methods is better at finding non additive interactions than simply predicting no change.

Line plot of the true positive rate against the false discovery proportion showing that none of the methods is better at finding non additive interactions than simply predicting no change.

Our paper benchmarking foundation models for perturbation effect prediction is finally published πŸŽ‰πŸ₯³πŸŽ‰

www.nature.com/articles/s41...

We show that none of the available* models outperform simple linear baselines. Since the original preprint, we added more methods, metrics, and prettier figures!

🧡

04.08.2025 13:52 β€” πŸ‘ 128    πŸ” 57    πŸ’¬ 2    πŸ“Œ 6
Preview
Technical blogging for growth and learning Writing a technical blog improves your writing, forces you to learn new things, helps others and yourself, and helps your career. This post: why blog, what to blog about, how to start, and using AI.

Writing a technical blog improves your writing, forces you to learn new things, helps others and yourself, and helps your career. In this essay: (1) why blog, (2) what to blog about, (3) how to get started, and (4) using AI. https://doi.org/10.59350/bqnfd-7p249 πŸ§ͺ

25.01.2025 16:00 β€” πŸ‘ 54    πŸ” 11    πŸ’¬ 2    πŸ“Œ 2
Preview
65 Genomes Expand Our Picture Of Human Genetics Researchers closely examined the genomes of 65 individuals to paint a more complex, and more complete, picture of human genetic diversity.

Complete genomes alert! @glennislogsdon.bsky.social, @christinebeck.bsky.social, and I were on @scifri.bsky.social today talking about "Complex genetic variation in nearly complete human genomes"
πŸ“„ www.nature.com/articles/s41...
πŸ“» www.sciencefriday.com/segments/65-...

01.08.2025 20:49 β€” πŸ‘ 50    πŸ” 16    πŸ’¬ 1    πŸ“Œ 2
Preview
GitHub - lh3/longdust: Identify long STRs, VNTRs, satellite DNA and other low-complexity regions in a genome Identify long STRs, VNTRs, satellite DNA and other low-complexity regions in a genome - lh3/longdust

Longdust, a new tool to identify highly repetitive STRs, VNTRs, satellite DNA and other low-complexity regions (LCRs). Similar to SDUST but for long regions.
github.com/lh3/longdust

31.07.2025 19:59 β€” πŸ‘ 75    πŸ” 28    πŸ’¬ 0    πŸ“Œ 1
Preview
Complex genetic variation in nearly complete human genomes - Nature Using sequencing and haplotype-resolved assembly of 65 diverse human genomes, complex regions including the major histocompatibility complex and centromeres are analysed.

Two papers in today's issue of @nature.com ‬: 1) we assemble 65 genomes to near completion, including centromeres and the MHC. tinyurl.com/3huhax6w. 2) we sequence 1,019 genomes from the 1kGP with long reads, revealing SVs down to low allele frequencies tinyurl.com/wbx3we9x.

23.07.2025 15:12 β€” πŸ‘ 55    πŸ” 24    πŸ’¬ 1    πŸ“Œ 2
Post image

Preprint on "Finding easy regions for short-read variant calling from pangenome data": arxiv.org/abs/2507.03718

08.07.2025 02:14 β€” πŸ‘ 31    πŸ” 13    πŸ’¬ 0    πŸ“Œ 1

This a really exciting leap forward for genomic sequence to activity gene regulation models. It is a genuine improvement over pretty much all SOTA models spanning a wide range of regulatory, transcriptional and post-transcriptional processes. 1/

25.06.2025 16:18 β€” πŸ‘ 72    πŸ” 20    πŸ’¬ 2    πŸ“Œ 2

Excited to launch our AlphaGenome API goo.gle/3ZPUeFX along with the preprint goo.gle/45AkUyc describing and evaluating our latest DNA sequence model powering the API. Looking forward to seeing how scientists use it! @googledeepmind

25.06.2025 14:29 β€” πŸ‘ 219    πŸ” 82    πŸ’¬ 5    πŸ“Œ 10
Post image

An assessment of DNA language models concludes:
◼️ They do not offer compelling gains over baseline models

Their performance is inconsistent and requires much more compute.

arxiv.org/abs/2412.05430

23.06.2025 20:21 β€” πŸ‘ 52    πŸ” 19    πŸ’¬ 1    πŸ“Œ 3
Preview
Reply to: Reply to: False positives in the study of memory-related gene expression In the Nature paper β€œSpatial transcriptomics reveal neuron–astrocyte synergy in long-term memory” published on March 14th, 2024, authors Sun et al. claimed to identify cell-type specifi…

I wrote a review of a recent paper on false discovery and multiple testing correction. liorpachter.wordpress.com/2025/06/16/r...

16.06.2025 21:31 β€” πŸ‘ 145    πŸ” 40    πŸ’¬ 7    πŸ“Œ 7
Post image

Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment.
arxiv.org/abs/2506.12986

17.06.2025 01:48 β€” πŸ‘ 109    πŸ” 54    πŸ’¬ 0    πŸ“Œ 1
A blueprint for the batmobile.

A blueprint for the batmobile.

Your genome is not a blueprint. A thread about misleading metaphors in science communication. 🧬πŸ§ͺ 1/n

14.06.2025 15:49 β€” πŸ‘ 199    πŸ” 66    πŸ’¬ 9    πŸ“Œ 9

People always stop me in the street to ask: "Yoav, where are the disease-associated eQLTs? We found a lot in GTEx but we can't find anymore. Do you know where they are?"

(For the record, no one has ever asked me this, but it is a really good question!)

I think we know where they are.

10.06.2025 14:20 β€” πŸ‘ 58    πŸ” 14    πŸ’¬ 3    πŸ“Œ 2
Standard methods are equivalent to a flashlight, looking at each gene independently. We combine signals from multiple genes, turning a floodlight onto the genome.

Standard methods are equivalent to a flashlight, looking at each gene independently. We combine signals from multiple genes, turning a floodlight onto the genome.

Excited to share my first PhD paper in the @sbmontgom.bsky.social lab with @tamigj.bsky.social (www.biorxiv.org/content/10.1...)! Standard QTL methods treat each gene independently. But what if a single variant regulates multiple nearby genes at once - what we call β€œallelic proxitropy”? 🧡 ⬇️

08.06.2025 17:38 β€” πŸ‘ 91    πŸ” 33    πŸ’¬ 6    πŸ“Œ 4

Very excited to share I’ll be starting my own group at University of Utah in the Department of Human Genetics in the new year! Reach out if you are interested! vollgerlab.com

05.06.2025 14:15 β€” πŸ‘ 65    πŸ” 12    πŸ’¬ 9    πŸ“Œ 4

Neng Huang developed longcallR for joint SNP calling and phasing from long RNA-seq reads, AND for identifying allele-specific splicing/junctions (ASJ). Although ASJs of statistical significance are rare, a large fraction involve unannotated junctions. In Rust!

30.05.2025 14:54 β€” πŸ‘ 16    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

The human pangenome continues to grow and improve! Release 2 is here! Click through for the details, but this is a pretty amazing dataset including not just the phased assemblies, but PacBio HiFi, ONT Ultralong, Dovetail/Illumina Hi-C, PacBio Kinnex, and Illumina WGS for all samples

12.05.2025 13:52 β€” πŸ‘ 82    πŸ” 36    πŸ’¬ 0    πŸ“Œ 0
Four images to illustrate some prominent single-gene myths. Top left shows a photograph of a person deftly rolling their tongue into a U-shape. Top right shows a photograph of a person’s ear, highlighting the shape and features of the earlobe and cartilage. Bottom left shows a close-up photograph of a person’s eye, with a vivid blue colouration. Bottom right shows a photograph of a person poised to write with their left hand on the blank white page of a spiral-bound notebook.

Four images to illustrate some prominent single-gene myths. Top left shows a photograph of a person deftly rolling their tongue into a U-shape. Top right shows a photograph of a person’s ear, highlighting the shape and features of the earlobe and cartilage. Bottom left shows a close-up photograph of a person’s eye, with a vivid blue colouration. Bottom right shows a photograph of a person poised to write with their left hand on the blank white page of a spiral-bound notebook.

Remember when you first learned about genetics at school? All those fascinating examples of human traits that are each apparently determined by just a single gene? Time to check in on some of your favourites to see how they’re doing. 🧬🧡πŸ§ͺ 1/n

02.05.2025 14:50 β€” πŸ‘ 1265    πŸ” 595    πŸ’¬ 51    πŸ“Œ 83

@wwliao is following 20 prominent accounts