Paper alert! Extremely excited to share a preprint from our lab! Spearheaded by @axel-schmidt.bsky.social, a super talented medical & computational geneticist, we studied latent Epstein-Barr virus (EBV) infection at population scale.
Interested in how this works & what we found? Read along!
22.07.2025 16:10
Super excited to see this out. What started as some math in a grant in 2020, to a student deciding to take this on in 2022, to published in 2025.
These things can take time and patience is key!
21.07.2025 18:54
Thanks for those kind words Davis! I caught the eQTL bug in your lab and it's great to finally contribute to the field
23.07.2025 08:39
Unfortunately not yet! This version of quasar does not support cell-level data or interaction testing, but those are the two biggest features I want to add. The next part of my PhD will likely focus on finer resolution single-cell eQTLs, so watch this space :)
22.07.2025 21:19
Finally a big thanks to @chr1sw.bsky.social for her support throughout this project and we welcome any and all feedback on the software and paper!
22.07.2025 10:15
In addition, we provide mathematical intuition for why negative binomial mixed models give very similar results to Poisson mixed models and study the interaction between methods for computing gene-level p-values and FDR methods.
22.07.2025 10:15
Statistical power of negative binomial and linear model methods across the OneK1K dataset. a) Number of eQTLs detected by the quasar linear model and negative binomial GLM with adjusted profile likelihood dispersion estimation methods across all cell types in the OneK1K dataset. b) Number of eGenes detected by the quasar linear model and negative binomial GLM with adjusted profile likelihood dispersion estimation methods across all cell types in the OneK1K dataset.
When comparing methods we found that mixed model methods did not have better performance, but that, as previously reported, count distribution methods increased power. We recommend the negative binomial GLM, using the APL, as the method with the best overall performance.
22.07.2025 10:15
a) Histograms of Pearson correlation of −log10 transformed variant-level p-values for each gene, correlating the output of quasar against the method that uses the same statistical model (LM: tensorQTL, NB-GLM: jaxQTL, LMM: apex). All results are computed for the B IN cluster. b) Speed of methods across the three representative cell types. All methods were run on CPUs. Methods are labelled by the options used to run them: for tensorQTL and jaxQTL "cis" computes significance at the level of genes while "cis nominal" computes significance at the level of variants.
When run on CPUs quasar is quite a bit faster (up to ~40x) than existing methods, while producing concordant output when the statistical model aligns.
22.07.2025 10:15
We compared quasar to three existing eQTL mapping methods (tensorQTL, jaxQTL and apex) in a pseudobulk analysis of the OneK1K dataset and used the flexibility of quasar to compare different models without confounding by implementation.
22.07.2025 10:15
Bar charts of number of discoveries across different tools and thresholds in a paper about eQTL mapping
2. We also show that negative binomial models can fail to appropriately control the Type 1 error, which we fix in quasar by implementing the Cox-Reid adjusted profile likelihood (APL), a core part of edgeR and DESeq2.
22.07.2025 10:15
1. In mixed models a recurring challenge has been how to approximate the (very slow!) calculation of the score test variance. We introduce and implement a trace-based approx, which can be computed in O(n) time in LMMs. Our derivation also clarifies the effectiveness of the approx used in regenie.
22.07.2025 10:15
Compared to other eQTL mapping methods, quasar implements a much wider variety of statistical models: the linear model, Poisson and negative binomial GLMs, the linear mixed model and Poisson and negative binomial GLMMs. Beyond this versatility, quasar has two pieces of novel methodology:
22.07.2025 10:15
New preprint just dropped
medrxiv.org/content/10.1101/2025.06.24.25330216
The main output from my PhD is finally public and we're SUPER excited about the findings! If you're interested in what we learnt about IBD with a massive 700+ sample sc-eQTL dataset of the gut, read on!
08.07.2025 08:51
I'd be interested in why that is, especially as I've spent the last ~6 months implementing a different part of the edgeR machinery in a mixed model context
20.06.2025 15:32
That's true! I think the approach still has a lot of value though to uncover the right weights/variables for each cluster, it would just be interesting if you can recover the pc-eQTLs as sum/difference QTLs.
11.06.2025 06:53
Variant-specific priors clarify colocalisation analysis
Author summary Evaluating whether two traits, such as disease risk and gene expression, are affected by the same genetic variants is crucial for understanding the molecular mechanisms through which ge...
Very happy that my first PhD paper is now out in PLOS Genetics! journals.plos.org/plosgenetics.... We describe our implementation of variant-specific priors in coloc. We show that using distance to the TSS as information about which variants are causal can improve colocalisation performance, 1/n
09.06.2025 10:45
I agree completely and you're right that what I said only holds exactly when the variables are positively correlated. I guess I'm really wondering whether the 2 cluster cases can also be interpreted as sum-expression or difference-expression QTLs in many cases.
10.06.2025 10:35
PC2 QTLs would also be found as QTLs for the difference of the gene expression levels. Then they could be simply interpreted as tuning the difference in gene expression levels. Thanks!
09.06.2025 20:59
Such a cool paper! In the case of a cluster of two genes, simulations suggest (shorturl.at/Pbh8Z) that the two PCs are going to be highly correlated with the sum and difference of the expression levels. I wonder in your second two gene-example (NLRC3, CLUAP1) whether the 1/n
09.06.2025 20:59
Thanks for those examples, super interesting!
09.06.2025 20:24
Interesting! Do you have a strong prior about whether the 'causal path' normally goes through one of the regulated genes, or through all/most of them?
09.06.2025 19:36
Can you use variant level information in colocalisation? Yes! Will it improve accuracy on average? Yes! Will it make a substantial difference? Not using any information we could think of.
Very nice work by @jeffreypullin.bsky.social to adapt coloc to enable these questions to be addressed.
09.06.2025 15:58
Standard methods are equivalent to a flashlight, looking at each gene independently. We combine signals from multiple genes, turning a floodlight onto the genome.
Excited to share my first PhD paper in the @sbmontgom.bsky.social lab with @tamigj.bsky.social (www.biorxiv.org/content/10.1...)! Standard QTL methods treat each gene independently. But what if a single variant regulates multiple nearby genes at once - what we call "allelic proxitropy"? 🧵 ⬇️
08.06.2025 17:38
although the improvement wasn't as large as we initially expected. Big thanks to @chr1sw.bsky.social for her support throughout this project!
09.06.2025 10:45
PhD candidate in ML for genomics in Heidelberg, Germany with Oli Stegle
Previously at Genentech, UBC and BITS Pilani
https://scholar.google.com/citations?user=4yUtALcAAAAJ&hl=en&oi=ao
🇸🇰 PhD student @ KU Leuven and VIB 🇧🇪
Genetics, Bioinformatics, and everything in between (she/her)
Teaches at the City College of New York, edits the Journal of Genocide Research. www.dirkmoses.com
🇲🇾 scientist in 🇦🇺 | 🩸 #malaria | #nanobodies | Prof @WEHI_research @ourANU | on Wurundjeri, Ngunnawal & Ngambri lands | views my own
she/her/hers | biomed PhD studying 🧬 of 🧠 @stjuderesearch.bsky.social in @hcmefford.bsky.social lab | cat mom 🐱 | views are my own
Biostatistician. Baritone. He/Him.
Product of more than one country.
May contain nuts.
Journalist. Author. Broadcaster. Liberal extremist.
Science Director of OpenTargets and Group Leader at Wellcome Sanger Institute
Genetics, immunology, drug discovery
Outdoors, cats, dogs and all animals
Views are my own
Head of Genome Biology Dept @EMBL,
Scientist, Principal investigator, Professor
Exploring genome regulation during development,
and everything to do with enhancers.
3D genome, chromatin topology,
cell_fate, embryonic Development,
Single Cell genomics
International lawyer • PhDing @CambridgeUni.bsky.social re digital surveillance & repro justice • dog enthusiast • iced coffee drinker • Oxford comma fanatic
Group Leader @OxfordUni working on MHC, immune-mediated traits, multi-ancestry and omics studies
Professor of Medicine at Vanderbilt University Medical Center and Director of the Vanderbilt Genetics Institute. My research is in computational and statistical human genomics.
Medical doctor (2015) | Board-certified in Human Genetics (2025) | Researcher & Bioinformatician | Outdoor sports (hiking - biking - swimming) | Read | Cook | Relax :)
(machine) learning to turn one genome into 959 cells 🪱 | bioinfo phd 🧬🖥️ in Leuven (Aerts lab @ vib.ai 🇧🇪) & Utrecht (AvO lab @ hubrecht.eu 🇳🇱)
also sometimes urbanism & transport | prev 🇳🇱, 🇭🇰, 🇩🇰 | he/him | ook in het nederlands
Group leader at University of Bonn, studying the genomics of birth defects and infectious diseases.
Sydney girl, Fulbright recipient 2020-2021 with Monkol Lek. PhD Kids Research Sydney 2022, now postdoc with @nickywhiffin.bsky.social at BDI Oxford. Splicing & smORFs!
Chief Data Scientist - Black Ochre Data Labs (blackochrelabs.au)
Past President - ABACBS (abacbs.org)
Adjunct A/Professor - Australian National University (anu.edu.au)
Interested in using multiomics to study complex diseases
CSO Nucleus Genomics
Genetics, polygenic risk scores. Previously at impute.me and Genome Center Denmark
PhD Student at UMich Statistics.
The account mostly trashes about urban planning and infrastructure.
Probability, Statistics, and Evolutionary Biology.
https://hanbin973.github.io