The analogy I find myself frequently using here: if everyone smoked (or no one did), the heritability of lung cancer would go up. We think about this a lot in the context of cross-biobank comparisons
31.01.2026 01:07 β π 6 π 0 π¬ 0 π 0The analogy I find myself frequently using here: if everyone smoked (or no one did), the heritability of lung cancer would go up. We think about this a lot in the context of cross-biobank comparisons
31.01.2026 01:07 β π 6 π 0 π¬ 0 π 0gs://ukb-diverse-pops-public/misc/pairwise/pairwise_correlations_regressed.txt.bgz - itβs coded in the way that our pan-UKB phenotypes were so not sure if itβs super easy to use but thatβs pairwise r_p for ~14k phenos
08.10.2025 18:01 β π 0 π 0 π¬ 0 π 0Special thanks to all co-authors that got this here including @masakanai.bsky.social @rahulg603.bsky.social @dalygene.bsky.social @egatkinson.bsky.social and of course @genetisaur.bsky.social for driving this through 5 years of work (after the first GWASes were done!)
18.09.2025 17:32 β π 4 π 0 π¬ 0 π 0Tons of lessons learned around carefully controlling population stratification, using heritability as a QC metric, and probably most importantly, quantifying novelty in a mega-phenotype analysis. Some really cool analyses to find interesting biology e.g. allelic series and ancestry-enriched variants
18.09.2025 17:29 β π 0 π 0 π¬ 1 π 0A project many years in the process, weβre pleased to present our work on multi-ancestry meta-analysis across a boatload of traits in the UK Biobank: www.nature.com/articles/s41...
18.09.2025 17:25 β π 64 π 25 π¬ 1 π 0Weβve put up summary statistics for over 3,000 traits in the All of Us resource, and a shiny new browser alongside it! Explore your favorite gene or phenotype here: allbyall.researchallofus.org #ASHG24
08.11.2024 20:32 β π 35 π 14 π¬ 1 π 1Starter pack of people who create starter packs?
08.11.2024 00:06 β π 1 π 0 π¬ 0 π 0You mean βReNally???β?
07.11.2024 18:23 β π 0 π 0 π¬ 1 π 0Heh, it was on our list but somehow never made it into the pre-submission checklist. Will do!
21.09.2024 17:00 β π 0 π 0 π¬ 0 π 0Interesting question. We do have a βgnomAD-newβ analysis in there but havenβt broken down by ancestry - i fear a lot is going to be driven by βnot yet observedβ (which is the same across all ancestries)
20.09.2024 16:45 β π 1 π 0 π¬ 0 π 0It gets a bit more complicated though - these scores have a mix of impacts of variant-to-gene, as well as prioritizing which genes, when disrupted, lead to phenotypes. Perhaps a new method that combines both these insights optimally will outperform them all!
20.09.2024 15:51 β π 0 π 0 π¬ 1 π 0We found that population-focused methods do best for identifying highly impactful variants (de novoβs in individuals with developmental disorders for instance), while the deep learning methods are better at prioritizing inherited variation in biobanks
20.09.2024 15:49 β π 0 π 0 π¬ 1 π 0We have a new preprint that weβd love feedback on! We benchmarked a bunch of variant scoring methods to figure out what they were actually doing, and how they performed across selection regimes: www.biorxiv.org/content/10.1...
20.09.2024 15:47 β π 7 π 6 π¬ 2 π 0Welcome new followers (and thanks @michelnivard.bsky.social)! Iβm loving the critical mass, and to celebrate, Iβll post some exciting new content (my first time posting here and not on the the other site)
20.09.2024 15:46 β π 2 π 0 π¬ 0 π 0
Recently out on #bioRxiv: our updated approach to identify regional variability in missense mutation intolerance (βconstraintβ) in protein-coding genes using the gnomAD database.
www.biorxiv.org/content/10.1...
1/10
As genomic analyses scale to millions of exomes/genomes, we need a scalable infrastructure to process/QC/handle these data while retaining all the metrics needed for downstream analysis. A new preprint from the Hail team proposes a way to do this! Comments welcome: www.biorxiv.org/content/10.1...
31.01.2024 15:40 β π 4 π 1 π¬ 1 π 0Paella is good. With LOEUF I assumed the culmination would be an omelette but CHARR is better in a paella, so maybe itβs a multi-course meal
07.12.2023 18:10 β π 0 π 0 π¬ 1 π 0Extended data figure 2b has exclusive exon-only. I think we internally made some with intermediate overlaps and it was an intermediate result as youβd expect
07.12.2023 18:09 β π 1 π 0 π¬ 0 π 0And thanks to Ryan Dhindsa and SlavΓ© Petrovski for the excellent writeup and context around our work. Excited for the times ahead! www.nature.com/articles/d41...
06.12.2023 17:14 β π 2 π 0 π¬ 1 π 0This is all thanks to an amazing production team, browser team, and steering committee @gnomad-project.bsky.social, the 76,156 individuals that provided their genomes, and support from Broad Genomics and Hail
06.12.2023 17:12 β π 1 π 0 π¬ 1 π 0Interestingly, these scores also provide additional insight into genes regulated by these regions, even those underpowered by previous constraint metrics:
06.12.2023 17:11 β π 0 π 0 π¬ 1 π 0Gnocchi extends our constraint metrics to the non-coding genome, highlighting for instance, disease-associated non-coding CNVs
06.12.2023 17:11 β π 1 π 0 π¬ 1 π 0We built a new metric we called gnocchi (genomic non-coding constraint of haploinsufficient variation), building on methods that find depletions of variation (natural selection), which we show can prioritize functional variation
06.12.2023 17:10 β π 2 π 0 π¬ 1 π 1Thrilled to have our work on gnomAD out in print at Nature today. With 76K genomes, we can look beyond the coding genome and into the non-coding genome to find regions important for human disease idp.nature.com/authorize?re...
06.12.2023 17:10 β π 13 π 5 π¬ 1 π 1Thanks to Wenhan Lu for driving this effort, Hail (hail.is) for building the scalable infrastructure that enabled this, and @gnomAD-project.bsky.social for the data and support
28.11.2023 19:36 β π 0 π 0 π¬ 0 π 0CHARR operates only on homozygous alternate sites and scales very well (βcost per 1M samplesβ might be my new favorite metric):
28.11.2023 19:35 β π 1 π 0 π¬ 1 π 0Excited for my first post on this new site to share our work in print at AJHG using variant call data to estimate DNA contamination. As our sample sizes get into the millions of genomes, we need methods like this to efficiently process and quality control the data authors.elsevier.com/c/1i8PAgeX6LB~
28.11.2023 19:34 β π 12 π 4 π¬ 1 π 0