Josh Weinstock's Avatar

Josh Weinstock

@joshweinstock.bsky.social

Assistant Professor in the Department of Human Genetics at Emory University. Statistical genetics and genomics + genetic epidemiology of somatic mosaicism. weinstocklab.org

208 Followers  |  291 Following  |  18 Posts  |  Joined: 07.10.2023  |  2.1363

Latest posts by joshweinstock.bsky.social on Bluesky

Post image

STATGEN 2026 will be in Atlanta @emoryrollins.bsky.social - see more info here: statgen26.emory.edu

05.12.2025 17:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Specificity, length and luck drive gene rankings in association studies - Nature Genetic association tests prioritize candidate genes based on different criteria.

How do GWAS and rare variant burden tests rank gene signals?

In new work @nature.com with @hakha.bsky.social, @jkpritch.bsky.social, and our wonderful coauthors we find that the key factors are what we call Specificity, Length, and Luck!

🧬πŸ§ͺ🧡

www.nature.com/articles/s41...

07.11.2025 00:05 β€” πŸ‘ 170    πŸ” 74    πŸ’¬ 5    πŸ“Œ 10

Shout to extraordinary collaborators and co-authors on this, Karen Conneely, Cameron Russell, Mitchell Machiela, Marios Arvanitis, Janghee Woo.

For more info, I'm presenting this work on Thursday at #ASHG at 1:45pm, Room 210C

14.10.2025 21:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Explore clonal hematopoiesis associations with proteins and disease in UK Biobank.

To make results easier to parse, we also release a summary statistics "portal" to view our results, including lots of QC metrics: somatic.emory.edu

We also release lots of code for this, which was all done on the DNA Nexus RAP in a cost effective way.

14.10.2025 21:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The IGH and IGL mutations in particular had really large effect sizes, so we then did a GWAS of a combined IGH + IGL phenotype, which resulted in a single hit: GRAMD1B. Remarkably, this is well characterized risk locus for CLL, suggesting that we are converging on CLL relevant biology.

14.10.2025 21:27 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We then quantified the variance explained on a liability scale of CH to 30 common aging-related diseases. CH explains far more 'liability' scale variance for hematologic malignancies than other classes (as one would expect), though chronic kidney disease appears high here as well.

14.10.2025 21:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

New hits are generally indeed "weaker" than most classic CH drivers, suggesting that these are just underneath the "tip of the iceberg". We also replicate the telomere attrition mechanism reported here www.nature.com/articles/s41..., finding that the phenomenon is broader than previously observed

14.10.2025 21:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

The strongest hits are indeed strongly enriched for canonical CH genes, but we also find non-coding mutations at FGF1, UGT2B7, DGKB, the TERT promoter, chr17 centromere, and immunoglobulin loci:

14.10.2025 21:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Genome-wide characterization of clonal hematopoiesis reveals extensive non-coding putative driver mutations As humans age, we acquire somatic mutations in our blood, leading to clonal hematopoiesis (CH). Despite the prevalence of clonal hematopoiesis (CH) in aged individuals, recent searches for selective s...

What happens if you search the genomes of ~490K adults for variants that associate with age at blood draw? We actually did this crazy idea, using the UKB WGS (from peripheral blood).

Turns out this 'discovers' classic clonal hematopoiesis drivers and more πŸ§΅πŸ‘‡

www.medrxiv.org/content/10.1...

#ASHG

14.10.2025 21:27 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Strober Lab The Strober lab is a computational group at Boston Children's Hospital (a Harvard Medical School affiliated hospital) focused on developing statistical and machine learning tools applied to human gene...

Exciting updates!!
(1) I just opened my lab at Boston Children’s Hospital (Harvard-affiliated)
(2) I’m hiring a postdoc focused on integrating GWAS and functional genomic data. Reach out if you’re interested or connect at ASHG next week!
(3) Learn more at stroberlab.com

07.10.2025 18:55 β€” πŸ‘ 31    πŸ” 10    πŸ’¬ 0    πŸ“Œ 1

Excited for a major milestone in our efforts to map enhancers and interpret variants in the human genome:

The E2G Portal! e2g.stanford.edu

This collates our predictions of enhancer-gene regulatory interactions across >1,600 cell types and tissues.

Uses cases πŸ‘‡

1/

18.09.2025 16:14 β€” πŸ‘ 84    πŸ” 36    πŸ’¬ 2    πŸ“Œ 1
Preview
Beneath the surface of the sum When genetic interactions matter and when they don't

I wrote about gene-gene interactions (epistasis) and the implications for heritability, trait definitions, natural selection, and therapeutic interventions. Biology is clearly full of causal interactions, so why don't we see them in the data? A 🧡:

27.08.2025 20:40 β€” πŸ‘ 144    πŸ” 47    πŸ’¬ 1    πŸ“Œ 6

You can find PRSFNN code here: github.com/weinstockj/PRS . It takes in GWAS summary statistics + LD reference panel + annotations, and we compute the posterior using variational inference to make it fast.

A pleasure to build this with @aprilkim.bsky.social and @alexisbattle.bsky.social

22.07.2025 21:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We observed similar non-linear effects with AlphaMissense predictions, where low impact coding variants were prioritized, but highly pathogenic variants were not prioritized (presumably because these are so rare in real GWAS).

22.07.2025 21:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

More compellingly, it also learns non-linear effects wrt to chromatin accessibility - variants in cCREs present in 10-50 cell types were more highly prioritized than variants in cCRE that are present in numerous (> 50) cell types, suggesting a preference for cell/tissue specificity.

22.07.2025 21:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

In our annotation curation, we included lots of scATAC, cCREs from ENCODE, conservation from Zoonomia, pathogenicity from AlphaMissense, among others. Generally - PRSFNN "learns" that low-frequency SNPs in accessible chromatin are likely to have larger effect sizes (maybe not that surprising).

22.07.2025 21:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We connected the SNP annotations to the parameters of the prior distribution on the weights in a novel way with a neural network, so we're calling it Polygenic Risk Scores with Functional Neural Network (PRSFNN). We were excited to see that PRSFNN does well in benchmarks (at least in our hands).

22.07.2025 21:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Polygenic prediction of phenotypes with a neural empirical Bayes approach Polygenic risk scores (PRS) estimate the expected value of a phenotype based on individual genotypes. Although statistical approaches for calculating PRS have advanced considerably in recent years, fe...

Really excited to share our new PRS method, developed with @aprilkim.bsky.social and @alexisbattle.bsky.social ! Our approach is to use a lot of recently developed functional annotations to better estimate the weights of the SNPs.
www.medrxiv.org/content/10.1...

22.07.2025 21:25 β€” πŸ‘ 26    πŸ” 10    πŸ’¬ 1    πŸ“Œ 1

Congratulations!

05.06.2025 03:03 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Integration of transcriptomics and long-read genomics prioritizes structural variants in rare disease An international, peer-reviewed genome sciences journal featuring outstanding original research that offers novel insights into the biology of all organisms

Happy to share our work characterizing functional rare SVs in rare diseases with long-read genome sequencing and transcriptomic outlier data: genome.cshlp.org/content/earl...

26.03.2025 14:30 β€” πŸ‘ 10    πŸ” 7    πŸ’¬ 1    πŸ“Œ 1

Sharing some of our lecture slides on statistical genetics! 🧬

Co-taught with Mike Epstein, Dave Cutler, Karen Conneely, Jingjing Yang, Jian Hu.

Mendelian randomization: weinstocklab.org/lecture_slid...

Biobank scale GWAS methods: weinstocklab.org/lecture_slid...

Hope they're helpful!

05.03.2025 23:31 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Home | Mostafavi Lab

We have multiple postdoc positions available in my group at NYU. Join us if you're interested in complex trait genetics and biology. More information about the lab on our website: mostafavilab.org

01.06.2024 13:43 β€” πŸ‘ 19    πŸ” 15    πŸ’¬ 0    πŸ“Œ 0

Specificity, length, and luck: How genes are prioritized by rare and common variant association studies https://www.biorxiv.org/content/10.1101/2024.12.12.628073v1

16.12.2024 10:33 β€” πŸ‘ 46    πŸ” 24    πŸ’¬ 0    πŸ“Œ 1
Preview
Cell type and dynamic state govern genetic regulation of gene expression in heterogeneous differentiating cultures Popp etΒ al. generate dozens of cell types from 53 human iPSC lines in order to characterize the dynamic genetic regulation of gene expression across early stages of cellular differentiation. Accessing...

Excited to see our study on genetic regulation in heterogeneous differentiating cultures out in final form!

www.cell.com/cell-genomics/fulltext/S2666-979X(24)00330-6

04.12.2024 02:31 β€” πŸ‘ 19    πŸ” 6    πŸ’¬ 1    πŸ“Œ 1

I'm developing a pipeline to call CHIP mutations in UK Biobank using the DNA Nexus RAP that is fast/cheap/reproducible. Initial results are promising; calls looks reasonable and cost to do this across all of UKB is likely < 500$.

Feel free to DM if of interest.

22.11.2024 22:17 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Great talk from Zeyun Lu on integrating cis-eQTLs with perturb-seq to increase discovery in Mendelian randomization #ASHG2023 !

03.11.2023 15:59 β€” πŸ‘ 14    πŸ” 2    πŸ’¬ 0    πŸ“Œ 1

@joshweinstock is following 20 prominent accounts