Konrad's Avatar

Konrad

@konradjk.bsky.social

Genomicist, computational biologist. Assistant professor @ MGH, HMS. Associate member @ Broad Institute https://klab.is

1,165 Followers  |  44 Following  |  21 Posts  |  Joined: 30.08.2023  |  2.0196

Latest posts by konradjk.bsky.social on Bluesky

All by All The All by All browser maps known and novel associations between genotypes and phenotypes using data contributed by All of Us Research Program participants as of July 1, 2022. All by All encompasses a...

Weโ€™ve put up summary statistics for over 3,000 traits in the All of Us resource, and a shiny new browser alongside it! Explore your favorite gene or phenotype here: allbyall.researchallofus.org #ASHG24

08.11.2024 20:32 โ€” ๐Ÿ‘ 24    ๐Ÿ” 6    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Starter pack of people who create starter packs?

08.11.2024 00:06 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

You mean โ€œReNally???โ€?

07.11.2024 18:23 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Heh, it was on our list but somehow never made it into the pre-submission checklist. Will do!

21.09.2024 17:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Interesting question. We do have a โ€œgnomAD-newโ€ analysis in there but havenโ€™t broken down by ancestry - i fear a lot is going to be driven by โ€œnot yet observedโ€ (which is the same across all ancestries)

20.09.2024 16:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It gets a bit more complicated though - these scores have a mix of impacts of variant-to-gene, as well as prioritizing which genes, when disrupted, lead to phenotypes. Perhaps a new method that combines both these insights optimally will outperform them all!

20.09.2024 15:51 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

We found that population-focused methods do best for identifying highly impactful variants (de novoโ€™s in individuals with developmental disorders for instance), while the deep learning methods are better at prioritizing inherited variation in biobanks

20.09.2024 15:49 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Variant scoring performance across selection regimes depends on variant-to-gene and gene-to-disease components bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution

We have a new preprint that weโ€™d love feedback on! We benchmarked a bunch of variant scoring methods to figure out what they were actually doing, and how they performed across selection regimes: www.biorxiv.org/content/10.1...

20.09.2024 15:47 โ€” ๐Ÿ‘ 7    ๐Ÿ” 6    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Welcome new followers (and thanks @michelnivard.bsky.social)! Iโ€™m loving the critical mass, and to celebrate, Iโ€™ll post some exciting new content (my first time posting here and not on the the other site)

20.09.2024 15:46 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Recently out on #bioRxiv: our updated approach to identify regional variability in missense mutation intolerance (โ€œconstraintโ€) in protein-coding genes using the gnomAD database.

www.biorxiv.org/content/10.1...

1/10

19.04.2024 23:34 โ€” ๐Ÿ‘ 6    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
The Scalable Variant Call Representation: Enabling Genetic Analysis Beyond One Million Genomes bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution

As genomic analyses scale to millions of exomes/genomes, we need a scalable infrastructure to process/QC/handle these data while retaining all the metrics needed for downstream analysis. A new preprint from the Hail team proposes a way to do this! Comments welcome: www.biorxiv.org/content/10.1...

31.01.2024 15:40 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Paella is good. With LOEUF I assumed the culmination would be an omelette but CHARR is better in a paella, so maybe itโ€™s a multi-course meal

07.12.2023 18:10 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Extended data figure 2b has exclusive exon-only. I think we internally made some with intermediate overlaps and it was an intermediate result as youโ€™d expect

07.12.2023 18:09 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
An expanded genomic database for identifying disease-related variants An expanded version of a human-genome database called gnomAD, containing 76,156 whole-genome sequences, has enabled investigation of how variants in non-protein-coding regions of the genome affect hea...

And thanks to Ryan Dhindsa and Slavรฉ Petrovski for the excellent writeup and context around our work. Excited for the times ahead! www.nature.com/articles/d41...

06.12.2023 17:14 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This is all thanks to an amazing production team, browser team, and steering committee @gnomad-project.bsky.social, the 76,156 individuals that provided their genomes, and support from Broad Genomics and Hail

06.12.2023 17:12 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Interestingly, these scores also provide additional insight into genes regulated by these regions, even those underpowered by previous constraint metrics:

06.12.2023 17:11 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Gnocchi extends our constraint metrics to the non-coding genome, highlighting for instance, disease-associated non-coding CNVs

06.12.2023 17:11 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

We built a new metric we called gnocchi (genomic non-coding constraint of haploinsufficient variation), building on methods that find depletions of variation (natural selection), which we show can prioritize functional variation

06.12.2023 17:10 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
A genomic mutational constraint map using variation in 76,156 human genomes - Nature A genomic constraint map for the human genome constructed using data from 76,156 human genomes from the Genome Aggregation Database shows that non-coding constrained regions are enriched for regulator...

Thrilled to have our work on gnomAD out in print at Nature today. With 76K genomes, we can look beyond the coding genome and into the non-coding genome to find regions important for human disease idp.nature.com/authorize?re...

06.12.2023 17:10 โ€” ๐Ÿ‘ 13    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Hail | Index

Thanks to Wenhan Lu for driving this effort, Hail (hail.is) for building the scalable infrastructure that enabled this, and @gnomAD-project.bsky.social for the data and support

28.11.2023 19:36 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

CHARR operates only on homozygous alternate sites and scales very well (โ€œcost per 1M samplesโ€ might be my new favorite metric):

28.11.2023 19:35 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Excited for my first post on this new site to share our work in print at AJHG using variant call data to estimate DNA contamination. As our sample sizes get into the millions of genomes, we need methods like this to efficiently process and quality control the data authors.elsevier.com/c/1i8PAgeX6LB~

28.11.2023 19:34 โ€” ๐Ÿ‘ 12    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@konradjk is following 20 prominent accounts