Kaitlin Samocha's Avatar

Kaitlin Samocha

@ksamocha.bsky.social

Assistant Investigator @ MGH / Broad / HMS. Focus on human genomics and modeling rare variation. She/her

1,410 Followers  |  229 Following  |  58 Posts  |  Joined: 26.09.2023  |  1.6284

Latest posts by ksamocha.bsky.social on Bluesky

Post image

πŸ—£οΈ Quote of #ESHG2025 (so far)

"Who licks bone !?!" 🦴
- Johannes Krause

Anyone have that on your bingo card?

Well apparently archeologists do, to distinguish bone from stones and it causes problems in DNA sequencing. πŸ€”

24.05.2025 13:46 β€” πŸ‘ 28    πŸ” 10    πŸ’¬ 1    πŸ“Œ 0

We are just wrapping up day 1 at #ESGH2025 in beautiful Milan. For those who want some extra fun while listening to the great science, you can play bingo.πŸ‘‡

I know multiple of these have already occurred.

24.05.2025 17:14 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Buongiorno Milano! Ready for a great day 1 of #eshg2025?
Packed program of excellent science 8.30am-8.00pm - plus networking event till 9.30pm to meet many friends, colleagues and collaborators! …andiamo @eshg.bsky.social @eshgyoung.bsky.social

24.05.2025 04:45 β€” πŸ‘ 15    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Preview
Scalable automated reanalysis of genomic data in research and clinical rare disease cohorts Reanalysis of genomic data in rare disease is highly effective in increasing diagnostic yields but remains limited by manual approaches. Automation and optimization for high specificity will be necess...

πŸ€— Hugely excited to share our work on automating iterative reanalysis in #raredisease, preprint out: www.medrxiv.org/content/10.1... πŸ€–πŸ§¬

github.com/populationge...

A superb collaboration with @dgmacarthur.bsky.social @cassimons.bsky.social @heidirehm.bsky.social @ksamocha.bsky.social and many more!

23.05.2025 06:04 β€” πŸ‘ 24    πŸ” 13    πŸ’¬ 1    πŸ“Œ 5
Post image

Human Developmental Cell Atlas (HDCA) expression data is now displayed. Expression is displayed in 12 sections of a 6-7 post-conception week human embryo, alongside a sagittal view which displays the region of the embryo represented by each section @mhaniffa.bsky.social

07.05.2025 13:29 β€” πŸ‘ 17    πŸ” 7    πŸ’¬ 0    πŸ“Œ 1
Preview
My son Charlie β€” and the breakthrough that changed our lives James Coney and his wife, Sarah, struggled not knowing why their 12-year-old was born with a severe learning disability. In their darkest moments, they blamed themselves. Then, out of the blue, came a...

A few weeks ago, I had an incredibly emotional call with James Coney, a writer for the Sunday Times whose son Charlie was in the @genomicsengland.bsky.social 100k project and was recently diagnosed with ReNU syndrome. This beautiful article tells their story ❀️ www.thetimes.com/article/0bcc...

02.03.2025 12:06 β€” πŸ‘ 111    πŸ” 41    πŸ’¬ 5    πŸ“Œ 6

Yes, I've done this once before for a paper (listing in the system only first and last authors). It is easy to get those handful of people to approve additional authors in the future if the journal requires it.

Not sure if recommended, but it saved time.

21.02.2025 17:33 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Genomics of Rare Disease (hybrid conference)
Dates: 9-11 April 2025
Location: Hinxton Hall Conference Centre, Wellcome Genome Campus, UK and online 

In-person registration deadline: 11 March 2025
Virtual registration deadline: 1 April 2025

Genomics of Rare Disease (hybrid conference) Dates: 9-11 April 2025 Location: Hinxton Hall Conference Centre, Wellcome Genome Campus, UK and online In-person registration deadline: 11 March 2025 Virtual registration deadline: 1 April 2025

Join leading experts working in #RareDisease research at our #GRD25 conference.

πŸ“… Dates: 9-11 April 2025
πŸ’­ Share insights in person

Explore the latest #genomics advances accelerating improvements in clinical care for rare disorders, globally.

⏰Secure your place by 11 March: bit.ly/3BpAe44

17.02.2025 13:21 β€” πŸ‘ 9    πŸ” 12    πŸ’¬ 0    πŸ“Œ 6

Projects like this can’t be completed without many others: thanks to Mark Daly for continued mentorship across the years; critical support and work from @anneotation.bsky.social, @konradjk.bsky.social, Predrag Radivojac; the Hail team; and everyone associated with @gnomad-project.bsky.social.

10/10

19.04.2024 23:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

If this work feels familiar, it is because it is building off older work from our team originally released for ExAC.

I view this iteration as more of a franchise reboot instead of a sequel – we have new leads, but similar themes.

9/10

19.04.2024 23:39 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

As with all gnomAD-led projects, we’ve already shared the data and code. Regions are displayed for v2 on the gnomAD browser, the code can be seen on Github (github.com/broadinstitu...), and MPC scores are available for download.

8/10

19.04.2024 23:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

There is still much more to learn: using 125k exomes, our median region size is ~450bp and we see a relationship between transcript length and the number of regions we can identify due to statistical power.

7/10

19.04.2024 23:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Finally, missense constraint information was incorporated into a deleteriousness metric named MPC (Missense deleteriousness Prediction by Constraint), which separates case from control de novo missense variants well with similar performance to ML models like AlphaMissense.

6/10

19.04.2024 23:37 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

In collaboration with Predrag Radivojac and team, we demonstrated that coding bases with < 20% of their expected missense variation achieve moderate support for pathogenicity (PM1) following ACMG/AMP guidelines that can be used for clinical classification.

5/10

19.04.2024 23:37 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Missense depleted regions show an enrichment of (1) de novo missense variants in neurodevelopmental disorder cases compared to controls, (2) partitioned common variant heritability for >260 independent traits from the UK Biobank, and (3) ClinVar pathogenic (P/LP) variants.

4/10

19.04.2024 23:36 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Why try to find subgenic regions that are specifically missense constrained? Splitting up genes reveals patterns of negative and neutral selection that are obscured when looking gene-wide, including highlighting regions that have a large number of known pathogenic variants.

3/10

19.04.2024 23:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Co-led by the fabulous Katherine Chao and Lily Wang, we used gnomAD v2 and a recursive search to identify ~28% of canonical transcripts that were split into multiple missense constraint regions (measured by variable missense depletion in gnomAD).

2/10

19.04.2024 23:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Recently out on #bioRxiv: our updated approach to identify regional variability in missense mutation intolerance (β€œconstraint”) in protein-coding genes using the gnomAD database.

www.biorxiv.org/content/10.1...

1/10

19.04.2024 23:34 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Some updated guidance on our gnomAD v4 constraint scores: gnomad.broadinstitute.org/news/2024-03...

The @gnomad-project.bsky.social team is hard at work on v4.1 and improvements across the board, so expect more updates.

Thanks to Katherine Chao for spearheading this blogpost.

08.03.2024 16:34 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

A freely accessible version of the paper can be found here: rdcu.be/dsVXx

08.12.2023 02:38 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Our data are freely available on the gnomAD browser (gnomad.broadinstitute.org).

A huge thank you to Mike Guo, Laurent Francioli, Sarah Stenton, and Julia Goodrich for their hard work on this. This, of course, wouldn't be possible without everyone involved with @gnomad-project.bsky.social.

08.12.2023 02:38 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We also counted the number of genes that appeared to have compound heterozygous (in trans) deleterious variants and didn’t find that many. After manual curation, we only found seven genes with high-confidence compound heterozygous loss-of-function variants with some caveats.

08.12.2023 02:36 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

Accuracy of our approach remains high across a range of allele frequencies and across genetic ancestry groups. As expected, genetic distance and highly mutable sites both impact accuracy.

08.12.2023 02:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

In this work, we used gnomAD v2 to estimate haplotype frequencies, and thereby predict the probability that two rare variants observed in the same gene are in trans. This approach is highly accurate (~96%) for variants with known phase in two independent datasets.

08.12.2023 02:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Our paper describing a way to infer the phase of rare variant pairs using gnomAD v2 is out now in Nature Genetics.

We hope that the resource we generated will be useful when interpreting rare co-occurring variants in the context of recessive disease.

www.nature.com/articles/s41...

08.12.2023 02:31 β€” πŸ‘ 16    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

As a note, I consulted with a few people this week (Katherine Chao, Sam Baxter, Jack Fu) to make sure that I covered as many of the key players as I could. Any remaining omissions or errors are all my own.

6/6

17.11.2023 19:00 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Finally, I’d like to thank the steering committee of gnomAD, and particularly to our co-Directors: Mark Daly and Heidi Rehm. It’s been fun to be a part of this amazing project and process.

5/6

17.11.2023 19:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

On the leadership side, we have a wonderful Scientific Advisory Board who provides suggestions and guidance for us. I’m looking forward to more discussions about how we can make this resource better.

4/6

17.11.2023 19:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The genetic data were collected, curated, and sequenced as part of many, many projects – we have over 300 data contributors for gnomAD v4!

All contributors and projects are listed on our About page:
gnomad.broadinstitute.org/about

3/6

17.11.2023 18:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Of course, gnomAD isn’t a reference population resource without the individuals who agreed to participate in research and shared their genetic information.

To all of you: thank you for contributing. You are the heart of gnomAD.

2/6

17.11.2023 18:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@ksamocha is following 20 prominent accounts