John Butts's Avatar

John Butts

@j-c-butts.bsky.social

Biomedical Sciences PhD Candidate at The University of Maine and The Jackson Laboratory Genetics. Music. Tennis. (Not necessarily in that order)

48 Followers  |  52 Following  |  11 Posts  |  Joined: 12.04.2025  |  1.53

Latest posts by j-c-butts.bsky.social on Bluesky

And a huge thank you to these incredible people!
@alwaysrong.bsky.social @sagergosai.bsky.social @rodrigoicastro.bsky.social @mackenzie-noon.bsky.social @pardissabeti.bsky.social @stevereilly.bsky.social @tewhey.bsky.social @yaleschoolofmed.bsky.social @jacksonlab.bsky.social @umaine.bsky.social

23.04.2025 17:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We have produced a comprehensive catalog of non-coding variant effects and make these predictions available to the community. We encourage researchers interested in gene regulation across fields to explore our precomputed predictions or generate their own to guide their future experiments!

23.04.2025 17:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Identifying non-coding variant effects at scale via machine learning models of cis-regulatory reporter assays Supplemental data and resources for "Identifying non-coding variant effects at scale via machine learning models of cis-regulatory reporter assays" Including: MPAC predictions for: Siraj 2024 (UKBB/BB...

We have made all predictions available on Zenodo and encourage researchers interested in disease, clinical variant interpretation, GWAS studies, regulatory grammar, and evolution to download and explore these data.

zenodo.org/records/1518...

23.04.2025 17:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Lastly we investigate all human promoters by saturation mutagenesis, identifying canonical promoter TFs and linking non-coding variant effect size to coding constraint (LoEUF), bridging the gap between coding and non-coding function.

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

MPAC can scale to predict 514M gnomAD variant effects and we quantify the relationship between allele frequency or evolutionary conservation with predicted skew at an unprecedented level. Notably, we find that variants causing high skew are under greater constraint than those with small effects.

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

In COSMIC we identify known non-coding driver mutations (TERT) and by combining variant recurrence, regulatory element annotations, and cancer-associated promoters we nominate 1,892 emVars as putative non-coding drivers.

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Many clinically identified non-coding variants lack clear effects, using MPAC we can predict the impact of all ClinVar non-coding variants and observe enrichments in pathogenic alleles for highly disruptive variants (emVars).

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

MPAC predictions distinguish causal variants from the UK Biobank, Biobank Japan and eQTLs from GTEx with experimental accuracy but without experimental overhead!

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Trained on MPRA data from a large-scale study of human trait and eQTL variants (www.biorxiv.org/content/10.1...) and extending the Malinois model architecture (www.nature.com/articles/s41...) MPAC predicts variant effects with high accuracy.

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Massively Parallel Reporter Assays (MPRAs) quantify the activity of 10-100s of thousands of sequences, however, it is not feasible to test all known variation. Modeling MPRA can increase scale and lead to better understanding of complex traits, somatic and germline diseases, and population genetics.

23.04.2025 17:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
| bioRxiv bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution

Excited to share our MPAC preprint, a scalable ensemble of ML models for genome-wide non-coding variant effect prediction and our findings from 575M predictions across databases including @ukbiobank.bsky.social, GTEx, ClinVar, COSMIC, and @gnomad-project.bsky.social
www.biorxiv.org/content/10.1...

23.04.2025 17:28 β€” πŸ‘ 17    πŸ” 9    πŸ’¬ 2    πŸ“Œ 3

@j-c-butts is following 20 prominent accounts