This is the preprint write up of my sabbatical work with Dave Kelleyβs group at Calico. We tried out several transformer replacements for multi-task learning in functional genomics (i.e. what Borzoi does). Mamba, in particular, seems to outperform a mini version of Borzoi, especially when βstripedβ.
18.02.2025 04:17 β π 28 π 10 π¬ 1 π 0
Contribution of autosomal rare and de novo variants to sex differences in autism
Autism is four times more prevalent in males than females. To study whether this reflects a difference in genetic predisposition attributed to autosomβ¦
Our new paper about rare variant contributions to sex differences in autism is out at AJHG, led by Mahmoud Koko with @vw1234.bsky.social and Kyle Satterstrom. Biggest analysis of exome data in autism to date including SPARK and ASC. www.sciencedirect.com/science/arti...
16.02.2025 12:29 β π 14 π 5 π¬ 0 π 1
Can DNA sequence models predict mutations affecting human traits?
We introduce TraitGym, a curated benchmark of causal regulatory variants for 113 Mendelian & 83 complex traits, and evaluate functional genomics and DNA language models. Joint work w/ GΓΆkcen Eraslan and @yun-s-song.bsky.social π§΅π
13.02.2025 20:57 β π 28 π 15 π¬ 1 π 2
On-target edit events detected for different cell types (K562, primary T cells, GM12878, Jurkats) and guides using TIDE. Most tested guides resulted in efficient insertions of the 34bp donor sequence containing the T7 promoter.
Superb-seq is also EASY TO PERFORM. Protocol uses standard kits, equipment, and NO virus. We also provide free software, SHERIFF, to analyze Superb-seq data. We hope this will enable wide-spread adoption and use in diverse cell types!
11.02.2025 23:25 β π 0 π 0 π¬ 0 π 0
Superb-seq could be used to readily distinguish benign from potentially risky Cas9 off-target edit profiles for therapeutic applications. Perturb-seq and Guide-seq lack this capability, since they do not jointly detect edit sites and transcriptomes at single cell resolution.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Guide presence is an unreliable indicator of on-target guide editing. Other methods that use guide-only read outs may be confounded by off-target edits that are sometimes even more frequent than the on-target edit.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Edit events colored by the corresponding guide. Only one guide had no detected off target events. Y-axis shows homology between genome and guide spacer. X/I/O indicates whether edit is exonic, intronic, or intergenic. Most off-target edits are intronic.
All off-targets were non-coding, so paired scRNA-seq was crucial to determine their functional effects using differential expression, as shown for the USP9X intronic edit above.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Venn diagram showing overlap between off-target edit sites identified with Superb-seq and those predicted by in silico tools Cas-OFFinder, COSMID and E-CRISP. The overlap is poor and 11 off-targets were not predicted by any computational tool.
Many of the off-target edit sites (total = 36) were not predicted by popular in silico tools. These tools appear to underestimate the tolerance of guide bulges and mismatches with off-target edit sites, yet also predict many off-target edits that do not occur.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Pairwise alignments of SMARCA4 g22 sequence and the genome site at the edit. All off-target sites have PAMS, but have various mismatches to the guide spacer sequence.
Off-target sites had clear guide homology and PAMs. There was however a surprising tolerance of guide bulges and mismatches with off-targets, including mismatches in the βseedβ region.
11.02.2025 23:25 β π 1 π 1 π¬ 1 π 0
Detected edits for SMARCA4 guide 22. 13 edits were detected. Y axis is the frequency of the edit (number of cells) and X axis is the homology between the genome and the guide spacer sequence. Some off target events are more frequent than the intended on-target.
One guide had an off-target edit that was 34x more frequent than the on-target edit! A total of 5 off-target sites were observed in more cells than the on-target for this particular guide.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Sheriff counts edit alleles per cell at each edit site. An off-target edit within the first intron of USP9X is associated with differential expression of USP9X and >100 downstream genes.
We quantified Cas9 edits per cell (βedited allelesβ), and associated edit alleles with gene expression. One intronic off-target edit perturbed the expression of USP9X and >100 downstream genes!
11.02.2025 23:25 β π 1 π 0 π¬ 1 π 0
Genome positions and frequencies (number of cells) of on and off-target events detected with Superb-seq.
SURPRISING RESULT. We applied Superb-seq to 10k K562 cells and detected pervasive off-target Cas9 edits, with an average of 6 off-target sites per guide, ranging in frequency from 0.03-18.6% of cells!
11.02.2025 23:25 β π 1 π 0 π¬ 1 π 0
A schematic of Superb-seq showing 3 steps: (1) edit labeling with donor sequence containing T7 promoter; (2) In situ transcription with T7 polymerase; (3) combinatorial single-cell RNA-seq + computational analysis with Sheriff.
Superb-seq detects edits and cell RNA by labelling nuclease cleavage sites with homology-free insertion of a T7 promoter, followed by βZombieβ in situ transcription with T7-pol to generate edit-site marking barcoded RNA. T7 and cell RNAs are jointly read out with scRNA-seq.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
Unlike single-cell Perturb/CROP-seq methods that read guide RNAs as a proxy of edits, Superb-seq captures edit events directly. This enables functional assessment of edits including off-target events that confound analyses and raise gene therapy risk.
11.02.2025 23:25 β π 0 π 0 π¬ 1 π 0
AI and cognitive science, Founder and CEO (Geometric Intelligence, acquired by Uber). 8 books including Guitar Zero, Rebooting AI and Taming Silicon Valley.
Newsletter (50k subscribers): garymarcus.substack.com
CS PhD Candidate at Stanford. Working at the intersection of Machine Learning, Regulatory Genomics, and Complex Disorders
Laboratory of Professor James Noonan | Department of Genetics at Yale University. Science is a team sport. Reposts are not necessarily endorsements. We speak only for ourselves. Website: noonanlab.org
cocktail enthusiast | neuroscientist | SABV influencer | urban gardener | drum learner | she/her
Insta: @doc_becca (fun stuff)
Signal: @doc_becca.87 (secrets)
Neuroscientist. Human, mouse, and zebrafish geneticist. Trying to understand and treat neurodevelopmental and neuromuscular disease.
If you recognize my avatar from the OG #scitwitter crowd, give me a follow and say Hi.
Founder, CEO, J Craig Venter Institute: Scientist, Author (A Life Decoded: Life at the Speed of Light: The Sorcerer II Expedition), sequenced the first human genome; created the first synthetic cell. Pilot Cirrus SR22T, G7: sailor: SUP surfer: Woodworker
Berkeley professor (Bioeng, Compbio). Visiting Scientist at Calico. JBrowse genome browser / Apollo annotation editor, ML for gene regulation / molecular evolution / synbio. Occasional music, games, jokes
Professor of Genetics and Pediatrics, UPenn/CHOPπ§¬. https://orcid.org/0000-0003-2025-5302. Views my own. π΄σ §σ ’σ ³σ £σ ΄σ ΏπΊπΈ
Same content, different website! Human evolutionary genomics, functional genomics and archaic hominins. Group leader in Human Genomics at SVI in Melbourne/Naarm, Australia. Durian evangelist. Sometimes I go to Estonia.
Scientist errant. Genetics, evolution, whimsy, awe. https://scholar.harvard.edu/jtennessen
π§¬π¦ππ©Έπ€π¦ ππΈπ§¬
Biomedical policy and climate-and-health reporter at Science. Author of The Vaccine Race. Book lover. History lover. Dog lover. Dual U.S.-Canadian citizen, proud Vancouver native. Signal me at: meredithwadman.13
Automatically tweets new posts from http://statmodeling.stat.columbia.edu
Please respond in the comment section of the blog.
Old posts spool at https://twitter.com/StatRetro
Epigenetics & Genomics
Harvard University & Broad Institute
https://www.macfound.org/fellows/class-of-2023/jason-d-buenrostro
buenrostrolab.com
Bluesky newbie. Twitter exile. Previously Professor of Diabetes in Oxford, now heading human genetics at Genentech. Living the California dream. Views my own. #YNWA
Assistant Professor @ Stanford Genetics & BASE Initiative. Mapping the regulatory code of the human genome to understand heart development and disease. www.engreitzlab.org
Group Leader in Human Genetics, Wellcome Sanger Institute
Assistant Professor at UCSF | CZ Biohub β SF Investigator
https://diego-calderon-lab.github.io
https://pharmacy.ucsf.edu/diego-calderon
Head of Human Genetics & Senior Group Leader @ Wellcome Sanger Institute. Statistical geneticist working on complex diseases #ibd #firstgen.
Our group develops and applies computational approaches to study molecular variations and their phenotypic consequence. We are part of DKFZ and EMBL.
Website: https://steglelab.org/