FLARE2: local ancestry inference with poorly-matched reference panels https://www.biorxiv.org/content/10.1101/2025.10.13.681993v1
14.10.2025 17:34 β π 0 π 3 π¬ 0 π 0@sdtemple.bsky.social
PhD Statistician (github.com/sdtemple) UMich & UWashington Stats
FLARE2: local ancestry inference with poorly-matched reference panels https://www.biorxiv.org/content/10.1101/2025.10.13.681993v1
14.10.2025 17:34 β π 0 π 3 π¬ 0 π 0Overall, the method is a scalable and interpretable way to look for haplotype homozygosity in biobanks. I have spent a lot of time implementing it to make it user friendly. Running this can be important to check for confounding: see our other preprint.
pubmed.ncbi.nlm.nih.gov/40672175/
While we have improved (made a more conservative threshold for the African American analysis), there are still unmodeled complexities and likely fine-scale structure that violates our assumption. Buyer beware if signal is weak (barely GW significant)!
26.09.2025 16:29 β π 0 π 0 π¬ 1 π 0We filter to have small admixture proportions in our African American analysis. We think that admixture is more likely to deflate our "selection signal", not inflate it. This is a robustness check.
26.09.2025 16:29 β π 0 π 0 π¬ 1 π 0We think that low-mappability, problematic regions are more likely to deflate our "selection" signal, not inflate it. See Appendix B. This is a robustness check.
26.09.2025 16:29 β π 0 π 0 π¬ 1 π 0There is a large signal of unusual haplotype structure at a chr16 locus in all groups. I also see this when analyzing other consortia data. We describe it in more detail as a complicated region enriched with deletions. Gusev et al have also reported on it.
academic.oup.com/mbe/article/...
The final thesis chapter is published OA at
@ajhgnews.bsky.social!
In short, we determine a valid significance level for a "selection" scan and apply it to 3 ancestry groups in 2 biobanks. π§΅ for updates from the helpful peer review.
www.sciencedirect.com/science/arti...
This is a new conceptual figure which explains how to make our proofs. I hope this idea is helpful to other population geneticists.
16.07.2025 15:33 β π 0 π 0 π¬ 0 π 0Our paper on the limiting distribution of IBD rates (Gaussian) is now published in TPB. Really an amazing experience to work with and learn from Prof Elizabeth Thompson!
doi.org/10.1016/j.tp...
This project completes my development of our suite of methods called isweep, which can detect recent selection and case-control associations, can narrow in on the selection signal, and has fast local simulations to perform statistical inference.
07.07.2025 22:34 β π 0 π 0 π¬ 0 π 0You can also run a fast scan with randomized phenotypes to double check if selection is confounding.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0Sadly, the test is prone to confounding due to very strong positive and recent selection (see LCT in European ancestry cohorts). You can automatically check for this as a part of the automated workflow.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0Two of the six genome-wide significant signals are about current therapeutic targets, e.g., at the NBAS gene.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0We ran the scalable scan for three ancestry cohorts (European, African, and Amish) in the Alzheimer's Disease Sequencing Project.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0The null model is again motivated by the Temple and Thompson (2025) central limit theorems, which is now published in Theoretical Population Biology.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0The test is a two-sample modification of the one-sample selection scan from my prior work (Temple and Browning, 2025): high IBD rates differences in cases versus controls could indicate a genetic association.
07.07.2025 22:34 β π 0 π 0 π¬ 1 π 0Presenting our new scan that uses IBD segments to detect genetic associations with cohorts stratified by a categorical variable (i.e., case-control studies).
www.biorxiv.org/content/10.1...
The mutation rate of SARS-CoV-2 is highly variable between sites and is influenced by sequence context, genomic region, and RNA structure
academic.oup.com/nar/article/...
And the tskibd paper itself on positive selection in Pf. is very interesting.
www.nature.com/articles/s41...
We discuss the use cases of local versus genome-wide IBD simulation. I'll say I have had a great time using msprime + tskibd for genome-wide simulations. We find it is accurate w.r.t. to our work. I highly recommend it.
github.com/bguo068/tskibd
One fantastic suggestion from both peer reviewers was to validate the accuracy of IBD segment length distributions w.r.t. existing tools. These studies are now included as Figures S7-10.
02.06.2025 16:56 β π 0 π 0 π¬ 1 π 0I'm excited to share our open-access publication of my simulation paper with Sharon Browning and Elizabeth Thompson @springernature.com . We show approximately linear runtime to simulate local IBD, which was essential for my other thesis work.
link.springer.com/article/10.1...
I'm excited to present my work on rigorous and scalable multiple-testing corrections for complex haplotype scans in WGS at ASA STATGEN 2025!
t.co/rv205OBCs1
Correction: 3) should be approx. has the first-order Markov property. This is what I get for tweeting fast.
30.01.2025 21:35 β π 0 π 0 π¬ 0 π 0A pair of manuscripts from Sharon Browning on applications of IBD segments for mapping disease-associated loci and detecting natural selection.
www.biorxiv.org/content/10.1...
www.biorxiv.org/content/10.1...
There are long tables of the genes/loci identified. I put all tables and figures at end of papers, so easy to print and staple those. Pay attention to the shared signals. That's a wrap. π
30.01.2025 20:49 β π 0 π 0 π¬ 0 π 0Beyond the Temple and Thompson asymptotics, empirically the IBD rates look ~= normally distributed, outside of many the selected loci
30.01.2025 20:49 β π 0 π 0 π¬ 1 π 0The assumption of IBD rates having exponentially decaying correlations has good empirical evidence across ancestry groups and simulations. We're thinking about if an IBD segment covers both positions, and the segment lengths are often modeled as exponential/Gamma.
30.01.2025 20:49 β π 0 π 0 π¬ 1 π 0The main reason we wrote this is to have a GW significance level for an African ancestry analysis, whereas everyone already knows about LCT selection in Europeans. We replicated a few signals across TOPMed and UKBB samples. I find the chr16 hit compelling.
30.01.2025 20:49 β π 0 π 0 π¬ 1 π 0You can run my software github.com/sdtemple/isw... with a few command lines and a cluster. We did this for many ancestry groups. The LCT, MHC, and OCA2 signals are shared across European and Indian cohorts.
30.01.2025 20:49 β π 0 π 0 π¬ 1 π 0