April Wei's Avatar

April Wei

@aprilwei.bsky.social

Population geneticist, CompBio, Cornell

425 Followers  |  114 Following  |  9 Posts  |  Joined: 28.11.2023  |  1.5847

Latest posts by aprilwei.bsky.social on Bluesky

Post image Post image Post image

Method works from simple and complex scenarios in jointly estimating epoch time, population size, migration rate (symmetric or asymmetric), growth rate, and admixture proportion. Software integrated with msprime, demes, tsinfer/tsdate, relate, and singer. github.com/aprilweilab/...

08.10.2025 14:48 โ€” ๐Ÿ‘ 4    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Inference of complex demographic history using composite likelihood based on whole-genome genealogies Accurate parametric inference on complex demographic models is a continuing challenge in population genetics. Ancestral recombination graphs (ARGs) provide richer information than simple population ge...

Excited to preprint our latest work (w/ Drew DeHaas, Zhibai Jia, Leo Speidel) on using ARGs for demographic inference. w/ applications using data from 1000 Genomes Project. www.biorxiv.org/content/10.1...

08.10.2025 14:48 โ€” ๐Ÿ‘ 45    ๐Ÿ” 23    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Fast Phenotype Simulation for Genotype Representation Graphs Motivation The Genotype Representation Graph (GRG) [[DeHaas et al., 2025][1]] is a graph representation of whole genome polymorphisms, designed to encode the variant hard-call information in phased wh...

Very proud of this manuscript with two talented undergraduate students, Aditya Syam and Chris Adonizio. We are continuing to push towards more scalable statistical genetics with Genotype Representation Graphs, and this is the start. www.biorxiv.org/content/10.1...

25.08.2025 14:26 โ€” ๐Ÿ‘ 11    ๐Ÿ” 6    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
IGD: A simple, efficient genotype data format Motivation While there are a variety of file formats for storing reference-sequence-aligned genotype data, many are complex or inefficient. Programming language support for such formats is often limit...

Our work (by Drew DeHaas) on an extremely simple yet efficient binary genotype format - designed to facilitate scalable bioinformatics tool development. www.biorxiv.org/content/10.1...

12.02.2025 01:03 โ€” ๐Ÿ‘ 10    ๐Ÿ” 10    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Biologically inspired graphs to explore massive genetic datasets - Nature Computational Science A recent study proposes a data structure that addresses crucial challenges related to storage and computation of large genome databases.

๐Ÿ“ขIn a recent News & Views, @ryanlayer.bsky.social discusses a data structure introduced by @aprilwei.bsky.social and colleagues for reducing storage and computational costs for phased whole-genome polymorphisms. www.nature.com/articles/s43...

๐Ÿ”“https://rdcu.be/d8ay3

31.01.2025 13:56 โ€” ๐Ÿ‘ 6    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
Enabling efficient analysis of biobank-scale data with genotype representation graphs Nature Computational Science - The genotype representation graph (GRG) is a compact data structure that encodes 200,000 human genomes in just 5โ€“26โ€‰gigabytes per chromosome. Computation...

Link to pdf. www.nature.com/articles/s43...

05.12.2024 17:10 โ€” ๐Ÿ‘ 2    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Wei Lab Web site created using create-react-app

My lab (aprilweilab.github.io) continues to develop GRG and ARG related methods & more. We are looking for a postdoc to join us.

05.12.2024 17:09 โ€” ๐Ÿ‘ 4    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Our work w/ two co-first authors Drew DeHaas and Ziqing Pan is now published. GRG allows large amounts of WGS polymorphism data to be analyzed in RAM via graph traversal & algebra operations & has some intrinsic connection w/ popgen data generating process & is different from ARG

05.12.2024 17:09 โ€” ๐Ÿ‘ 21    ๐Ÿ” 11    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Thanks, Alison. (that's me logging in 6mo later๐Ÿ˜‚

19.11.2024 22:22 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Genotype Representation Graphs: Enabling Efficient Analysis of Biobank-Scale Data bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution

We introduced an ARG-inspired data structure, Genotype Representation Graph (GRG), to enable lossless data compression and efficient computation through graph traversal. Developed a fast inference method. Cost ~80 GBP to convert 350TB VCF (200,000 UKBiobank WGS) into 160 GB GRG.
t.co/0badfCYz47

29.04.2024 18:20 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@aprilwei is following 20 prominent accounts