Yufeng Shen's Avatar

Yufeng Shen

@yshen.bsky.social

Computational genomics and human genetics. Associate Professor @ Columbia University Lab: http://www.columbia.edu/~ys2411/

206 Followers  |  156 Following  |  10 Posts  |  Joined: 09.09.2023  |  1.8521

Latest posts by yshen.bsky.social on Bluesky

this is a thing that I didn't know I needed 🀩

13.02.2026 15:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Our work of protein language models trained on biophysical dynamics was just published in @pnas.org. URL: doi.org/10.1073/pnas...

24.01.2026 20:01 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

πŸ“’ Our Dept. of Systems Biology at Columbia University has an open tenure-track Assistant Professor position in the broad area of quantitative biology. Come join our awesome department in NYC! Please circulate.
apply.interfolio.com/177622
Suggested deadline: 12/15/2025.
@columbiasysbio.bsky.social

15.11.2025 04:02 β€” πŸ‘ 31    πŸ” 37    πŸ’¬ 0    πŸ“Œ 1

OpenFold3-preview (OF3p) is out: a sneak peek of our AF3-based structure prediction model. Our aim for OF3 is full AF3-parity for every modality. We now believe we have a clear path towards this goal and are releasing OF3p to enable building in the OF3 ecosystem. MoreπŸ‘‡

28.10.2025 18:30 β€” πŸ‘ 125    πŸ” 42    πŸ’¬ 1    πŸ“Œ 3

Congrats!!

10.06.2025 15:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

congratulations!!

02.06.2025 20:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congratulations!!

02.06.2025 17:51 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Molecular dynamics simulations of intrinsically disordered protein regions enable biophysical interpretation of variant effect predictors Predictive models for missense variant pathogenicity offer little functional interpretation for intrinsically disordered regions, since they rely on conservation and coevolution across homologous sequ...

How can we better understand pathogenic variants in intrinsically disordered regions (IDRs)? How do models such as AlphaMissense and ESM1b predict pathogenicity, when these regions typically exhibit lower genomic conservation than ordered regions? Read more:
doi.org/10.1101/2025...

13.05.2025 14:15 β€” πŸ‘ 2    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Why do large protein language models like ESM2-15B underperform compared to medium-sized ones like ESM2-650M in predicting mutation effects? πŸ€”

We dive into this issue in our new preprintβ€”bringing insights into model scaling on mutation effect prediction. πŸ§¬πŸ“‰

29.04.2025 17:54 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

We have updated our protein lanuage model trained on structure dynamics. Our new models show significant better zero-shot performance on mutation effects of designed and viral proteins compared to ESM2. check the new preprint here: www.biorxiv.org/content/10.1...

17.04.2025 14:40 β€” πŸ‘ 2    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Congratulations!!

15.01.2025 12:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Why do association studies prioritize trait-specific variants???

A quick thread about the importance of thinking about all traits at once πŸ‘‡ 1/6 (πŸ§ͺ🧬)

17.12.2024 07:04 β€” πŸ‘ 49    πŸ” 24    πŸ’¬ 2    πŸ“Œ 4
Video thumbnail

Super excited to preprint our work on developing a Biomolecular Emulator (BioEmu): Scalable emulation of protein equilibrium ensembles with generative deep learning from @msftresearch.bsky.social ch AI for Science.

www.biorxiv.org/content/10.1...

06.12.2024 08:38 β€” πŸ‘ 441    πŸ” 147    πŸ’¬ 21    πŸ“Œ 29

Most of the talks from our Oct meeting are now online, with a few more to come: www.precisionmedicine.columbia.edu/videos

05.12.2024 15:47 β€” πŸ‘ 84    πŸ” 39    πŸ’¬ 0    πŸ“Œ 1

Right. With zeros, the point estimate of effect size is just too far off. Maybe you can use pseudo count to stabilize it, although how to do pseudo count depends on assumptions about priors

23.11.2024 19:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Small sample size is a problem, but my hunch is normal approximation would underestimate the tail on the right side of the distribution, therefore stderr based meta analysis would deflate type I

23.11.2024 16:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

congratulations!!

12.11.2024 21:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The final program for the conference is now up: events.columbia.edu/cal/event/ev...

28.09.2024 18:30 β€” πŸ‘ 10    πŸ” 12    πŸ’¬ 0    πŸ“Œ 0

Open rank faculty search in the Program for Mathematical Genomics at Columbia University:

www.nature.com/naturecareer...

05.12.2023 01:06 β€” πŸ‘ 13    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

I highly recommend it as a generic method. In our analysis, it has the most consistent score scale across genes, even for genes under-represented in ClinVar or with somewhat weak MSA. A few other methods, like REVEL, gMVP (from us), and EVE, are better for certain genes but less inconsistent.

10.10.2023 21:39 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
An Owner's Guide to the Human Genome An Owner's Guide to the Human Genome

I'm delighted to release the first half of my new textbook in human genetics:
web.stanford.edu/group/pritch...

"An Owner's Guide to the Human Genome: an introduction to human population genetics, variation and disease"

01.10.2023 22:53 β€” πŸ‘ 294    πŸ” 175    πŸ’¬ 8    πŸ“Œ 11
University of Massachusetts Chan Medical School, Program in Bioinformatics and Integrative Biology Full service online faculty recruitment and application management system for academic institutions worldwide. We offer unique solutions tailored for academic communities.

2nd Bluesky post … faculty search! Genomics and Comp Bio dept @ UMass Chan Med School academicjobsonline.org/ajo/jobs/25641 pop genomics, imaging, stat. genetics, machine learning aka cool science w big data.

Collab & supportive environment where you can innovate & make discoveries. Join us!

13.09.2023 20:50 β€” πŸ‘ 45    πŸ” 59    πŸ’¬ 2    πŸ“Œ 1

@yshen is following 19 prominent accounts