Yusuf Roohani's Avatar

Yusuf Roohani

@yusufroohani.bsky.social

Machine Learning & Systems Biology. ML Group Leader @arcinstitute. PhD @StanfordAILab http://www.yusufroohani.com

177 Followers  |  124 Following  |  8 Posts  |  Joined: 26.11.2024  |  1.8534

Latest posts by yusufroohani.bsky.social on Bluesky

Post image

We're hiring! Come join the team and scale new heights with us! πŸ”οΈ

arcinstitute.org/jobs

25.02.2025 14:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Arc Virtual Cell Atlas launches, combining data from over 300 million cells | Arc Institute Arc Institute today launched the Arc Virtual Cell Atlas, a growing resource for computation-ready single-cell measurements, starting with data from over 300 million cells. The initial release of the A...

scBaseCamp is released as part of the Arc Virtual Cell Atlas!

Great work by Nick Youngblut, Chris Carpenter, Alex Dobin, Dave Burke, @genophoria.bsky.social and team

πŸ“’Announcement: arcinstitute.org/news/news/ar...

πŸ”—Data access: github.com/ArcInstitute...

πŸ“„Report: arcinstitute.org/manuscripts/...

25.02.2025 14:35 β€” πŸ‘ 1    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Uniform processing lowers technical variation between scBaseCamp datasets.

Technical factors such as library chemistry and suspension type (single-cell vs single-nucleus) exhibited comparable or lower silhouette scores than biologically meaningful categories like tissue type

25.02.2025 14:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

scBaseCamp is the first large biological data repository curated by an AI agent

We built a hierarchical agentic workflow (SRAgent) to automate discovery, metadata extraction & data processing

It is consistent, easily scalable and automatically updates when new data is available

25.02.2025 14:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

scBaseCamp was built by directly mining all publicly accessible 10X Genomics scRNAseq data from the Sequence Read Archive (SRA)

With over 230M cells drawn from 21 species and 72 tissues, scBaseCamp is significantly larger and more diverse than existing single-cell data repositories

25.02.2025 14:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

At the @arcinstitute.org we are building AI models of cell state from the ground up, rethinking every step, from data generation to biologically relevant evaluation

Today we launch scBaseCamp, the largest public repository of single cell RNAseq data, uniformly processed from raw sequencing reads.

25.02.2025 14:35 β€” πŸ‘ 16    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

Why do you want to switch

04.12.2024 21:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

very easy to do this in Pycharm

04.12.2024 20:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@yusufroohani is following 20 prominent accounts