Shantanu Singh's Avatar

Shantanu Singh

@shantanu-singh.cc.bsky.social

computation biology, drug discovery, computer vision, microscopy, statistics, machine learning, all happening at https://carpenter-singh-lab.broadinstitute.org/

3,734 Followers  |  886 Following  |  59 Posts  |  Joined: 01.10.2023  |  1.8394

Latest posts by shantanu-singh.cc on Bluesky

Bluesky

Big shout out to the whole team: Alรกn F. Muรฑoz, Tim Treis, @shatavishadg.bsky.social , @fabiantheis.bsky.social ntheis.bsky.social, @drannecarpenter.bsky.social, @shantanu-singh.cc

6/6

08.07.2025 19:22 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ”ฌAPI-first feature extraction for image-based profiling workflows

If you need to obtain interpretable features from your segmented microscopy images, but want to do it in a fully automated way, we know the struggle.

1/6

08.07.2025 19:22 โ€” ๐Ÿ‘ 46    ๐Ÿ” 20    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐ŸŒŸ ๐—ฅ๐—ฒ๐—บ๐—ถ๐—ป๐—ฑ๐—ฒ๐—ฟ ๐ŸŒŸ
The deadline for abstract submissions for oral presentations at Cytodata 2025 in Berlin is approaching!

๐Ÿ‘‰ ๐—ฆ๐˜‚๐—ฏ๐—บ๐—ถ๐˜ ๐˜†๐—ผ๐˜‚๐—ฟ ๐—ฎ๐—ฏ๐˜€๐˜๐—ฟ๐—ฎ๐—ฐ๐˜ ๐—ฏ๐˜† ๐—๐˜‚๐—ป๐—ฒ 25!
cytodata25.eu-openscreen.eu/registration/

#BerlinConference #Imageanalysis #Microscopy

20.06.2025 10:35 โ€” ๐Ÿ‘ 7    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

Delighted to see this out in print! It captures everything several of us in the field have been thinking about on the topic of measuring signal in high-dimensional profiling data, and I couldn't think of a better torchbearer and storyteller than @alxndrkalinin.bsky.social to champion this work.

11.06.2025 12:59 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
A versatile information retrieval framework for evaluating profile strength and similarity - Nature Communications Profiling assays measure thousands of features to uncover biological insights but lack reliable methods for quality evaluation. Here, the authors develop a versatile information retrieval framework to...

๐Ÿšจ New paper alert! We developed a versatile information retrieval framework that uses mean average precision (mAP) to robustly quantify sample activity and similarity in large-scale profiling data. Now out โ€ช@natcomms.nature.com: doi.org/10.1038/s414...

More in the ๐Ÿงต below:
1/7

10.06.2025 19:53 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2
Post image

Aleatoric and epistemic uncertainty are clear-cut concepts, right? ... right? ๐Ÿ˜ตโ€๐Ÿ’ซ In our new ICLR blogpost we let different schools of thought speak and contradict each other, and revisit chatbots where โ€œthe character of aleatory โ€˜transformsโ€™ into epistemicโ€ iclr-blogposts.github.io/2025/blog/re...

08.05.2025 08:18 โ€” ๐Ÿ‘ 31    ๐Ÿ” 9    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

๐Ÿš€๐Ÿ”ฌ๐Ÿฆ  Releasing ๐Ÿค–Cellpose-SAM๐Ÿค–, a cellular segmentation algorithm with superhuman generalization ๐Ÿฆธโ€โ™€๏ธ. Try it now on ๐Ÿค— huggingface.co/spaces/mouse...

paper: www.biorxiv.org/content/10.1...
w/ @computingnature.bsky.social 1/n

03.05.2025 19:12 โ€” ๐Ÿ‘ 155    ๐Ÿ” 50    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 7

"Ich askid ChatGPT," Well Ich askid the stones, and the forest, and the rayne, and the wynde, and what thei seyde was learninge, and dreames, and growinge thinges, and a worlde wher we talke to each othir.

07.05.2025 03:45 โ€” ๐Ÿ‘ 1269    ๐Ÿ” 362    ๐Ÿ’ฌ 11    ๐Ÿ“Œ 4
Model to Meaning: How to interpret statistical models with marginaleffects for R and Python

Model to Meaning: How to interpret statistical models with marginaleffects for R and Python

๐Ÿ“š๐Ÿ˜…๐ŸŽ‰

Yay!! I just submitted the complete manuscript of my upcoming book to the publisher!

Learn to easily and clearly interpret (almost) any stats model w/ R or Python. Simple ideas, consistent workflow, powerful tools, detailed case studies.

Read it for free @ marginaleffects.com

#RStats #PyData

10.04.2025 19:06 โ€” ๐Ÿ‘ 589    ๐Ÿ” 147    ๐Ÿ’ฌ 19    ๐Ÿ“Œ 9
Post image

๐ŸšจLatin American Workshop Series๐Ÿšจ

The team from @bethcimini.bsky.socialโ€˜s lab at @broadinstitute.org and the Center for Open Bioimage Analysis will host three free online image analysis workshops for LATAM ... in SPANISH๐Ÿ‡ช๐Ÿ‡ธ and PORTUGUESE๐Ÿ‡ง๐Ÿ‡ท!!!๐ŸŽ‰๐ŸŽŠ

broad.io/latam_worksh...

Deadline: April 23rd, 2025

10.04.2025 13:36 โ€” ๐Ÿ‘ 20    ๐Ÿ” 17    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 3
Preview
Ask Erin, Dear Beth On Ask Erin/Dear Beth, bioimage analysis experts Beth Cimini and Erin Weisbart, of the Imaging Platform at the Broad Institute of MIT and Harvard, answer your image analysis questions! Whether itโ€™s ab...

Delighted to announce that @erinweisbart.bsky.social and I have teamed up to create a new bioimage analysis video podcast called Ask Erin/Dear Beth - you can check it out at the link below! It will highlight common challenges in #bioimageanalysis, as well as our favorite solutions to them. (1/x)

07.04.2025 22:11 โ€” ๐Ÿ‘ 55    ๐Ÿ” 29    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
Software engineering intern - summer/fall 2025 - Cambridge, MA USA If you are a current student (undergraduate/masters/PhD) with permission to work in the US and ability to work in-person in Cambridge, MA, US, consider a summer internship in the Broad Institute Imagi...

We're once again hiring a summer (+?) #bioimage #bioimageanalysis #software intern! Due to requirements of the funding program, you must be a current student, as well as work onsite in MA (+ have US work permission). Details at the link below. Spend your summer making great tools with fun people!

07.03.2025 14:23 โ€” ๐Ÿ‘ 22    ๐Ÿ” 18    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 3
A research lab from Northwestern, chosen because it's generic and sort of zoomed out.  Three scientists are visible in lab coats, and there are benches and shelving, with glass along one wall showing another high-rise building nearby. Overhead fluorescents provide light.

A research lab from Northwestern, chosen because it's generic and sort of zoomed out. Three scientists are visible in lab coats, and there are benches and shelving, with glass along one wall showing another high-rise building nearby. Overhead fluorescents provide light.

This is a room where we turn very modest salaries and budgets (and lots of coffee) into new knowledge, life-saving innovations, and technology that feeds business growth.

It's literally the loom that spins hay into gold but these numpties are suddenly worried about the cost of hay.

10.02.2025 23:59 โ€” ๐Ÿ‘ 11396    ๐Ÿ” 1685    ๐Ÿ’ฌ 236    ๐Ÿ“Œ 43
Preview
A genome-wide atlas of human cell morphology - Nature Methods An optical pooled cell profiling platform (PERISCOPE) based on Cell Painting and optical sequencing of molecular barcodes was used to develop the first unbiased genome-wide morphology-based perturbati...

Our paper โ€œA genome-wide atlas of human cell morphologyโ€ is finally out today in @naturemethods.bsky.social ! www.nature.com/articles/s41...

(I tweeted about our preprint in 2023 over at the bad place, but deactivated my account, so here we go again!)

27.01.2025 18:23 โ€” ๐Ÿ‘ 138    ๐Ÿ” 40    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2

yep, but that would have been too easy ;)

14.01.2025 00:26 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

and I regenerated my ssh keys 5 times over, began doubting everything lol

14.01.2025 00:16 โ€” ๐Ÿ‘ 6    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿงช Summer internship alert, Feb 7 deadline

New URL: hsph.harvard.edu/fellowship-s...

(+ @harvardchanschool.bsky.social is now on ๐Ÿฆ‹!)

08.01.2025 15:27 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Hey #StatsSky, what are you favorite papers to cite when you need to justify something that is obvious (I once had a reviewer ask we justify the use of logistic regression on a binary outcome)
or when you need to push-back on silly reviewer requests (e.g., asking for p-values in table 1)?

06.01.2025 08:15 โ€” ๐Ÿ‘ 331    ๐Ÿ” 88    ๐Ÿ’ฌ 75    ๐Ÿ“Œ 10
Post image

I don't think it's widely clear to the #RadiologyAI community just how poorly GPT-4V compares with the top report generation models on chest X-rays, like MedVersa or MAIRA-2.

It's clear we need a way to track progress.

18.12.2024 20:48 โ€” ๐Ÿ‘ 9    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

and @heidirehm.bsky.social too ๐Ÿš€

22.12.2024 16:06 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@rajpurkar.bsky.social is here!

22.12.2024 16:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Instead of listing my publications, as the year draws to an end, I want to shine the spotlight on the commonplace assumption that productivity must always increase. Good research is disruptive and thinking time is central to high quality scholarship and necessary for disruptive research.

20.12.2024 11:18 โ€” ๐Ÿ‘ 1156    ๐Ÿ” 376    ๐Ÿ’ฌ 21    ๐Ÿ“Œ 57

Over the long term there will be progress in closing the "capability-reliability gap" for agents, but for now, I think successful applications will be ones where (1) the user is in the loop, (2) errors are relatively easy to spot and (3) aren't a deal-breaker if not spotted.

20.12.2024 21:32 โ€” ๐Ÿ‘ 12    ๐Ÿ” 3    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

CytoSummaryNet is a Deep Sets-based approach that uses self-supervised contrastive learning in a multiple-instance learning framework. Try it out!

Paper: doi.org/10.1371/jour...

Code: github.com/carpenter-si...

With @johnarevalo.bsky.social @drannecarpenter.bsky.social Mehrtash Babadi

3/n

19.12.2024 23:31 โ€” ๐Ÿ‘ 8    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Robert van Dijk (www.linkedin.com/in/robert-v-...) developed CytoSummaryNet โ€“ a simple strategy to learn an optimal way to aggregate single-cell features into population-level profiles, outperforming traditional averaging on tasks like mechanism-of-action prediction. 2/n

19.12.2024 23:31 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
(a) Human U2OS cells treated with dimethyl sulfoxide (DMSO) and stained using the Cell Painting assay, which employs six dyes in five channels to label eight cellular compartments. The top row (from left to right) shows mitochondrial staining; actin, Golgi, and plasma membrane staining; and nucleolar and cytoplasmic RNA staining. The bottom row (from left to right) displays endoplasmic reticulum staining, DNA staining, and a montage of all five channels (from Cimini et al. [21]). (b) Thousands of features are extracted from each segmented cell in microscopy images of wells. A learned function f(x) (CytoSummaryNet) aggregates this data into a single feature vector: the sampleโ€™s profile. (c) An in-depth look at the model architecture used in this study. The model consists of three elements: a function ฯ†(x), which maps the input data from โ„D to โ„L space, a summation, which collapses the cell dimension, and ฯ(z), which maps the collapsed representation from โ„N to โ„L space. (d) During training, replicate compound profiles are forced to attract each other (green arrows) and simultaneously repel every other compound (red arrows) in the learned feature space. Here, all forces are drawn for a single profile of compound B.

(a) Human U2OS cells treated with dimethyl sulfoxide (DMSO) and stained using the Cell Painting assay, which employs six dyes in five channels to label eight cellular compartments. The top row (from left to right) shows mitochondrial staining; actin, Golgi, and plasma membrane staining; and nucleolar and cytoplasmic RNA staining. The bottom row (from left to right) displays endoplasmic reticulum staining, DNA staining, and a montage of all five channels (from Cimini et al. [21]). (b) Thousands of features are extracted from each segmented cell in microscopy images of wells. A learned function f(x) (CytoSummaryNet) aggregates this data into a single feature vector: the sampleโ€™s profile. (c) An in-depth look at the model architecture used in this study. The model consists of three elements: a function ฯ†(x), which maps the input data from โ„D to โ„L space, a summation, which collapses the cell dimension, and ฯ(z), which maps the collapsed representation from โ„N to โ„L space. (d) During training, replicate compound profiles are forced to attract each other (green arrows) and simultaneously repel every other compound (red arrows) in the learned feature space. Here, all forces are drawn for a single profile of compound B.

Taking pictures of cells with a microscope, then extracting thousands of features from them is uncannily effective for quantifying cell state, esp. for genes and chemicals (e.g., Cell Painting). But we often average the rich single-cell data to simplify analysis. Can we do better?
#bioML ๐Ÿงช
1/n

19.12.2024 23:31 โ€” ๐Ÿ‘ 72    ๐Ÿ” 18    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

True luxury is found in the simplest moments.

18.12.2024 20:59 โ€” ๐Ÿ‘ 19832    ๐Ÿ” 4327    ๐Ÿ’ฌ 428    ๐Ÿ“Œ 287
Preview
Insurers Are Dropping Homeowners as Climate Shocks Worsen (Gift Article) Without insurance, itโ€™s impossible to get a mortgage; without a mortgage, most Americans canโ€™t buy a home.

โ€œThe climate crisis that is coming our way is not just about polar bears, and itโ€™s not just about green jobs,โ€ Mr. Whitehouse said. โ€œIt actually is coming through your mail slot, in the form of insurance cancellations, insurance nonrenewals and dramatic increases in insurance costs.โ€ Gift link:

18.12.2024 21:07 โ€” ๐Ÿ‘ 161    ๐Ÿ” 65    ๐Ÿ’ฌ 11    ๐Ÿ“Œ 3

I am thrilled to nominate @johnarevalo.bsky.social who has landed on ๐Ÿฆ‹ with this fantastic graph dataset!

13.12.2024 21:46 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿงช So proud of this work by the dream team of @johnarevalo.bsky.social and Ellen Su: a new graph dataset for predicting drug-target interactions, using information from Cell Painting.

Stop by their poster in a few hours @ #NeurIPS! (details below)

PS: John is on the job market ๐Ÿš€

#bioML #MLSky

13.12.2024 21:43 โ€” ๐Ÿ‘ 19    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@shantanu-singh.cc is following 20 prominent accounts