Gregor Sturm's Avatar

Gregor Sturm

@grst.bsky.social

Single Cell/Spatial. Cancer Immunology. Outdoor activities. Core developer @scverse.bsky.social. Working in Clinical Bioinformatics at Boehringer Ingelheim. Formerly PhD student at Medical University of Innsbruck. My private account. github.com/grst

933 Followers  |  358 Following  |  29 Posts  |  Joined: 21.11.2023  |  2.1075

Latest posts by grst.bsky.social on Bluesky

Register now for the best conference of the year!

05.08.2025 05:47 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
scverse conference 2025 Follow us on our channels to learn more details in the coming weeks

๐Ÿ“ฃ Mark your calendars! The 2025 edition of the scverse conference will take place on 17-19 November at Stanford University (US) scverse.org/conference20...

Call for abstracts and registrations coming soon!

12.05.2025 22:47 โ€” ๐Ÿ‘ 11    ๐Ÿ” 9    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2
Release v0.5.0 ยท scverse/cookiecutter-scverse New template sync We re-implemented template sync from scratch instead on relying on cruft. This allows us to create real merge conflicts that show up as such on GitHub instead of .rej files. Gene...

Just released a new version of the @scverse.bsky.social cookiecutter template: github.com/scverse/cook...

Some highlights:
๐Ÿ”ƒ improved template sync (merge conflicts now show up as such)
๐Ÿš€ use hatch as project manager
๐Ÿ”ง lots of fixes and documentation updates

02.04.2025 18:37 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Rogue Scholar

rogue-scholar.org

14.03.2025 10:01 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Nice post!
How did you generate the doi-link for a blog post?

14.03.2025 09:52 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
LEMUR simplified | const-ae A simplified implementation of the LEMUR algorithm.

Blog post by @const-ae.bsky.social with a simple explanation of the manifold regression algorithm & code that underlies our paper โ€œAnalysis of multi-condition single-cell data with latent embedding multivariate regressionโ€ (doi.org/10.1002/eji....).

const-ae.name/post/2025-01...

04.03.2025 18:27 โ€” ๐Ÿ‘ 26    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Working with >1M cells Scirpy scales to millions of cells on a single workstation. This page is a work-in-progess collection with advice how to work with large datasets. Distance metrics: Computing pairwise sequence dist...

Just released scirpy v0.21 -- Now with GPU Support for Hamming sequence distance and a brand new tutorial for working with scTCR datasets >1M cells: scirpy.scverse.org/en/latest/tu...
@scverse.bsky.social

25.02.2025 17:46 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Release notes Version 1.11: 1.11.0 2025-02-14: Release candidates: rc2 2025-01-24, rc1 2024-12-20. Features: rc1 sample() supports both upsampling and downsampling of observations and variables. subsample() is n...

๐ŸŽ‰ Scanpy 1.11.0 is out! ๐ŸŽ‰ just after reaching 2000 stars on GitHub!

- sc.pp.sample replaces subsample with many new features
- Sparse Dask support pca
- session-info2 package for more reproducible notebooks

See the release notes:

14.02.2025 12:08 โ€” ๐Ÿ‘ 49    ๐Ÿ” 19    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Been looking forward to this talk since @alexpeltzer.bsky.social told me about DSO in October!

09.02.2025 14:07 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - Boehringer-Ingelheim/dso: Data Science Operations (dso) command line tool Data Science Operations (dso) command line tool. Contribute to Boehringer-Ingelheim/dso development by creating an account on GitHub.

I'd like to share DSO, a command line helper to build reproducible data science projects with ease.

It is an opinionated way to organize data science projects, built around data version control (DVC).

github.com/Boehringer-I...

05.02.2025 18:32 โ€” ๐Ÿ‘ 11    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We try to avoid that by using this with preprocessed data only. All the heavy lifting is done with nextflow pipelines before. Datasets up to tens of GBs have worked well so far.

05.02.2025 18:40 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Finally, many thanks to my colleagues @alexpeltzer.bsky.social, Daniel Schreyer and Tom Schwarzl for testing, adopting, and contributing to DSO.

05.02.2025 18:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Bytesize: data science operations (DSO) Gregor Sturm, Boehringer Ingelheim

If you want to learn more, I'll be presenting this at a @nf-co.re bytesize talk: nf-co.re/events/2025/...

05.02.2025 18:32 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

We built this at @boehringerglobal.bsky.social to meet the quality standards required for biomarker analysis in clinical trials.

But I think this is useful for any kind of data analysis project.

05.02.2025 18:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
An exemplary PCA plot with a "preliminary" watermark.

An exemplary PCA plot with a "preliminary" watermark.

One of my favorite features: automated watermarking of all plots in a quarto report. Nobody gonna publish my plots anymore before I think they are ready.

05.02.2025 18:32 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

It brings together the best tools:
- git, for code versioning
- dvc, for data versioning and tracking inputs and outputs
- jinja2, for templates
- uv, for Python dep mgmt
- quarto, for authoring reports
- hiyapyco, for hierarchical YAML config
- pre-commit, for linting

05.02.2025 18:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
GitHub - Boehringer-Ingelheim/dso: Data Science Operations (dso) command line tool Data Science Operations (dso) command line tool. Contribute to Boehringer-Ingelheim/dso development by creating an account on GitHub.

I'd like to share DSO, a command line helper to build reproducible data science projects with ease.

It is an opinionated way to organize data science projects, built around data version control (DVC).

github.com/Boehringer-I...

05.02.2025 18:32 โ€” ๐Ÿ‘ 11    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

We (Chen Zhan!) just launched #sccomp for #Python!

Testing for differences in cell-type proportion in #singlecell #spatial data?

#sccomp is a mixed-effect Bayesian model
- Use sum-constrained BetaBinomial distribution
- Outliers detect.
- Remove unwanted effects

github.com/MangiolaLabo...

22.01.2025 02:57 โ€” ๐Ÿ‘ 11    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
GitHub - icbi-lab/luca: Single-cell Lung Cancer Atlas with 1.2M cells Single-cell Lung Cancer Atlas with 1.2M cells. Contribute to icbi-lab/luca development by creating an account on GitHub.

(2) Finding the mistake, tracing it back to its origin, and fixing it was only possible because the data and scripts for building the atlas are publicly available and fully reproducible. github.com/icbi-lab/luca

19.01.2025 10:54 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

(1) Maintaining a data resource is very much like maintaining software. It is never "done" but constantly improving.

19.01.2025 10:54 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Cellxgene Data Portal Find, download, and visually explore curated and standardized single cell datasets.

Two years after publication of our single-cell lung cancer atlas, a user found a mistake in the annotation of the EGFR-status of some patients. We fixed the issue and the atlas is now updated on cell-x-gene: cellxgene.cziscience.com/collections/...

What are the takeaways from that? (1/3)

19.01.2025 10:54 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Preview
Scverse x Owkin Hackathon in Paris We're pleased to announce the next Scverse Hackathon will take place in the Owkin offices in Paris from 17/03/2025 9am to 19/03/2025 1:30pm. This hackathon is a joint initiative between the scverse c...

I am Stoked about our upcoming @scverse.bsky.social
and @owkin.bsky.social hackathon, focused on spatial omics data analysis.
๐Ÿ“… March 17-19, 2025
๐Ÿ“ Owkin office, Paris

Apply now: docs.google.com/forms/d/e/1F...

17.01.2025 10:34 โ€” ๐Ÿ‘ 10    ๐Ÿ” 8    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

protein sequencing ๐Ÿ‘€

14.01.2025 19:53 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Overview of the LEMUR steps: (1) subspace alignment, (2) differential expression, (3) DE neighborhoods, (4) pseudobulking.

Overview of the LEMUR steps: (1) subspace alignment, (2) differential expression, (3) DE neighborhoods, (4) pseudobulking.

After 4y in the making, I am super excited that my main PhD project is published ๐ŸŽ‰๐Ÿฅณ๐ŸŽ‰๐ŸŽ‰๐Ÿฅณ

www.nature.com/articles/s41...

LEMUR is a tool to analyze multi-condition single-cell data and model differential expression as a continuous function of the cell-state space.

Some highlightsโฌ‡๏ธ

03.01.2025 19:25 โ€” ๐Ÿ‘ 155    ๐Ÿ” 35    ๐Ÿ’ฌ 8    ๐Ÿ“Œ 3

The big issue here in Germany is that we pay ~20 ct/kWh in fixed network fees and tax. Really limits how much you can save.

02.01.2025 17:47 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

While definitely interesting, dynamic tariffs have a much higher cost-saving potential for devices that consume a lot of energy and are easy to regulate automatically. Such as a heat pump or electric car - of which we have neither, for now.

02.01.2025 13:35 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

There's a certain price risk, though. In the energy crisis 2021-2022 prices increased significantly. However prices in 2023 were down to normal, while many fixed price tariffs increased their rates.

02.01.2025 13:35 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Monthly comparison of fixed price tariff (AรœW) with dynamic tariff (tado).

Monthly comparison of fixed price tariff (AรœW) with dynamic tariff (tado).

Dynamic electricity tariffs are an incentive to use energy when it's abundant and emits little CO2.

But are they also cheaper?

โœ… For 2024, we would have saved 10-13% compared to our current fixed price tariff. Without any optimization.

Full post (in German): grst.github.io/dynamischer-...

02.01.2025 13:35 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Modern tar detects the compression automatically when reading from a file. So `tar xvf` covers most of the cases already.

30.12.2024 15:02 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Karin Hrovatin Data science blog on topics that donโ€™t get published.

To bring to light data science topics that usually donโ€™t make it into publications I started a blog on this topic: hrovatin.github.io By interviewing different researchers, I plan to find out what is going on in the community.

16.12.2024 06:59 โ€” ๐Ÿ‘ 10    ๐Ÿ” 6    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@grst is following 20 prominent accounts