Register now for the best conference of the year!
05.08.2025 05:47 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0@grst.bsky.social
Single Cell/Spatial. Cancer Immunology. Outdoor activities. Core developer @scverse.bsky.social. Working in Clinical Bioinformatics at Boehringer Ingelheim. Formerly PhD student at Medical University of Innsbruck. My private account. github.com/grst
Register now for the best conference of the year!
05.08.2025 05:47 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0๐ฃ Mark your calendars! The 2025 edition of the scverse conference will take place on 17-19 November at Stanford University (US) scverse.org/conference20...
Call for abstracts and registrations coming soon!
Just released a new version of the @scverse.bsky.social cookiecutter template: github.com/scverse/cook...
Some highlights:
๐ improved template sync (merge conflicts now show up as such)
๐ use hatch as project manager
๐ง lots of fixes and documentation updates
Nice post!
How did you generate the doi-link for a blog post?
Blog post by @const-ae.bsky.social with a simple explanation of the manifold regression algorithm & code that underlies our paper โAnalysis of multi-condition single-cell data with latent embedding multivariate regressionโ (doi.org/10.1002/eji....).
const-ae.name/post/2025-01...
Just released scirpy v0.21 -- Now with GPU Support for Hamming sequence distance and a brand new tutorial for working with scTCR datasets >1M cells: scirpy.scverse.org/en/latest/tu...
@scverse.bsky.social
๐ Scanpy 1.11.0 is out! ๐ just after reaching 2000 stars on GitHub!
- sc.pp.sample replaces subsample with many new features
- Sparse Dask support pca
- session-info2 package for more reproducible notebooks
See the release notes:
Been looking forward to this talk since @alexpeltzer.bsky.social told me about DSO in October!
09.02.2025 14:07 โ ๐ 4 ๐ 1 ๐ฌ 0 ๐ 0I'd like to share DSO, a command line helper to build reproducible data science projects with ease.
It is an opinionated way to organize data science projects, built around data version control (DVC).
github.com/Boehringer-I...
We try to avoid that by using this with preprocessed data only. All the heavy lifting is done with nextflow pipelines before. Datasets up to tens of GBs have worked well so far.
05.02.2025 18:40 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0Finally, many thanks to my colleagues @alexpeltzer.bsky.social, Daniel Schreyer and Tom Schwarzl for testing, adopting, and contributing to DSO.
05.02.2025 18:32 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0If you want to learn more, I'll be presenting this at a @nf-co.re bytesize talk: nf-co.re/events/2025/...
05.02.2025 18:32 โ ๐ 5 ๐ 2 ๐ฌ 1 ๐ 1We built this at @boehringerglobal.bsky.social to meet the quality standards required for biomarker analysis in clinical trials.
But I think this is useful for any kind of data analysis project.
An exemplary PCA plot with a "preliminary" watermark.
One of my favorite features: automated watermarking of all plots in a quarto report. Nobody gonna publish my plots anymore before I think they are ready.
05.02.2025 18:32 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0It brings together the best tools:
- git, for code versioning
- dvc, for data versioning and tracking inputs and outputs
- jinja2, for templates
- uv, for Python dep mgmt
- quarto, for authoring reports
- hiyapyco, for hierarchical YAML config
- pre-commit, for linting
I'd like to share DSO, a command line helper to build reproducible data science projects with ease.
It is an opinionated way to organize data science projects, built around data version control (DVC).
github.com/Boehringer-I...
We (Chen Zhan!) just launched #sccomp for #Python!
Testing for differences in cell-type proportion in #singlecell #spatial data?
#sccomp is a mixed-effect Bayesian model
- Use sum-constrained BetaBinomial distribution
- Outliers detect.
- Remove unwanted effects
github.com/MangiolaLabo...
(2) Finding the mistake, tracing it back to its origin, and fixing it was only possible because the data and scripts for building the atlas are publicly available and fully reproducible. github.com/icbi-lab/luca
19.01.2025 10:54 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0(1) Maintaining a data resource is very much like maintaining software. It is never "done" but constantly improving.
19.01.2025 10:54 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0Two years after publication of our single-cell lung cancer atlas, a user found a mistake in the annotation of the EGFR-status of some patients. We fixed the issue and the atlas is now updated on cell-x-gene: cellxgene.cziscience.com/collections/...
What are the takeaways from that? (1/3)
I am Stoked about our upcoming @scverse.bsky.social
and @owkin.bsky.social hackathon, focused on spatial omics data analysis.
๐
March 17-19, 2025
๐ Owkin office, Paris
Apply now: docs.google.com/forms/d/e/1F...
protein sequencing ๐
14.01.2025 19:53 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0Overview of the LEMUR steps: (1) subspace alignment, (2) differential expression, (3) DE neighborhoods, (4) pseudobulking.
After 4y in the making, I am super excited that my main PhD project is published ๐๐ฅณ๐๐๐ฅณ
www.nature.com/articles/s41...
LEMUR is a tool to analyze multi-condition single-cell data and model differential expression as a continuous function of the cell-state space.
Some highlightsโฌ๏ธ
The big issue here in Germany is that we pay ~20 ct/kWh in fixed network fees and tax. Really limits how much you can save.
02.01.2025 17:47 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0While definitely interesting, dynamic tariffs have a much higher cost-saving potential for devices that consume a lot of energy and are easy to regulate automatically. Such as a heat pump or electric car - of which we have neither, for now.
02.01.2025 13:35 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0There's a certain price risk, though. In the energy crisis 2021-2022 prices increased significantly. However prices in 2023 were down to normal, while many fixed price tariffs increased their rates.
02.01.2025 13:35 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Monthly comparison of fixed price tariff (AรW) with dynamic tariff (tado).
Dynamic electricity tariffs are an incentive to use energy when it's abundant and emits little CO2.
But are they also cheaper?
โ
For 2024, we would have saved 10-13% compared to our current fixed price tariff. Without any optimization.
Full post (in German): grst.github.io/dynamischer-...
Modern tar detects the compression automatically when reading from a file. So `tar xvf` covers most of the cases already.
30.12.2024 15:02 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0To bring to light data science topics that usually donโt make it into publications I started a blog on this topic: hrovatin.github.io By interviewing different researchers, I plan to find out what is going on in the community.
16.12.2024 06:59 โ ๐ 10 ๐ 6 ๐ฌ 1 ๐ 0