Hi! I’m thinking increasingly often during your videos “She would *love* reading ‘the dawn of everything’!” (By David Graeber and David Wengrow). It touches on so many topics from your videos and will blow your mind about history!
06.06.2025 18:56 — 👍 1 🔁 0 💬 0 📌 0
Release notes
Version 1.11: 1.11.0 2025-02-14: Release candidates: rc2 2025-01-24, rc1 2024-12-20. Features: rc1 sample() supports both upsampling and downsampling of observations and variables. subsample() is n...
🎉 Scanpy 1.11.0 is out! 🎉 just after reaching 2000 stars on GitHub!
- sc.pp.sample replaces subsample with many new features
- Sparse Dask support pca
- session-info2 package for more reproducible notebooks
See the release notes:
14.02.2025 12:08 — 👍 49 🔁 19 💬 1 📌 1
Agreed on both counts. Also with brotli and zstd there are two recent, established general-purpose compression algorithms to investigate!
12.12.2024 13:16 — 👍 1 🔁 0 💬 0 📌 0
Definitely! Above a certain scale, only out-of-core is still possible. That’s why we started with AnnData’s “backed” mode in 2018 and switched to the established Dask library recently.
12.12.2024 13:07 — 👍 2 🔁 0 💬 0 📌 0
I’m unsure if eking out max compression is worth it. Using half the space isn’t that impactful, especially when we‘re mostly talking about disk space, since Dask will load things chunkwise anyway (and needs to decompress to do computation).
12.12.2024 13:01 — 👍 0 🔁 0 💬 1 📌 0