Git 2.53 provides faster insights into the repository structure with 'git repo structure'. However, to better understand this, it is helpful to be more familiar with the Git data and storage models: www.python4data.science/en/latest/pr...
#Git
@python4data.science.bsky.social
Teaching materials for the cusy training courses on a Python-based data science workflow: https://cusy.io/en/seminars
Git 2.53 provides faster insights into the repository structure with 'git repo structure'. However, to better understand this, it is helpful to be more familiar with the Git data and storage models: www.python4data.science/en/latest/pr...
#Git
The section on performance measurements and finding bottlenecks has been significantly expanded to include cProfile/profiling.tracing, tprof, and profiling.sampling/Tachyon: www.python4data.science/en/latest/pe...
#Python #Performance
We have updated the documentation section with references to README, CONTRIBUTING, CHANGELOG, etc.
python-basics-tutorial.readthedocs.io/en/latest/do...
#Python #Documentation
We have updated the FastAPI extensions. It is very surprising to us that millions of extensions are being downloaded that have not been updated for over a year.
www.python4data.science/en/latest/da...
@fastapi.tiangolo.com
#Python #FastAPI #REST
I took a look at the changes coming with Python 3.15 β and I can't wait to put them to productive use. I've already updated our tutorials:
β’ Performance measurements: www.python4data.science/en/latest/pe...
β’ Tachyon: www.python4data.science/en/latest/pe...
#Python
We have updated the section on pytest with many exciting use cases
* on command line options
* on generating markers
* and on parameterising exceptions
python-basics-tutorial.readthedocs.io/en/latest/te...
#Python #Testing #pytest
Precision-Recall-Curve comparison between workspace and HEAD
Receiver operating characteristic (ROC) comparison between workspace and HEAD
Confusion Matrix comparison between workspace and HEAD
We have updated our tutorial to data management with DVC. It also allows you to create lightweight data science and data modelling workflows and execute them in a parameterised manner: www.python4data.science/en/latest/pr...
#Data #Versioncontrol #Git #DataScience #Modeling #Python
Now we have also described how to use uv reliably for Cursor: www.python4data.science/en/latest/pr...
#CursorAI #Python #Packaging #uv
We have now described how to create a configuration for Claude Code so that it uses uv reliably: python4data.science/en/latest/pr...
#ClaudeCode #Python #Packaging #uv
Since we have recently been asked frequently whether pandas is slow and whether we should use Polars, Dask or DuckDB instead, we have now provided an initial overview of the various technologies: www.python4data.science/en/latest/wo...
#Python #Performance #DuckDB
We have now completely switched to pathlib: python-basics-tutorial.readthedocs.io/en/latest/sa...
#Python
We have finally documented Ruff β the tool greatly simplifies static code analysis for Python projects: www.python4data.science/en/latest/pr...
#Python #Ruff
We have now updated our packaging tutorial to include PEP 639, which enables SPDX-compliant licensing: python-basics-tutorial.readthedocs.io/en/latest/pa...
#Python #Packaging #SPDX #Licensing
We have added a section on additional JSON tools: www.python4data.science/en/latest/da...
#Python #JSON
We have added several geopython libraries: www.python4data.science/en/latest/da...
#Python #Geospatial #GeoPython
We have significantly expanded the section on licences for AI systems: www.python4data.science/en/latest/pr...
#AI #Licensing #OpenData #OpenSource
π₯Spack v1.0 is out!π₯
This is a huge milestone. We reworked the core to add compiler dependencies, and we're introducing a stable package API.
π1.0 also adds concurrent builds, better includes, and much more -- read it all in the release notes!
github.com/spack/spack/...
XKCD #3117: Replication Crisis
The XKCD comic on reproducible scientific results fits perfectly with our tutorial π§ π
www.python4data.science/en/latest/pr...
Graph from GitHubβs Octoverse 2024 report showing a spike in utilization of Jupyter Notebooks across GitHub. This is calculated by looking at the distinct number of public repositories with at least one Jupyter Notebook by the year the repository was created. Since 2016, we have seen this number surge from near zero to more than 1.5 million repositories using Jupyter Notebooks.
Almost more significant than the success of #Python is the growth of #Jupyter #Notebooks: βData scientists and machine learning researchers commonly use the #OpenSource application for #MachineLearning, #DataViz, and more.β
jupyter-tutorial.readthedocs.io/en/latest/in...
We have added a section on protomaps to our PyViz tutorial. Protomaps makes map visualisations so much easier.
pyviz-tutorial.readthedocs.io/en/latest/pr...
#Protomaps #Geography #World #Map @protomaps.com
We have expanded the section on geodata to include the most common (tile) file formats: www.python4data.science/en/latest/da...
#Geography #GIS
And a new section on AI/ML licences has also been added: www.python4data.science/en/latest/pr...
#AI #ML #License
We have updated our Python Basics tutorial to describe the guidelines for docstrings in more detail:
python-basics-tutorial.readthedocs.io/en/latest/do...
#Python #Documentation #DX
We have expanded the section on open source hardware licences to include the TAPR and Solderpad Hardware Licence: www.python4data.science/en/latest/pr...
#OpenSource #Hardware #Licence
thoughtworks Technology Radar has now also adopted the tools uv and Renovate: www.thoughtworks.com/radar
02.04.2025 15:21 β π 4 π 0 π¬ 0 π 0We have expanded our section on GitLab CI/CD pipelines with examples of
β’ GitLab Pages
β’ npm deployments with rsync
β’ building Docker containers
β’ multi-arch images with Buildah
β’ migrating GitHub Actions
www.python4data.science/en/latest/pr...
#GitLab #CICD #DevOps #DX
We have written down our experiences of how LLMs help us with programming: cusy.io/en/blog/how-...
#LLM #AI #programming #DX #Python
Which Python dashboard library for which purpose?
We were left with only two candidates: VoilΓ and Panel: jupyter-tutorial.readthedocs.io/en/latest/da...
#DataViz #Python
π 4000 Pythonistas and data scientists now follow us on Bluesky π€ We are very pleased about the great interest in our offer.
#Python #DataScience
Our course for the versioned and reproducible storage of code and data in data science workflows is now also referenced in the official Git documentation: git-scm.com/doc/ext
#Git #DataScience #DX