Marc Garcia

@datapythonista.bsky.social

Founder at Quantitative Mining working on decision tree based models and Rust. Open source and Python: pandas core developer, PSF fellow, former EuroSciPy co-organizer, NumFOCUS community award. https://datapythonista.me

60 Followers  |  84 Following  |  3 Posts  |  Joined: 20.11.2024  |  1.7114

Latest posts by datapythonista.bsky.social on Bluesky

Release Python Polars 1.31.0 · pola-rs/polars 💥 Breaking changes Remove old streaming engine (#23103) ⚠️ Deprecations Deprecate allow_missing_columns in scan_parquet in favor of missing_columns (#22784) 🚀 Performance improvements Improve ...

Polars 1.31 is released. The highlight is DataType expressions ✨, which let you get the datatype of an expression dynamically.

See the full changelog here.

github.com/pola-rs/pola...

Want more from DataType expressions? Here is the RFC:

github.com/pola-rs/polars

18.06.2025 14:36 | 👍 8    🔁 2    💬 0    📌 0
pandas - Python Data Analysis Library

We just reorganized the pandas ecosystem page. Is it clearer? Anything missing or not useful? Feedback welcome

pandas.pydata.org/community/ec...

16.06.2025 21:59 | 👍 0    🔁 0    💬 0    📌 0

pandas 2.3 has been released, the last version before pandas 3.0.

05.06.2025 07:36 | 👍 0    🔁 0    💬 0    📌 0

This seems nicer than Twitter and Mastodon so far. Happy to reconnect with many people I didn't see for a while.

28.11.2024 14:55 | 👍 3    🔁 0    💬 0    📌 0

pixi for science
uv for everything else

27.11.2024 12:36 | 👍 9    🔁 2    💬 3    📌 0
A high-level summary diagram taken from the slides linked below. It shows the interplay of two main components: a probabilistic model and decision maker or planner.

Probabilistic predictions of an underfitting polynomial classifier on a noisy XOR task and the corresponding under-confident calibration curve.

Probabilistic predictions of an overfitting polynomial classifier and the resulting overconfident calibration curve on the same noisy XOR problem.

Simulation study to show the relative lack of stability of hyperparameter tuning when using hard metrics such as Accuracy or soft yet not probabilistic metrics such as ROC AUC compared to a strictly proper scoring rule such as the log-loss.

I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.

Here is the recording of the presentation:

www.youtube.com/watch?v=-gYn...

27.11.2024 14:17 | 👍 50    🔁 19    💬 1    📌 1
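
The image descriptions in the post above contrast hard metrics such as accuracy with a strictly proper scoring rule such as the log-loss. A minimal sketch of why the two can disagree, using only the Python standard library: the labels and both sets of predicted probabilities are made-up toy values, and the helper names `accuracy` and `log_loss` are illustrative, not taken from the talk or from scikit-learn.

```python
import math


def accuracy(y_true, p_pred, threshold=0.5):
    """Fraction of labels matched after thresholding the probabilities."""
    hits = sum((p >= threshold) == bool(y) for y, p in zip(y_true, p_pred))
    return hits / len(y_true)


def log_loss(y_true, p_pred, eps=1e-15):
    """Mean negative log-likelihood of binary labels under the predictions."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total -= math.log(p) if y == 1 else math.log(1 - p)
    return total / len(y_true)


# Hypothetical toy data: both models make the same two hard mistakes.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
moderate = [0.8, 0.7, 0.8, 0.2, 0.3, 0.2, 0.4, 0.6]
overconfident = [0.99, 0.99, 0.99, 0.01, 0.01, 0.01, 0.01, 0.99]

for name, probs in [("moderate", moderate), ("overconfident", overconfident)]:
    print(f"{name}: accuracy={accuracy(y_true, probs):.2f}, "
          f"log-loss={log_loss(y_true, probs):.3f}")
```

Both toy models get the same accuracy because their thresholded predictions are identical, while the log-loss penalizes the overconfident one heavily for its two confident mistakes. Accuracy cannot see that difference at all.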
Vegafusion demo

✨ New Vegafusion release (2.0)!

🦀 Modernized Rust core

➡️ Arrow PyCapsule Interface

🌊🦄 Narwhals for agnostic Dataframe support

👉 Full release notes vegafusion.io/posts/2024/2...

25.11.2024 21:21 | 👍 5    🔁 2    💬 0    📌 0
Version 1.6 Legend for changelogs something big that you couldn't do before., something that you couldn't do before., an existing feature now may not require as much computation or memory., a miscellaneous min...

Please help us test the first release candidate for scikit-learn 1.6: pip install scikit-learn==1.6.0rc1

Changelog: scikit-learn.org/1.6/whats_ne...

In particular, if you maintain a project with a dependency on scikit-learn, please let us know about any regression.

22.11.2024 14:49 | 👍 38    🔁 18    💬 2    📌 2

@datapythonista is following 20 prominent accounts