Imbalanced classification: pitfalls and solutions โ Probabilistic calibration of cost-sensitive learning
Today at #EuroScipy2025, @glemaitre58.bsky.social and I presented a tutorial on pitfalls of machine learning for imbalanced classification problems.
We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values.
probabl-ai.github.io/calibration-...
19.08.2025 11:58 โ ๐ 23 ๐ 10 ๐ฌ 1 ๐ 0
A small update on the retrospective and future priorities of the open source team at @probabl.bsky.social for the next 6 months or so.
06.12.2024 17:14 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Sometimes you think you are right by doing everything "by the book." But sometimes the book is just a tiny part of the full story. Keep digging and writing a new chapter with more insights is actually fun...
05.12.2024 10:15 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 0
YouTube video by probabl
Imbalanced-learn: regrets and onwards - with Guillaume Lemaitre, core-maintainer
New podcast episode! This one is about imbalanced-learn and how the maintainer looks back with some lessons learned.
If you are dealing with imbalanced classification use-cases, like fraud, you'll want to listen in on this one!
youtu.be/npSkuNcm-Og
05.12.2024 09:58 โ ๐ 14 ๐ 4 ๐ฌ 0 ๐ 1
OK it is an interesting feedback. We could support older versions. We saw that up-to-now, we don't have any code that we are eager to drop quickly. I understand about the runtime dependencies and on our side, the idea is only depending on scikit-learn. But agreed that it is one more dependency.
29.11.2024 09:16 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
GitHub - glemaitre/sklearn-compat
Contribute to glemaitre/sklearn-compat development by creating an account on GitHub.
We are working on a small package to ease developer life: github.com/glemaitre/sk.... The idea is that recurrent work could be centralized in a single package. Once we have a minimal version, we will do a first release to support scikit-learn 1.2 to 1.6
28.11.2024 11:17 โ ๐ 15 ๐ 1 ๐ฌ 1 ๐ 0
A high-level summary diagram taken from the slides linked below. It shows the interplay of two main components: a probabilistic model and decision maker or planner.
Probabilistic predictions of an underfitting polynomial classifier on a noisy XOR task and the corresponding under-confident calibration curve.
Probabilistic predictions of an overfitting polynomial classifier and the resulting overconfident calibration curve on the same noisy XOR problem.
Simulation study to show the relative lack of stability of hyperparameter tuning when using hard metrics such as Accuracy or soft yet not probabilistic metrics such as ROC AUC compared to a strictly proper scoring rule such as the log-loss.
I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.
Here is the recording of the presentation:
www.youtube.com/watch?v=-gYn...
27.11.2024 14:17 โ ๐ 49 ๐ 19 ๐ฌ 1 ๐ 1
Version 1.6
Legend for changelogs something big that you couldnโt do before., something that you couldnโt do before., an existing feature now may not require as much computation or memory., a miscellaneous min...
Please help us test the first release candidate for scikit-learn 1.6: pip install scikit-learn==1.6.0rc1
Changelog: scikit-learn.org/1.6/whats_ne...
In particular, if you maintain a project with a dependency on
scikit-learn, please let us know about any regression.
22.11.2024 14:49 โ ๐ 39 ๐ 18 ๐ฌ 2 ๐ 2
With Artefact, we are delighted to invite data leaders to an exclusive Paris masterclass: โจAligning Probabilistic Classification with Business Decisions using @scikit-learn.bsky.social โจ ๐จLimited seats available!ย Secure your spot now ๐๐ปย lu.ma/fopoglzo #MachineLearning #Advanced #AI #Masterclass
22.11.2024 06:54 โ ๐ 8 ๐ 1 ๐ฌ 0 ๐ 0
Co-founder and CEO, Mistral AI
Researcher at Inria Saclay, team Soda
working on machine learning and causal inference for health data
Post-doc @ Tรฉlรฉcom SudParis
Software engineer at Quansight Labs.
Scikit-learn and Sphinx-Gallery dev.
Python Triage Member | Focusing on CPython #LKD #Python #ArchLinux #Django #eBPF
https://github.com/furkanonder/
Software engineer @quantco.com, Berlin ๐ฉ๐ช | PhD in Bioinformatics
Hon. Associate Professor UCL CS | Ex-Dir. Research AI for Good & Head of Element AI London Office | Ex-DeepMind. He/Him | https://cornebise.com
Data @ Protocol Labs.
Open Data, Open Source, Open Protocols.
Walks taker. Progressive Metal enjoyer.
davidgasquez.com
Research scientist at Apple | machine learning, optimization, language modeling
pierreablin.com
Distinguished Scientist at Google. Computational Imaging, Machine Learning, and Vision. Posts are personal opinions. May change or disappear over time.
http://milanfar.org
Neuroscientist | Biomedical Engineer | PhD Candidate at Columbia University
My research focuses on neural circuit changes, working memory deficits, and adult neurogenesis, using advanced imaging techniques like confocal and light-sheet microscopy
Doing ML & functional connectivity with a clinical twist ๐ง MIND Team, Inria #computationalneuroscience Git: http://github.com/victoris93
AI/ML product directory at the state of Maryland
I have launched Excel once
Software/data engineer at fluves.com - currently working on leak detection using fiber optics and water quality sensor data. Freelance developer/teacher. Volunteer at waterlandvzw. Living in Ghent.
Python | xarray | django | svelte
BrainsCAN Postdoctoral Fellow in cognitive computational neuroscience and neuroimaging focused in fMRI, brain atlasing, machine learning, open science, music cognition & gender equity.
Studying genomics, machine learning, and fruit. My code is like our genomes -- most of it is junk.
Assistant Professor UMass Chan
Previously IMP Vienna, Stanford Genetics, UW CSE.
Open-source Python library for building your applicationsโ frontend and backend (dashboards, chatbotsโฆ)
From simple pilots to production-ready web apps in no time.
No more compromise on performance, customization, & scalability.
github.com/Avaiga/taipy
Expert in Artificial Intelligence
Senior Data Scientist @ probabl.ai | Ph.D. in Applied Math