Max kuhn's Avatar

Max kuhn

@topepo.bsky.social

Writing modeling packages at @posit.co (née RStudio). Opinions are my own. https://max-kuhn.org/

4,889 Followers  |  297 Following  |  176 Posts  |  Joined: 31.08.2023
Posts Following

Posts by Max kuhn (@topepo.bsky.social)

I presented at Shiny in Production 2025, an incredible conference hosted by @jumpingrivers.com up in Newcastle! I was glad to share the very latest from the Shiny team directly. The topics were bleeding edge at the time, so still new and relevant now. My video: www.youtube.com/watch?v=vxai...

04.03.2026 14:34 — 👍 18    🔁 4    💬 0    📌 0
A pink and blue graphic reading "apply for our opportunity scholarship to posit::conf(2026)."

A pink and blue graphic reading "apply for our opportunity scholarship to posit::conf(2026)."

We are covering 40 people's travel, lodging, and registration for posit::conf() this fall! If you are from a group that is underrepresented in data science or open source, please consider applying for the Opportunity Scholarship—we'd love to have you join.

posit.co/blog/apply-t...

03.03.2026 15:17 — 👍 20    🔁 14    💬 2    📌 1

March's tabular playground
#rstats #databs #tidytuesday
www.kaggle.com/code/jimgrum...

02.03.2026 16:26 — 👍 3    🔁 1    💬 0    📌 0
Screenshot of both sides of the printable version of the cheatsheet

Screenshot of both sides of the printable version of the cheatsheet

Screenshot of the web version of the recipes cheatsheet

Screenshot of the web version of the recipes cheatsheet

#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio

02.03.2026 17:23 — 👍 47    🔁 14    💬 0    📌 1

A very helpful post on AI helpers, especially if you are new to AI and Claude

03.03.2026 15:04 — 👍 13    🔁 1    💬 1    📌 0

SAS’s sums of squares “types” was the ancestral language war with S(plus) back in the day. So quaint compared to now.

28.02.2026 13:03 — 👍 4    🔁 0    💬 1    📌 0

Too bad Claude can’t shovel snow.

23.02.2026 23:32 — 👍 8    🔁 0    💬 1    📌 1

Similarly, clinical trial people started calling observational data or non-randomized trials "real-world data" like _that_ is the abnormal case.

20.02.2026 23:10 — 👍 1    🔁 0    💬 0    📌 0

Yeah, it basically means a model built on the standard rectangular data structure.

That's what most of the world's data is, but since deep learning is all about images and non-tabular data (e.g., text), we have to give it a special name like it's the exception.

20.02.2026 23:09 — 👍 2    🔁 0    💬 1    📌 0
Video thumbnail

Here’s a clip from Max Kuhn (@topepo.bsky.social) of Posit breaking down how we can truly quantify LLM performance using a clear, generalizable framework.

See the full conference talk here: youtu.be/TQKbaIR-8J4

#AI #MachineLearning #DataBS

18.02.2026 16:54 — 👍 3    🔁 1    💬 0    📌 0
Preview
How to choose the best LLM using R and vitals Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.

Want to check if code using #GenAI generates the responses you want? Here's how to automate LLM evals with the {vitals} #RStats 📦 by @simonpcouch.com @posit.co
My latest at #InfoWorld:
www.infoworld.com/article/4130...
#LLMs

19.02.2026 14:26 — 👍 7    🔁 2    💬 0    📌 0

random.seed(42)

</cringe>

17.02.2026 23:29 — 👍 4    🔁 1    💬 1    📌 0

We owe sooo much to @yihui.org Thank you!

17.02.2026 02:42 — 👍 13    🔁 0    💬 1    📌 0

Let me put it this way… it was much much easier than trying to convince the same AI to just stop using the test set over and over and over again. 😃

17.02.2026 02:39 — 👍 2    🔁 0    💬 0    📌 0
Post image

With bookdown.org being decommissioned, I've been working on converting #rstats books from #bookdown to #quartopub. After spending a lot of time with Claude to get it right for my two books, here's a repo that might help you do the same (especially with Claude Code):

github.com/topepo/bookd...

16.02.2026 14:55 — 👍 62    🔁 10    💬 1    📌 2
TikZ.net – Graphics with TikZ in LaTeX Graphics with TikZ in LaTeX

I’m like this with tikz (tikz.net). The best looking figures and diagrams but I don’t have the time to sort through its arcana.

16.02.2026 12:31 — 👍 2    🔁 0    💬 0    📌 0

Dumping the main points of both these posts into a skill is a really easy way to use claude to try the conversions (and check unit tests) to see if these changes can help speed things up without regressions.

10.02.2026 13:43 — 👍 0    🔁 0    💬 0    📌 0

A while back, @simonpcouch.com wrote this relevant post for package maintainers to help them convert code from dplyr/tidyr to vctrs.

tidyverse.org/blog/2023/04...

10.02.2026 13:43 — 👍 5    🔁 1    💬 1    📌 0
Preview
`dplyr::if_else()` and `dplyr::case_when()` are up to 30x faster dplyr 1.2.0 comes with much faster and more memory efficient `if_else()` and `case_when()` functions!

Last week we released dplyr 1.2.0, but we left off something VERY important 🙂

`dplyr::if_else()` and `dplyr::case_when()` are now up to 30x faster and use 10x less memory!

We dive into how we achieved these numbers in this new #rstats post!

tidyverse.org/blog/2026/02...

10.02.2026 13:03 — 👍 127    🔁 21    💬 4    📌 1
Post image

For more than a year I have been working on a brand new Jupyter Notebook editor for Positron. This is a ground-up build of a new Jupyter Notebook experience built to leverage all the knowledge and tools Posit/Positron brings to the data science table. 🧵#jupyter

04.02.2026 12:54 — 👍 26    🔁 7    💬 1    📌 1
Preview
dplyr 1.2.0 dplyr 1.2.0 fills in some important gaps in dplyr's API: we've added a new complement to `filter()` focused on dropping rows, and we've expanded the `case_when()` family with three new recoding and re...

dplyr 1.2.0 is out now and we are SO excited!

- `filter_out()` for dropping rows

- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools

These are huge quality of life wins for #rstats!

tidyverse.org/blog/2026/02...

04.02.2026 11:39 — 👍 465    🔁 133    💬 12    📌 14

Edgar all that work, including heroic efforts to translate TI calculator code to SQL for *prediction intervals*.

I didn’t believe it was possible until he did it. 🍌🍌🍌

02.02.2026 01:11 — 👍 1    🔁 0    💬 0    📌 0
Preview
caretForecast Conformal Time Series Forecasting Using Machine Learning

The hexagon here is priceless 😎

taf-society.github.io/caretForecast/

#rstats #timeseries

01.02.2026 16:41 — 👍 16    🔁 2    💬 1    📌 0
Large Language Models for Natural Language Processing in R or Python with the {mall} package Join us with Edgar Ruiz at the Data Science Lab Tuesday Jan 27 at 12pm ET pos.it/dslab

Large Language Models for Natural Language Processing in R or Python with the {mall} package Join us with Edgar Ruiz at the Data Science Lab Tuesday Jan 27 at 12pm ET pos.it/dslab

Tomorrow at the Data Science Lab 🧪 we are hearing from the amazing @theotheredgar.bsky.social about the {mall} package:

Run Natural Language Processing against your #RStats tibbles or #Python Polars DataFrames for sentiment analysis, text summaries, and more!

Join us at 12 pm ET: pos.it/dslab

26.01.2026 17:26 — 👍 17    🔁 2    💬 0    📌 0
Semi-automating 200 Pull Requests with Claude Code – Davis Vaughan

I sent 200 pull requests using Claude Code and wrote about the experience. It's pretty wild!

For dplyr releases, we send a PR any time we break an #rstats package. This release advances a lot of deprecated functions, triggering issues in many old packages!

blog.davisvaughan.com/posts/2026-0...

12.01.2026 13:42 — 👍 61    🔁 13    💬 6    📌 1
Preview
tidymodels & xgboost The tidymodels ecosystem is prepared for big xgboost CRAN release.

We are excited to see that xgboost recently had a big CRAN release! We have worked hard on the tidymodels team to make sure you all have a smooth transition.

Please yet us know if you are experiencing any issues with the releases

tidyverse.org/blog/2025/12...
#rstats #tidymodels

15.12.2025 17:39 — 👍 25    🔁 4    💬 2    📌 0
Screenshot of the text of the linked blogpost 1/4

Screenshot of the text of the linked blogpost 1/4

Screenshot of the text of the linked blogpost 2/4

Screenshot of the text of the linked blogpost 2/4

Screenshot of the text of the linked blogpost 3/4

Screenshot of the text of the linked blogpost 3/4

Screenshot of the text of the linked blogpost 4/4

Screenshot of the text of the linked blogpost 4/4

~~ making sense of academic statistics ~~

i wrote about the confusing relationship between statistics and data analysis, and also about how statistics relates to science

#statistics #rstats #datascience

www.alexpghayes.com/post/making-...

15.07.2025 20:15 — 👍 113    🔁 19    💬 15    📌 8

Subreddits. i09 intermittently has posts going through new releases for upcoming months too.

14.12.2025 22:16 — 👍 1    🔁 0    💬 1    📌 0