I presented at Shiny in Production 2025, an incredible conference hosted by @jumpingrivers.com up in Newcastle! I was glad to share the very latest from the Shiny team directly. The topics were bleeding edge at the time, so still new and relevant now. My video: www.youtube.com/watch?v=vxai...
04.03.2026 14:34 —
👍 18
🔁 4
💬 0
📌 0
A pink and blue graphic reading "apply for our opportunity scholarship to posit::conf(2026)."
We are covering 40 people's travel, lodging, and registration for posit::conf() this fall! If you are from a group that is underrepresented in data science or open source, please consider applying for the Opportunity Scholarship—we'd love to have you join.
posit.co/blog/apply-t...
03.03.2026 15:17 —
👍 20
🔁 14
💬 2
📌 1
March's tabular playground
#rstats #databs #tidytuesday
www.kaggle.com/code/jimgrum...
02.03.2026 16:26 —
👍 3
🔁 1
💬 0
📌 0
Screenshot of both sides of the printable version of the cheatsheet
Screenshot of the web version of the recipes cheatsheet
#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio
02.03.2026 17:23 —
👍 47
🔁 14
💬 0
📌 1
A very helpful post on AI helpers, especially if you are new to AI and Claude
03.03.2026 15:04 —
👍 13
🔁 1
💬 1
📌 0
SAS’s sums of squares “types” were the ancestral language war with S(plus) back in the day. So quaint compared to now.
28.02.2026 13:03 —
👍 4
🔁 0
💬 1
📌 0
Too bad Claude can’t shovel snow.
23.02.2026 23:32 —
👍 8
🔁 0
💬 1
📌 1
Similarly, clinical trial people started calling observational data or non-randomized trials "real-world data" like _that_ is the abnormal case.
20.02.2026 23:10 —
👍 1
🔁 0
💬 0
📌 0
Yeah, it basically means a model built on the standard rectangular data structure.
That's what most of the world's data is, but since deep learning is all about images and non-tabular data (e.g., text), we have to give it a special name like it's the exception.
20.02.2026 23:09 —
👍 2
🔁 0
💬 1
📌 0
Here’s a clip from Max Kuhn (@topepo.bsky.social) of Posit breaking down how we can truly quantify LLM performance using a clear, generalizable framework.
See the full conference talk here: youtu.be/TQKbaIR-8J4
#AI #MachineLearning #DataBS
18.02.2026 16:54 —
👍 3
🔁 1
💬 0
📌 0
How to choose the best LLM using R and vitals
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Want to check if code using #GenAI generates the responses you want? Here's how to automate LLM evals with the {vitals} #RStats 📦 by @simonpcouch.com @posit.co
My latest at #InfoWorld:
www.infoworld.com/article/4130...
#LLMs
19.02.2026 14:26 —
👍 7
🔁 2
💬 0
📌 0
random.seed(42)
</cringe>
17.02.2026 23:29 —
👍 4
🔁 1
💬 1
📌 0
We owe sooo much to @yihui.org Thank you!
17.02.2026 02:42 —
👍 13
🔁 0
💬 1
📌 0
Let me put it this way… it was much much easier than trying to convince the same AI to just stop using the test set over and over and over again. 😃
17.02.2026 02:39 —
👍 2
🔁 0
💬 0
📌 0
With bookdown.org being decommissioned, I've been working on converting #rstats books from #bookdown to #quartopub. After spending a lot of time with Claude to get it right for my two books, here's a repo that might help you do the same (especially with Claude Code):
github.com/topepo/bookd...
16.02.2026 14:55 —
👍 62
🔁 10
💬 1
📌 2
TikZ.net – Graphics with TikZ in LaTeX
I’m like this with tikz (tikz.net). The best looking figures and diagrams but I don’t have the time to sort through its arcana.
16.02.2026 12:31 —
👍 2
🔁 0
💬 0
📌 0
Dumping the main points of both these posts into a skill is a really easy way to use Claude to try the conversions (and check unit tests) to see if these changes can help speed things up without regressions.
10.02.2026 13:43 —
👍 0
🔁 0
💬 0
📌 0
A while back, @simonpcouch.com wrote this relevant post for package maintainers to help them convert code from dplyr/tidyr to vctrs.
tidyverse.org/blog/2023/04...
10.02.2026 13:43 —
👍 5
🔁 1
💬 1
📌 0
`dplyr::if_else()` and `dplyr::case_when()` are up to 30x faster
dplyr 1.2.0 comes with much faster and more memory efficient `if_else()` and `case_when()` functions!
Last week we released dplyr 1.2.0, but we left off something VERY important 🙂
`dplyr::if_else()` and `dplyr::case_when()` are now up to 30x faster and use 10x less memory!
We dive into how we achieved these numbers in this new #rstats post!
tidyverse.org/blog/2026/02...
10.02.2026 13:03 —
👍 127
🔁 21
💬 4
📌 1
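For readers who have not used the two functions named in the post above, here is a minimal sketch of what `if_else()` and `case_when()` do. The `scores` tibble, its column names, and the grade cutoffs are made up for illustration and do not come from the linked post; the sketch shows ordinary usage only and does not measure the speed or memory claims.

```r
library(dplyr)

# Hypothetical data for illustration: one missing score.
scores <- tibble(
  id = 1:5,
  score = c(91, 78, NA, 55, 83)
)

graded <- scores |>
  mutate(
    # if_else() is a type-strict, vectorised if/else; `missing`
    # supplies the value used where the condition is NA.
    passed = if_else(score >= 60, "pass", "fail", missing = "not graded"),
    # case_when() evaluates conditions in order and uses the first
    # one that is TRUE; `.default` covers rows that match nothing.
    grade = case_when(
      score >= 90 ~ "A",
      score >= 80 ~ "B",
      score >= 60 ~ "C",
      is.na(score) ~ NA_character_,
      .default = "F"
    )
  )

print(graded)
```

Both functions check that every branch returns a compatible type, which is the main practical difference from base R's `ifelse()`.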
For more than a year I have been working on a brand new Jupyter Notebook editor for Positron. This is a ground-up build of a new Jupyter Notebook experience built to leverage all the knowledge and tools Posit/Positron brings to the data science table. 🧵#jupyter
04.02.2026 12:54 —
👍 26
🔁 7
💬 1
📌 1
dplyr 1.2.0
dplyr 1.2.0 fills in some important gaps in dplyr's API: we've added a new complement to `filter()` focused on dropping rows, and we've expanded the `case_when()` family with three new recoding and re...
dplyr 1.2.0 is out now and we are SO excited!
- `filter_out()` for dropping rows
- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools
These are huge quality of life wins for #rstats!
tidyverse.org/blog/2026/02...
04.02.2026 11:39 —
👍 465
🔁 133
💬 12
📌 14
Edgar did all that work, including heroic efforts to translate TI calculator code to SQL for *prediction intervals*.
I didn’t believe it was possible until he did it. 🍌🍌🍌
02.02.2026 01:11 —
👍 1
🔁 0
💬 0
📌 0
Large Language Models for Natural Language Processing in R or Python with the {mall} package
Join us with Edgar Ruiz at the Data Science Lab, Tuesday Jan 27 at 12pm ET: pos.it/dslab
Tomorrow at the Data Science Lab 🧪 we are hearing from the amazing @theotheredgar.bsky.social about the {mall} package:
Run Natural Language Processing against your #RStats tibbles or #Python Polars DataFrames for sentiment analysis, text summaries, and more!
Join us at 12 pm ET: pos.it/dslab
26.01.2026 17:26 —
👍 17
🔁 2
💬 0
📌 0
Semi-automating 200 Pull Requests with Claude Code – Davis Vaughan
I sent 200 pull requests using Claude Code and wrote about the experience. It's pretty wild!
For dplyr releases, we send a PR any time we break an #rstats package. This release advances a lot of deprecated functions, triggering issues in many old packages!
blog.davisvaughan.com/posts/2026-0...
12.01.2026 13:42 —
👍 61
🔁 13
💬 6
📌 1
tidymodels & xgboost
The tidymodels ecosystem is prepared for the big xgboost CRAN release.
We are excited to see that xgboost recently had a big CRAN release! We have worked hard on the tidymodels team to make sure you all have a smooth transition.
Please let us know if you are experiencing any issues with the releases.
tidyverse.org/blog/2025/12...
#rstats #tidymodels
15.12.2025 17:39 —
👍 25
🔁 4
💬 2
📌 0
Screenshot of the text of the linked blogpost 1/4
Screenshot of the text of the linked blogpost 2/4
Screenshot of the text of the linked blogpost 3/4
Screenshot of the text of the linked blogpost 4/4
~~ making sense of academic statistics ~~
i wrote about the confusing relationship between statistics and data analysis, and also about how statistics relates to science
#statistics #rstats #datascience
www.alexpghayes.com/post/making-...
15.07.2025 20:15 —
👍 113
🔁 19
💬 15
📌 8
Subreddits. io9 intermittently has posts going through new releases for upcoming months too.
14.12.2025 22:16 —
👍 1
🔁 0
💬 1
📌 0