π€©
06.03.2026 15:17 β π 2 π 1 π¬ 0 π 0π€©
06.03.2026 15:17 β π 2 π 1 π¬ 0 π 0
@drob.bsky.socialβs talk from 7 years ago covers this well
youtu.be/dT5A0sAWc2I
I presented at Shiny in Production 2025, an incredible conference hosted by @jumpingrivers.com up in Newcastle! I was glad to share the very latest from the Shiny team directly. The topics were bleeding edge at the time, so still new and relevant now. My video: www.youtube.com/watch?v=vxai...
04.03.2026 14:34 β π 22 π 5 π¬ 0 π 0A pink and blue graphic reading "apply for our opportunity scholarship to posit::conf(2026)."
We are covering 40 people's travel, lodging, and registration for posit::conf() this fall! If you are from a group that is underrepresented in data science or open source, please consider applying for the Opportunity Scholarshipβwe'd love to have you join.
posit.co/blog/apply-t...
March's tabular playground
#rstats #databs #tidytuesday
www.kaggle.com/code/jimgrum...
Screenshot of both sides of the printable version of the cheatsheet
Screenshot of the web version of the recipes cheatsheet
#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio
02.03.2026 17:23 β π 48 π 14 π¬ 0 π 1A very helpful post on AI helpers, especially if you are new to AI and Claude
03.03.2026 15:04 β π 13 π 1 π¬ 1 π 0SASβs sums of squares βtypesβ was the ancestral language war with S(plus) back in the day. So quaint compared to now.
28.02.2026 13:03 β π 4 π 0 π¬ 1 π 0Too bad Claude canβt shovel snow.
23.02.2026 23:32 β π 8 π 0 π¬ 2 π 1Similarly, clinical trial people started calling observational data or non-randomized trials "real-world data" like _that_ is the abnormal case.
20.02.2026 23:10 β π 1 π 0 π¬ 0 π 0
Yeah, it basically means a model built on the standard rectangular data structure.
That's what most of the world's data is, but since deep learning is all about images and non-tabular data (e.g., text), we have to give it a special name like it's the exception.
Hereβs a clip from Max Kuhn (@topepo.bsky.social) of Posit breaking down how we can truly quantify LLM performance using a clear, generalizable framework.
See the full conference talk here: youtu.be/TQKbaIR-8J4
#AI #MachineLearning #DataBS
Want to check if code using #GenAI generates the responses you want? Here's how to automate LLM evals with the {vitals} #RStats π¦ by @simonpcouch.com @posit.co
My latest at #InfoWorld:
www.infoworld.com/article/4130...
#LLMs
random.seed(42)
</cringe>
We owe sooo much to @yihui.org Thank you!
17.02.2026 02:42 β π 13 π 0 π¬ 1 π 0Let me put it this wayβ¦ it was much much easier than trying to convince the same AI to just stop using the test set over and over and over again. π
17.02.2026 02:39 β π 2 π 0 π¬ 0 π 0
With bookdown.org being decommissioned, I've been working on converting #rstats books from #bookdown to #quartopub. After spending a lot of time with Claude to get it right for my two books, here's a repo that might help you do the same (especially with Claude Code):
github.com/topepo/bookd...
Iβm like this with tikz (tikz.net). The best looking figures and diagrams but I donβt have the time to sort through its arcana.
16.02.2026 12:31 β π 2 π 0 π¬ 0 π 0Dumping the main points of both these posts into a skill is a really easy way to use claude to try the conversions (and check unit tests) to see if these changes can help speed things up without regressions.
10.02.2026 13:43 β π 0 π 0 π¬ 0 π 0
A while back, @simonpcouch.com wrote this relevant post for package maintainers to help them convert code from dplyr/tidyr to vctrs.
tidyverse.org/blog/2023/04...
Last week we released dplyr 1.2.0, but we left off something VERY important π
`dplyr::if_else()` and `dplyr::case_when()` are now up to 30x faster and use 10x less memory!
We dive into how we achieved these numbers in this new #rstats post!
tidyverse.org/blog/2026/02...
For more than a year I have been working on a brand new Jupyter Notebook editor for Positron. This is a ground-up build of a new Jupyter Notebook experience built to leverage all the knowledge and tools Posit/Positron brings to the data science table. π§΅#jupyter
04.02.2026 12:54 β π 26 π 7 π¬ 1 π 1
dplyr 1.2.0 is out now and we are SO excited!
- `filter_out()` for dropping rows
- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools
These are huge quality of life wins for #rstats!
tidyverse.org/blog/2026/02...
Edgar all that work, including heroic efforts to translate TI calculator code to SQL for *prediction intervals*.
I didnβt believe it was possible until he did it. πππ
The hexagon here is priceless π
taf-society.github.io/caretForecast/
#rstats #timeseries
Large Language Models for Natural Language Processing in R or Python with the {mall} package Join us with Edgar Ruiz at the Data Science Lab Tuesday Jan 27 at 12pm ET pos.it/dslab
Tomorrow at the Data Science Lab π§ͺ we are hearing from the amazing @theotheredgar.bsky.social about the {mall} package:
Run Natural Language Processing against your #RStats tibbles or #Python Polars DataFrames for sentiment analysis, text summaries, and more!
Join us at 12 pm ET: pos.it/dslab
I sent 200 pull requests using Claude Code and wrote about the experience. It's pretty wild!
For dplyr releases, we send a PR any time we break an #rstats package. This release advances a lot of deprecated functions, triggering issues in many old packages!
blog.davisvaughan.com/posts/2026-0...
We are excited to see that xgboost recently had a big CRAN release! We have worked hard on the tidymodels team to make sure you all have a smooth transition.
Please yet us know if you are experiencing any issues with the releases
tidyverse.org/blog/2025/12...
#rstats #tidymodels
Screenshot of the text of the linked blogpost 1/4
Screenshot of the text of the linked blogpost 2/4
Screenshot of the text of the linked blogpost 3/4
Screenshot of the text of the linked blogpost 4/4
~~ making sense of academic statistics ~~
i wrote about the confusing relationship between statistics and data analysis, and also about how statistics relates to science
#statistics #rstats #datascience
www.alexpghayes.com/post/making-...