Ben Schneider

@bschneidr.bsky.social

Stats, surveys, R, and dogs. www.practicalsignificance.com

823 Followers 649 Following 327 Posts Joined Sep 2023
1 hour ago

The documentation of base functions can be pretty terrible. But I do appreciate that CRAN imposes some minimum documentation requirements rather than being totally laissez-faire.

3 1 1 0
2 days ago

Still thinking about this preprint. ~ 40% of Mechanical Turk responses were from AI bots. That means Mechanical Turk is now machines pretending to be people pretending to be machines.

osf.io/preprints/ps...

13 4 0 1
3 days ago

I am so happy for this announcement!! Heather is such an incredible community partner and deeply knowledgeable of base R. This is great news for R core and the broader R community. 🎉❤️🙌🤩 #rstats

26 9 0 0
3 days ago

So very exciting and well deserved!

11 3 0 0
3 days ago

Absolutely fantastic news for #RStats 🎉 Heather has been integral in efforts to secure the long-term sustainability of The R Project, and building the R community, especially for folks from groups underrepresented in software. She's also an excellent R contributor herself. Very well deserved!

32 4 0 0
3 days ago

the non closed form integrals are the worst thing to ever happen to me or anyone

13 2 0 0
5 days ago
LLM-Assisted Issue Triage for Open Source Maintainers Nic Crane

I built a GitHub issue classifier for Apache Arrow issue language using {ellmer} - super simple and almost 100% accuracy. Blog post: niccrane.com/posts/llm-issue-triage/

#rstats #ai #llms

12 4 1 0
5 days ago
Friedman Unit - Wikipedia

Or roughly 0.08 Friedman Units

en.wikipedia.org/wiki/Friedma...

1 0 0 0
6 days ago
Some in the Trump administration are pushing to rebuild IES After DOGE gutted IES, the education and statistics agency inside the Education Department, some in the Trump administration are trying to rebuild it. A new report of ideas on how to do that was relea...

“not one of the recommendations was a new idea to NCES,” said Peggy Carr. “Many had already been implemented or we were working on when the center was dismantled.”

Great reporting by @jillbarshay.bsky.social hechingerreport.org/proof-points...

25 11 1 1
6 days ago

We released DuckDB v1.5!

This release comes with a “friendly CLI” client, a new (opt-in) PEG parser, support for VARIANT types and many lakehouse features. It also ships a new network stack, a reworked geospatial extension, Azure writes and an ODBC scanner.

Read more at duckdb.org/2026/03/09/a...

63 16 0 1
6 days ago

The sky is not falling; high-quality platforms (Prolific, Verasight, CR Connect) have low rates of apparent bots. osf.io/preprints/ps... But also not zero; vigilance is very much needed!

105 49 1 2
1 week ago
R-Ladies branded graphic with purple-to-blue gradient background. The heading reads "Our Programs" in white. Six program cards are arranged in a 2-by-3 grid: Mentoring (connecting new chapter organizers with experienced leaders), RoCur (rotating curation on Bluesky spotlighting community voices), Abstract Review (80+ reviewers helping members submit to conferences), Community Slack (a safe space for connecting, learning, and sharing), Blog (tutorials, stories, and career journeys from our community), and YouTube (event recordings and learning materials).

R-Ladies is more than meetups. Our programs:

🤝 Mentoring — pairing new organizers with experienced ones
📣 RoCur — rotating curation on Bluesky
📝 Abstract Review — 80+ reviewers for conference submissions
💬 Community Slack
📖 Blog & YouTube

rladies.org

#RLadiesIWD2026

8 6 0 0
1 week ago
GitHub - Felixmil/quarto-envelope Contribute to Felixmil/quarto-envelope development by creating an account on GitHub.

Happy that I just released my first #quarto extension: quarto-envelope ✉️

I had 100+ birth announcement letters to write, so I built a Quarto/Typst extension to generate print-ready PDFs programmatically from R.

I hope it will be helpful for the #rstats community!

github.com/Felixmil/qua...

10 2 3 0
1 week ago

Live and lapply()

4 2 0 0
1 week ago

The sapply() who loved me

36 5 3 0
1 week ago

A Quarto of Solace

28 2 0 1
1 week ago
i love data, me too meme
190 43 11 11
1 week ago
README

Just learned about the delightful R package ‘fcuk’ to help users correct typos while coding:

cran.r-project.org/web/packages...

1 0 0 0
1 week ago

Usually I can find something to appreciate and treat it as a learning experience. Like when I first had to use Python I enjoyed learning about comprehensions and itertools. It helps counterbalance the ick from things like Pandas or overstuffed Jupyter notebooks.

5 0 1 0
1 week ago

It’s usually easy, but sometimes it gets stressful to meet the short turnaround time for addressing CRAN check warnings/notes before your package gets archived.

3 0 1 0
1 week ago

With very large numbers of n’s you don’t need randomization, and with LLMs we can generate very large numbers of n’s, so I think all of science is solved by now. I don’t see any problems with this.

96 18 5 1
1 week ago

If only AI / ML had been around when I was training, I wouldn’t have had to learn about things like causal inference, how to evaluate prediction models or even, say, the importance of data quality. What a waste of time all that was!

6 1 1 0
1 week ago
Screenshot of both sides of the printable version of the cheatsheet.
Screenshot of the web version of the recipes cheatsheet.

#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio

49 14 0 1
1 week ago

I just learned that Ayatollah Khamenei and Ayatollah Khomeini are not the same person. Here's my plan for regime change in Iran....(1/23)

3,847 591 41 25
2 weeks ago

There's a moment in every data engineer's career when they discover they can query a 10GB Parquet file on their laptop in seconds.

That's the DuckDB moment.

It changes how you think about what requires a cluster and what doesn't. Spoiler: most things don't.

ssp.sh/blog/enterp...

54 5 0 1
2 weeks ago

……. Deep cut

46 6 2 0
2 weeks ago

THERE IS ONLY ONE TRUE WAY TO CODE AND IT IS TIDY. All others will perish on the altar of messiness. MUAHAHAHAHAAAAAAAAAAAAAA

25 6 1 0
2 weeks ago

The more I learn about #rstats the more excited I get. We have a rich ecosystem of tools / libraries such as #shiny or @quarto.org that I honestly feel like I can do anything.

There's tremendous opportunity in corporations to improve and transform their workflow and reporting capabilities.

17 4 2 0
2 weeks ago

that’s a big selling point for weighted bootstraps (and things like Fay’s method), so that you don’t get a bad bootstrap sample that breaks your model

3 0 1 0
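
To illustrate the point in the post above about weighted bootstraps, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method from the thread: the toy data (18 "common" units and 2 "rare" units), the exponential weights (one common choice for a weighted bootstrap), and every function name are hypothetical, and Fay's method is not implemented. The idea it shows is that a classical resampling bootstrap can give a whole group zero weight in a replicate, which can break a model that needs that group, whereas strictly positive random weights keep every observation in every replicate.

```python
import random


def resample_bootstrap(n, rng):
    """Classical bootstrap: each weight is the number of times a unit is drawn."""
    counts = [0] * n
    for _ in range(n):
        counts[rng.randrange(n)] += 1
    return counts


def weighted_bootstrap(n, rng):
    """Weighted bootstrap (assumed exponential weights), rescaled to sum to n.

    Exponential draws are positive with probability one, so no unit is dropped.
    """
    raw = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(raw)
    return [n * w / total for w in raw]


def group_is_missing(weights, groups, target):
    """True if every unit in the target group received zero weight."""
    return sum(w for w, g in zip(weights, groups) if g == target) == 0


rng = random.Random(2024)  # fixed seed so the run is repeatable
groups = ["common"] * 18 + ["rare"] * 2  # hypothetical toy data
n = len(groups)
n_reps = 2000

missing_classical = sum(
    group_is_missing(resample_bootstrap(n, rng), groups, "rare")
    for _ in range(n_reps)
)
missing_weighted = sum(
    group_is_missing(weighted_bootstrap(n, rng), groups, "rare")
    for _ in range(n_reps)
)

print("replicates with no 'rare' units (classical):", missing_classical)
print("replicates with no 'rare' units (weighted): ", missing_weighted)
```

With this toy setup, the chance that a single classical replicate contains no "rare" units is 0.9^20, roughly 12%, so a noticeable share of replicates would be unusable for any model that needs both groups. The weighted version never loses a group because every weight stays above zero.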