Ben Schneider 's Avatar

Ben Schneider

@bschneidr.bsky.social

Stats, surveys, R, and dogs. www.practicalsignificance.com

668 Followers  |  561 Following  |  245 Posts  |  Joined: 19.09.2023  |  2.2003

Latest posts by bschneidr.bsky.social on Bluesky

I have my students watch this now-classic video to get them to come to office hours for intellectual vibing (but so few come 😭)

08.10.2025 17:34 β€” πŸ‘ 48    πŸ” 6    πŸ’¬ 3    πŸ“Œ 2
Preview
Why most polls overstate support for political violence Misperceptions about the popularity of violence increase public support for it β€” but you can help change that.

Terrific piece from @gelliottmorris.com this morning.

Whenever you ask if something β€œmight” happen or β€œmay” be necessary, it’s very easy for respondents to think of scenarios where the answer is yes, no matter how unlikely.

07.10.2025 12:51 β€” πŸ‘ 38    πŸ” 14    πŸ’¬ 1    πŸ“Œ 1

I dare not

06.10.2025 20:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Learning about your statistics software’s quantile definitions is like learning about your knee cartilage or your house’s plumbing. You only find yourself learning about it if something’s gone wrong.

06.10.2025 17:44 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
Obama Aides Say 'Veep' More Accurate Than 'West Wing,' 'House of Cards' "You guys nail the fragility of the egos, and the, like, day-to-day idiocy of the decision-making"

Even back when it was airing, it was obviously an unrealistic fantasy of how persuasion works in politics and government. There are many reasons why Veep is viewed by politicians and staffers as the more true-to-life series. www.thewrap.com/obama-aides-...

05.10.2025 23:55 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

It’s not a physical paper, but The Banner does good local reporting for MoCo and Maryland.

05.10.2025 19:34 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

There has been basically no (public) reckoning with the democratic strategists who said that "raising the salience" of immigration/deportations in ~March would help Trump, which was obviously wrong at the time and has been proved wrong over the last 6 months www.gelliottmorris.com/p/kilmar-abr...

05.10.2025 13:44 β€” πŸ‘ 2219    πŸ” 463    πŸ’¬ 48    πŸ“Œ 26

Excellent reporting and data visualization in this story

05.10.2025 15:22 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

There’s no rule of journalistic neutrality that requires you to say something is β€œunclear” when that is objectively untrue. It would be illegal. There’s absolutely zero ambiguity about that fact. Saying it’s unclear is making a false statement in your reporting.

04.10.2025 04:45 β€” πŸ‘ 1328    πŸ” 372    πŸ’¬ 51    πŸ“Œ 9

House, PhD -

A drama, about a brilliant arrogant medical statistician and his team of graduate students - every week receives a new case: collected clinical data. Their job: to 'diagnose' what the research question is.

03.10.2025 13:55 β€” πŸ‘ 12    πŸ” 3    πŸ’¬ 2    πŸ“Œ 1
Quarto To Tell Your Story with Data GASP 2025

Quarto To Tell Your Story with Data GASP 2025

In June, I gave a webinar at @amstatnews.bsky.social GASP 2025, "Quarto--To Tell Your Story with Data"

and the recording is now online!

πŸ•ΈοΈ #RStats & #Python materials: ivelasq-gasp2025-quarto.share.connect.posit.cloud
πŸ“Ή Recording: www.youtube.com/watch?v=P5Yf...
πŸ™ GitHub: github.com/ivelasq/2025...

03.10.2025 15:30 β€” πŸ‘ 21    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Preview
Who Goes Nazi?, by Dorothy Thompson

Every few months now I re-read this "Who Goes Nazi?" piece from 1941 and am blown away by how it captures the people we are dealing with 80 years later.

harpers.org/archive/1941...

01.10.2025 23:59 β€” πŸ‘ 7932    πŸ” 3172    πŸ’¬ 242    πŸ“Œ 303
Post image

Imagine saying that you’re mad at a dad for requesting sidewalks to keep his disabled son safe. Just…saying it. Out loud. To a reporter.

www.thebanner.com/community/lo...

01.10.2025 23:36 β€” πŸ‘ 64    πŸ” 13    πŸ’¬ 4    πŸ“Œ 0
Video thumbnail

We are thrilled to announce that our NEW Large Language Model will be released on 11.18.25.

01.10.2025 14:38 β€” πŸ‘ 24615    πŸ” 8215    πŸ’¬ 648    πŸ“Œ 2183
Preview
Government watchdog websites go dark as OMB withholds funds from IG committee The Council of the Inspectors General on Integrity and Efficiency is funded largely through a no-year revolving account, rather than the regular appropriations that lapsed Oct. 1.

NEW: At least 10 watchdog websites are dark after OMB withheld funds from a watchdog council, removing access to thousands of watchdog reports and required whistleblower links.

OMB's move to withhold funds - which has received Republican pushback - is separate from the shutdown.

01.10.2025 18:10 β€” πŸ‘ 31    πŸ” 23    πŸ’¬ 2    πŸ“Œ 2

Essentially, you might have categories β€œa”, β€œb”, β€œc” (unordered) or β€œlow”, β€œmedium”, β€œhigh” (ordered). For the former you would set ordered=FALSE, for the latter you might want to set ordered=TRUE, so that certain R functions know that your categorical variable is ordinal.

01.10.2025 00:56 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
16Β  Factors – R for Data Science (2e)

Factors are meant to represent categorical variables, and ordinal categorical variables are a special case. Section 16.6 in the book β€œR for Data Science” explains the distinction between unordered and ordered factors:

r4ds.hadley.nz/factors.html...

01.10.2025 00:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Excited to introduce Vibes from Meta. Eat your slop, piggies!

30.09.2025 17:59 β€” πŸ‘ 7388    πŸ” 2674    πŸ’¬ 115    πŸ“Œ 202
The following R code:

library(dplyr)
library(forcats)

# Order the `species` levels by frequency
  penguins2 <- penguins |>
    mutate(
      species = fct_infreq(factor(species))
    )

# Check levels before reordering
  penguins$species |> table()

# Check levels after reordering
  penguins2$species |> table()

The following R code: library(dplyr) library(forcats) # Order the `species` levels by frequency penguins2 <- penguins |> mutate( species = fct_infreq(factor(species)) ) # Check levels before reordering penguins$species |> table() # Check levels after reordering penguins2$species |> table()

Yeah, exactly. That's the right tool for this.

30.09.2025 14:53 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Many of you have asked about gifts to us during a shutdown, mainly doordash types of things. No. Instead, donate to local food banks like Manna in MoCo, Capital Area Food Bank in DC, Find Food in PGC, and Food for Others in Fairfax. These serve everyone. Please repost. #fedstrong #shutdown

30.09.2025 12:43 β€” πŸ‘ 164    πŸ” 89    πŸ’¬ 0    πŸ“Œ 2

Yep. It’s very common and reasonable in social surveys. For instance in NHANES or NHATS where sensitive detailed information is used in weighting but can’t be shared without greatly risking respondent confidentiality.

29.09.2025 20:03 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In general I think it's hard to combat scientific misinformation when some of the best research is locked behind an academic paywall, while lots of nonsense gets published free for everyone to read in predatory journals.

28.09.2025 17:25 β€” πŸ‘ 498    πŸ” 125    πŸ’¬ 20    πŸ“Œ 30

Problem #2 is pretty self-evident. Once you include more predictors in your model, you have a different model. This can be annoying in terms of interpreting the model, but it can be a more serious problem if, say, the added predictors are collider variables.

29.09.2025 15:12 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0

Problem #1 is elaborated on in this response from Breidt and Opsomer to the famous Gelman paper on weighting: projecteuclid.org/journals/sta...

29.09.2025 15:10 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Raking A Survey to Another Survey – Practical Significance When raking survey data to improve representativeness, we ideally would have population data to use for raking. But sometimes all we have is data from another survey. What should we do in this situati...

New blog post on the right and wrong way to rake a survey's data to estimates from another survey. As an example, it shows what can go wrong when we rake data from a poll to benchmark data from Pew's National Public Opinion Reference Survey.

www.practicalsignificance.com/posts/npors-...

28.09.2025 21:00 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

There’s good literature on this. But there are two problems with this approach. #1 is the data usually doesn’t include all the variables used in weighting, to protect confidentiality. #2 is that by including all these vars in the model, you get a different model than the one you wanted.

28.09.2025 16:56 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I’ve had very good experiences with using β€˜uv’ instead of PyEnv/poetry/etc. and with using Positron instead of VS Code.

27.09.2025 23:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

A little bit of self-promotion, I @ivelasq3.bsky.social will be giving a talk on Positron for RStudio Users!

Hoping to share some perspectives, extensions, and other tips in case you're interested in exploring Positron.

Sign up here: www.meetup.com/rladies-gabo...

25.09.2025 17:57 β€” πŸ‘ 20    πŸ” 10    πŸ’¬ 0    πŸ“Œ 0
Preview
Did an honesty researcher fabricate data? Duke professor and behavioral scientist Dan Ariely has been accused of using falsified data in research into ways to make people more honest. New info makes the case against him look stronger.

I mean, Dan Ariely is also infamous for polluting academic literature and pop psychology with dirty data.

www.npr.org/2023/07/28/1...

25.09.2025 16:43 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Q: how did you come to know him?
A: I know a lot of painters
Q: but he wasn't just a painter
A: True. A lot of people don't realize his main interest was architecture!
Q: and his political career?
A: I don't know much about that.
Q: the war?
A: I don't mind if we go light on the Nazi stuff

23.09.2025 19:56 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

@bschneidr is following 20 prominent accounts