Frank Harrell's Avatar

Frank Harrell

@f2harrell.bsky.social

Professor of Biostatistics Vanderbilt University School of Medicine Expert Biostatistics Advisor FDA Center for Drug Evaluation and Research https://hbiostat.org https://fharrell.com

7,426 Followers  |  134 Following  |  883 Posts  |  Joined: 12.10.2023  |  2.0596

Latest posts by f2harrell.bsky.social on Bluesky

Anyone who gets to work with Aki is very lucky …

06.10.2025 11:38 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

The extremely problematic use of change scores is so poorly understood by researchers that it’s almost sickening. Most don’t even understand what is needed for the subtraction operator to work. hbiostat.org/bbr/change

06.10.2025 11:36 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

An excellent argument. One related way I try to get course managers’ attention is to quote the cost of SPSS licenses to students once they leave the comfort of the university site license. The cost is obsene, for a product that does far less than R, & LLM can even help students learn to program R

04.10.2025 12:09 β€” πŸ‘ 14    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Good grief ICE!

04.10.2025 11:26 β€” πŸ‘ 11    πŸ” 3    πŸ’¬ 2    πŸ“Œ 0

If you've ever wanted to learn how to make beautiful websites with #QuartoPub and #rstats , check out this workshop I'm giving in a couple weeks! It'll be a blast (and we're covering Quarto's brand new _brand dot yaml system!)

03.10.2025 18:36 β€” πŸ‘ 85    πŸ” 28    πŸ’¬ 3    πŸ“Œ 1

"Uncooperative statistician": the term used (typically by a senior clinician) to describe a well-trained and knowledgeable statistician who refuses to conduct flawed or fraudulent research.

03.10.2025 12:59 β€” πŸ‘ 52    πŸ” 14    πŸ’¬ 3    πŸ“Œ 4
Goal-Driven Flexible Bayesian Design – Statistical Thinking The majority of clinicals trials that are successfully launched end with equivocal results, with confidence intervals that are too wide to allow drawing a conclusion other than β€œthe money was spent”. ...

Flexible Goal-Drive Bayesian Design - video of presentation is now available at www.fharrell.com/talk/gdesign/ #Statistics #StatsSky #bayes #rct

30.09.2025 14:12 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 0    πŸ“Œ 2

In addition an equally important requirement is that the study is actually designed, i.e., that the investigators cared enough about the question to get funding for prospective data collection, QC, and unbiased outcome evaluation.

30.09.2025 14:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
August 2025 Top 40 New CRAN Packages – R Works Here are my picks for the best from the ninety-seven new packages that landed on CRAN in August, organized in eighteen categories: Causal Inference, Data, Differential Privacy, Ecology, Environmental ...

August 2025's Top 40 picks are out, highlighting excellent #RStats packages from Differential Privacy to Toxicology to Causal Inference.

This marks 1 year of Top 40 posts since we started R Works! πŸŽ‰

Check out the post here: rworks.dev/posts/august...

29.09.2025 21:53 β€” πŸ‘ 12    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0

After decades as a teacher experiencing such issues, my conclusion is to have a single pre-course personal laptop R setup help session and put an absolute requirement that setup be completed a week before class starts.

30.09.2025 12:40 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Welcome! You are invited to join a webinar: Research Methods Academy: Seminar - The Importance of Sample Size for the Development and Evaluation of Clinical Prediction Models. After registering, you w... Led by Professor Richard Riley Join this Online Seminar on clinical prediction models estimate individual risk for outcomes from disease. Many studies lack rigorous design; this session will focus o...

Giving a talk today which anyone can watch online: 1pm - 2pm
"The Importance of Sample Size for the Development and Evaluation of Clinical Prediction Models"
- intended for a broad audience (anyone doing or using prediction model research)
imperial-ac-uk.zoom.us/webinar/regi...

30.09.2025 07:55 β€” πŸ‘ 13    πŸ” 7    πŸ’¬ 0    πŸ“Œ 1

But I’d rather see such laziness than have researchers say β€œconsuming soy lowers the risk of breast cancer” etc. And I’d like to see the modest β€œpredicts” used more often, supplemented with β€œfurther understanding of pathways is required for a causal conclusion about the predictor”.

30.09.2025 12:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Thanks. It may be interesting to study the meaning of estimands when not conditioning on a strong factor when the estimator is non-collapsible as with binary Y.

29.09.2025 20:00 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It would be interesting to see an example where that is reasonable. I know this situation exists but can’t believe it would be very common. We also have the age variable even though it raises a privacy issue, hence age is sometimes truncated at 80 or so to prevent re-identification.

29.09.2025 17:58 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

If X is predictive of outcome and X was used in probability sampling but X was not stored in the dataset, isn’t that a major design flaw?

29.09.2025 13:19 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Illogic of Weighting While Estimating Model Parameters Consider a situation where some types of subjects are over- or under-sampled from a population where the sampling probabilities are known. For example, to increase efficiency we might sample a greate...

But see discourse.datamethods.org/t/illogic-of...

29.09.2025 13:00 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

fharrell.com/post/marg

29.09.2025 12:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

That’s the main reference, a simple demonstration of how unadjusted estimates are ill-defined and easily manipulated by subject inclusion criteria even with complete homogeneity of effects.

29.09.2025 12:49 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The main reason Revenge Dump's in office again and not in prison where he belongs is cos Mitch McConnell violated his oath of office and didn't lead the Senate to vote for impeachment. Thanks Mitch that decision will haunt you to your grave and as long as there's still recorded history afterwards.

28.09.2025 05:08 β€” πŸ‘ 2784    πŸ” 667    πŸ’¬ 146    πŸ“Œ 35
Preview
Reviewer notes: In a randomized experiment, the pre-post differences are not effect estimates Reviewer notes are a new short format with brief explanations of basic ideas that might come in handy during (for example) the peer-review process. They are a great way to keep Julia from writing 10,0...

Psychologists are also definitely guilty of confusing change with treatment response, which is why I wrote about it here:

www.the100.ci/2025/01/22/r...

27.09.2025 12:40 β€” πŸ‘ 31    πŸ” 8    πŸ’¬ 1    πŸ“Œ 3

It’s not just confounding; when the outcome is binary or time-to-event, failure to condition on important prognostic factors results in ill-defined effect estimates even with zero confounding.

28.09.2025 13:17 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0

The fact that most meta-analyses use unadjusted estimates casts doubt on the meta-analysis field.

28.09.2025 13:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

β€œDoing research is a choice, and unless you’re involved in some urgent projectβ€”curing a disease or winning a war or righting some injustice or raising living standards or whateverβ€”or some interesting projectβ€”baseball statistics or the theory of random walks or whateverβ€”you shouldn’t do it.”

27.09.2025 04:22 β€” πŸ‘ 39    πŸ” 4    πŸ’¬ 5    πŸ“Œ 2

Always a good read

27.09.2025 12:14 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I am sickened that journals are still letting authors compute change from baseline in parallel group studies. hbiostat.org/bbr/change

27.09.2025 12:05 β€” πŸ‘ 18    πŸ” 9    πŸ’¬ 0    πŸ“Œ 2
Preview
Avoiding One-Number Summaries of Treatment Effects for RCTs with Binary Outcomes – Statistical Thinking This article presents an argument that for RCTs with a binary outcome the primary result should be a distribution and not any single number summary. The GUSTO-I study is used to exemplify risk differe...

Join the club. There is nothing wrong with ORs. Like all other one-number summaries it’s best to accompany them with n-number summaries: www.fharrell.com/post/rdist/i... #EpiSky #StatsSky

25.09.2025 13:13 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 2    πŸ“Œ 1

What a refreshing change from the Gideon bible I find in so many U.S. hotel rooms.

25.09.2025 13:07 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
22Β  Controlling Ξ± vs.Β Probability of a Decision Error – Biostatistics for Biomedical Research

#Statistics thought of the day: To the entire non-statistical world, 'error' means 'mistake' but not in frequentism. Type I assertion probability Ξ± isn't P(mistake in asserting ΞΈ>0), i.e., P(ΞΈ <= 0 | p < 0.05); the assertion is made without knowing ΞΈ. hbiostat.org/bbr/alpha #StatsSky #bayes

19.09.2025 15:28 β€” πŸ‘ 13    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0

E Fieller also had a famous saying called β€œFieller’s forecasting lemma”: β€œGiven them a number or give them a date, but never both.”

19.09.2025 11:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

And I thought it was only 92% :-)

17.09.2025 11:19 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@f2harrell is following 19 prominent accounts