Clintin Davis-Stober's Avatar

Clintin Davis-Stober

@clintin.bsky.social

Professor, quantitative psychology, decision theory, data science, mathematics, statistics, open science, modeling, weight lifting, photography, enjoyer of poetry www.davis-stober.com

2,509 Followers  |  2,071 Following  |  51 Posts  |  Joined: 18.09.2023
Posts Following

Posts by Clintin Davis-Stober (@clintin.bsky.social)

  The decline effect (Protzko & Schooler, 2017) is an observed phenomenon where effect sizes in experiments apparently diminish in size from the first paper demonstrating the effect to later replications. This has been taken as a symptom of an unhealthy scientific ecosystem, possibly caused by the "winner's curse" (selection on significance and regression to the mean), publication bias or opportunistic analyses. I show that decline effects can arise as an artifact from a much simpler source: the original article determining the sign of the effect in a meta-analysis. Moreover, such artifactual decline effects will show correlations with some of the same experimental properties that one would expect from biases from poor behavior, such as the sample size of the original study.

The decline effect (Protzko & Schooler, 2017) is an observed phenomenon where effect sizes in experiments apparently diminish in size from the first paper demonstrating the effect to later replications. This has been taken as a symptom of an unhealthy scientific ecosystem, possibly caused by the "winner's curse" (selection on significance and regression to the mean), publication bias or opportunistic analyses. I show that decline effects can arise as an artifact from a much simpler source: the original article determining the sign of the effect in a meta-analysis. Moreover, such artifactual decline effects will show correlations with some of the same experimental properties that one would expect from biases from poor behavior, such as the sample size of the original study.

New draft: "Decline effects, statistical artifacts, and a meta-analytic paradox". In this manuscript I show how a common practice in meta-analysis (eg the 2015 Open Science Collaboration) creates artifactual signatures of poor scientific behavior. PDF: raw.githubusercontent.com/richarddmore... 1/x

02.02.2026 14:56 β€” πŸ‘ 77    πŸ” 29    πŸ’¬ 7    πŸ“Œ 4
Preview
a man with a mustache is standing in front of a sign that says farm ALT: a man with a mustache is standing in front of a sign that says farm
02.12.2025 19:11 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

My front yard :)

16.10.2025 18:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Simonsohn has now posted a blog response to our recent paper about the poor statistical properties of the P curve. @clintin.bsky.social and I are finishing up a less-technical paper that will serve as a response. But I wanted to address a meta-issue *around* this that may clarify some things. 1/x

25.09.2025 10:07 β€” πŸ‘ 76    πŸ” 30    πŸ’¬ 2    πŸ“Œ 8

I love this post about science and metascience. A lot of quotables but I’ll lead with this:

β€œThose seeking a scientific method – one that can be written down and followed mechanically […] – betray a kind of childish impatience with a process they clearly don’t understand.”

14.08.2025 03:16 β€” πŸ‘ 36    πŸ” 7    πŸ’¬ 2    πŸ“Œ 2
Preview
AI slop and the destruction of knowledge This week I was looking for info on what cognitive scientists mean when they speak of β€˜domain-general’ cognition. I was curious, because the nuances are relevant for something I am researching at t…

AI slop and the destruction of knowledge irisvanrooijcogsci.com/2025/08/12/a...

12.08.2025 22:12 β€” πŸ‘ 524    πŸ” 266    πŸ’¬ 22    πŸ“Œ 50
Preview
RFK Jr. in interview with Scripps News: β€˜Trusting the experts is not science’ HHS Secretary RFK Jr. sat down with Scripps News for a wide-ranging interview, discussing mRNA vaccine funding policy changes and a recent shooting at the Centers for Disease Control and Prevention.

1. "'Trusting the experts is not a feature of either a science or democracy," Kennedy said."

It's literally a vital feature of both science and of representative democracy.

I've written a fair bit about trust in expertise as a vital mechanism in the collective epistemology of science.

12.08.2025 04:48 β€” πŸ‘ 9978    πŸ” 2849    πŸ’¬ 534    πŸ“Œ 476

We proved they could not scale link.springer.com/article/10.1...

11.08.2025 18:28 β€” πŸ‘ 75    πŸ” 23    πŸ’¬ 4    πŸ“Œ 3

Definitely something worth digging into. I’ll give it some thought

10.08.2025 20:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
The case for formal methodology in scientific reform | Royal Society Open Science Current attempts at methodological reform in sciences come in response to an overall lack of rigor in methodological and scientific practices in experimental sciences. However, most methodological ref...

here was our call for methodological standards in metaresearch four years ago. instead of getting fixated on a particular inference we'd like to make, we need to maintain scientific standards, do the hard work, respect the evidence. we can't keep jumping at self-serving solutions without question.

10.08.2025 15:40 β€” πŸ‘ 21    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

some of the discussion around the p-curve paper is depressing. i still see many missing the point clearly stated in the conclusion, and instead of demanding strong standards or questioning whether they're even asking good questions, they've immediately started asking for replacement methods.

10.08.2025 15:36 β€” πŸ‘ 30    πŸ” 7    πŸ’¬ 3    πŸ“Œ 0

I would add that while pcurve is comprised of tests, that indeed correspond to error rates, the actual hypotheses being tested have little to do with β€œevidential value”

10.08.2025 16:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Test statistics being used are just simple sums, no 3rd moment information enters the test.

10.08.2025 12:36 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Would this be grounds for dismissing the remaining 54 studies as lacking value? It makes no sense. Part of the problem is that the original p curve papers aren’t clear on what exactly is being tested. The authors claim they are tests of skew, but this is incorrect as the

10.08.2025 12:36 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Happy to clarify. Pcurve is used to test whether a set of studies have (or lack) β€œevidential value” (which is not really defined). But the actual hypotheses being tested by pcurve don’t permit this, as Richard and I show. Suppose one study WAS underpowered in a set of 55 studies -

10.08.2025 12:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

All pcurve tests are just a simple sum of transformed pvalues. There is a fundamental disconnect between the null hypotheses being tested by p-curve and the claims being made.

09.08.2025 21:18 β€” πŸ‘ 15    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

What this means is that a significant result for either test only allows one to claim that β€œat least one” study (out of the set) doesn’t have the property being considered. Why does this happen? Because pcurve completely ignores the configuration of the pvalues being considered.

09.08.2025 21:18 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The test for evidential value simply examines whether the effect size is zero for all studies. The test for lack of evidential value tests whether all studies are β€œunderpowered”, I.e., have small non-centrality parameters.

09.08.2025 21:18 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The developers of p-curve claim that p-curve can be used to make claims about the evidential value (or lack thereof) of whole sets of studies. We show that the actual hypotheses being tested do not allow for such strong conclusions.

09.08.2025 21:18 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The basic idea of p-curve rests on the idea that the skew of a set of p-values is informative about whether QRPs are occurring. As we show, the p-curve tests have nothing to do with skew. It is trivial to create left skewed pvalues that p-curve would confidently label as right skewed.

09.08.2025 21:18 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

New paper with @richarddmorey.bsky.social now out in JASA, where we critically examine p-curve. Below is Richard’s excellent summary of the many poor statistical properties of p-curve (with link to paper). I wanted to add some conceptual issues that we also tackle in the paper.

09.08.2025 21:18 β€” πŸ‘ 52    πŸ” 20    πŸ’¬ 2    πŸ“Œ 2

New paper by my PhD student @semihaktepe.bsky.social now published πŸš€

"Revisiting the effect of discrepant perceptual fluency on truth judgments" tinyurl.com/2eepue5y

--> Two experiments & a meta-analysis indicate that high visual contrast does not lead to higher truth judgments.

12.06.2025 22:38 β€” πŸ‘ 14    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1

I'm so sorry this happened to you. There is no excuse for such bs.

11.06.2025 19:50 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Why Trump’s push for β€˜gold-standard science’ has researchers alarmed Many scientists fear the Trump administration’s new standard means putting political appointees in charge, which could undercut independent research.

wherein I'm quoted with "Science is essentially effective to the extent that it can remain independent, decentralized, and democratic." (gift link, email required)

31.05.2025 13:10 β€” πŸ‘ 87    πŸ” 22    πŸ’¬ 3    πŸ“Œ 2
Postdoc

πŸš€Postdoc position @unimarburg.bsky.social in the project:

"Bridging the Gap Between Verbal Psychological Theories & Formal Statistical Modeling with Large Language Models"
(funded by @volkswagenstiftung.de)

πŸ“…Start: 01.10.2025 | ⏳4 years
πŸ”— Apply now: uni-marburg.de/jhbCen
πŸ”„ Thanks for sharing!

28.05.2025 11:52 β€” πŸ‘ 28    πŸ” 23    πŸ’¬ 1    πŸ“Œ 1
Table 1
Typology of traps, how they can be avoided, and what goes wrong if not avoided. Note that all traps in a sense constitute category errors (Ryle & Tanney, 2009) and the success-to-truth inference (Guest & Martin, 2023) is an important driver in most, if not all, of the traps.

Table 1 Typology of traps, how they can be avoided, and what goes wrong if not avoided. Note that all traps in a sense constitute category errors (Ryle & Tanney, 2009) and the success-to-truth inference (Guest & Martin, 2023) is an important driver in most, if not all, of the traps.

NEW paper! πŸ’­πŸ–₯️

β€œCombining Psychology with Artificial Intelligence: What could possibly go wrong?”

β€” Brief review paper by @olivia.science & myself, highlighting traps to avoid when combining Psych with AI, and why this is so important. Check out our proposed way forward! πŸŒŸπŸ’‘

osf.io/preprints/ps...

14.05.2025 21:23 β€” πŸ‘ 348    πŸ” 105    πŸ’¬ 15    πŸ“Œ 25

Great pic

20.05.2025 18:53 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Screenshot of the APS program for "Current Issues in Meta-Science", Saturday, May 24, 3:00pm

Screenshot of the APS program for "Current Issues in Meta-Science", Saturday, May 24, 3:00pm

Titles and authors in the session:

* Statistical Power in the Light of Methodological Reform (Jolynn Pek)

* The Poor Statistical Properties of the P-Curve Procedures (Richard Morey)

* Consistent Methods Protect Against False Findings Produced By p Hacking (Duane Wegener)

* Accumulating Evidence across Studies (Blakeley McShane)

Titles and authors in the session: * Statistical Power in the Light of Methodological Reform (Jolynn Pek) * The Poor Statistical Properties of the P-Curve Procedures (Richard Morey) * Consistent Methods Protect Against False Findings Produced By p Hacking (Duane Wegener) * Accumulating Evidence across Studies (Blakeley McShane)

If you'll be at APS2025 in DC next week, I'll be talking about the terrible statistical properties of the p curve procedure in the "Current Issues in Meta-Science" session Saturday, May 24th at 3pm. This will likely be my last US conference in a very long time. A brief summary follows. 1/

15.05.2025 07:25 β€” πŸ‘ 47    πŸ” 18    πŸ’¬ 2    πŸ“Œ 2

I am looking forward to expanding the scope of my professorship by combining cognitive and statistical modeling with LLMs😊

There will be two job openings for postdoc positions soon - one starting in September 2025 and another one a year later.

23.04.2025 10:26 β€” πŸ‘ 26    πŸ” 7    πŸ’¬ 0    πŸ“Œ 0

Another reminder that this online event will take place this Friday. I might be alone representing the West Coast (it's at 7am PT) but hope colleagues from other time zones consider stopping by and joining the discussion. My talk will be not-so-subtly titled "Replication Is (Not)".

21.04.2025 21:14 β€” πŸ‘ 38    πŸ” 14    πŸ’¬ 3    πŸ“Œ 1