It's harder to understand reality than we intuitively think
13.02.2026 16:51 โ ๐ 10 ๐ 2 ๐ฌ 0 ๐ 0@annemscheel.bsky.social
Assistant prof at Utrecht University, trying to make science as reproducible as non-scientists think it is. Blogs at @the100ci.
It's harder to understand reality than we intuitively think
13.02.2026 16:51 โ ๐ 10 ๐ 2 ๐ฌ 0 ๐ 0New paper, on a worrying trend in meta-science: the practice of anonymising datasets on, e.g., published articles. We argue that this is at odds with norms established in research synthesis, explore arguments for anonymisation, provide counterpoints, and demonstrate implications and epistemic costs.
13.02.2026 16:50 โ ๐ 92 ๐ 51 ๐ฌ 6 ๐ 7New blog post about the age-period-cohort identification problem!
In which, for the first time ever, I ask "What's the mechanism?" and also suggest that sometimes you may actually *not* be interested in causal inference.
www.the100.ci/2026/02/13/o...
Adding to the confusion, entrepreneur Dr. Jing Liang noted on X that publicly available trial data appear to show Moderna conducted analyses involving higher-dose comparators in older adults. If accurate, it would raise further questions about the basis for issuing a refusal-to-file letter in the first place. No matter which account ultimately proves correct, this episode leaves us with two uncomfortable possibilities. If the FDA changed its regulatory position after years of clinical development, that reinforces a persistent concern about regulatory inconsistency. If, alternatively, Moderna met the agencyโs highest level of stated expectations, actually did use a high-dose comparator and still received a refusal-to-file letter, the implications are even more serious, raising questions about transparency and internal coherence at the agency.
Great post @ruxandrabio.bsky.social on FDAโs refusal to review Modernaโs flu vaccine, changing its position on what trial design was acceptable.
Regulatory uncertainty doesnโt just mean missing out on this vaccine, but also reduces future R&D investment:
clinicaltrialsabundance.blog/p/the-modern...
Like the โuneducatedโ explanation, the โreligiousโ one depends on a confusion between correlation and causation. Even if all women today with large families are religious, not all women who are religious have large families. Religion is correlated with total fertility, but not obviously causal.12 That is, something else mediates the association between religion and fertility. We donโt know what that is.
Some people confuse correlation and causation, some people confuse causation and โdeterministic direct 1:1 causationโ ๐ฅฒ
11.02.2026 05:03 โ ๐ 55 ๐ 9 ๐ฌ 7 ๐ 3I agree that assumptions underlying tasks in dev psych are often heroic and untested, but I donโt see how that frees you from the responsibility of discussing the assumptions underlying your own task?
11.02.2026 09:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0I wasnโt expecting a response to my review obviously, but I was expecting a serious engagement with substantive issues affecting the interpretation of the results of this exceptionally big (and likely very impactful) study that you were made aware of before resubmitting this paper.
11.02.2026 09:06 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0The comparison is literally the first sentence of your Results section.
11.02.2026 09:01 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Then I misunderstood you. I thought with โsuch assumptionsโ you meant the assumptions I had listed above. So those are not mentioned/discussed in the paper, right?
bsky.app/profile/elma...
Re โexternal validityโ (it sounds like you mean criterion validity?): Iโm not claiming that your task doesnโt track _something_, but that it tracks a bunch of things beyond what you claim it does. That many of these are correlated with cognitive development is unsurprising but also not v informative
11.02.2026 08:39 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Very possible that I missed something! Can you show me where in the paper you mention these assumptions and discuss their implications?
11.02.2026 08:22 โ ๐ 0 ๐ 0 ๐ฌ 2 ๐ 0> inter-cultural differences may be due to changes/differences in any of these assumptions. I personally think that some of the group differences you found may quite plausibly be due to different priors about e.g. eyeball shape/size and distances between houses and hedges in different populations.
10.02.2026 21:02 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0As I wrote in my review, this means that you cannot interpret the โaccuracyโ of participantsโ responses (because there is no unambiguous ground truth). It also means that responses will depend on participantsโ assumptions about all parameters listed above. Developmental changes as well as >
10.02.2026 21:02 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0* Of course it _can_ be inferred, but that requires many more assumptions than you claim it does, namely about the size and shape of the agentโs eyes and the balloon, and about the distances between all relevant features in the video. The โtrueโ values for all these are undetermined (itโs a cartoon)
10.02.2026 20:47 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0The problem with your 2D cartoon abstraction is that it has no true solution. You coded the โtrueโ target location as some point on the screen, but that point canโt be inferred from the agentโs pupils in a test trial.*
The paper you linked seems to use the same task so Iโm not sure what it adds?
I reviewed an earlier submission of this paper and raised what I think is a very serious issue with your task โ in short, that the exact location of the target of the agentโs gaze is unknowable. Iโm surprised to see that this doesnโt seem to be acknowledged in the publication at all.
10.02.2026 16:09 โ ๐ 4 ๐ 2 ๐ฌ 1 ๐ 0I learned, in the same semester, how methodological rigor is the bedrock of all progress in psychological science, and that a moderated mediation model based on questionnaires from 86 undergrad students is rock-solid proof for a theory in social psychology.
06.02.2026 21:35 โ ๐ 10 ๐ 2 ๐ฌ 0 ๐ 0Interesting to know!
06.02.2026 22:26 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0The things I have in mind are neither new (ie, not a reaction to observed standards) nor very hard. Itโs really basic stuff like โdefine your target populationโ.
06.02.2026 22:24 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Yeah, good point and probably true. Though I feel like the disconnect starts in-house already โ at least in my experience, empirical practices and thesis projects donโt tend to live up to the intro stats ideal. But IMO thatโs because as researchers, weโre used to ignoring a lot of important steps.
06.02.2026 22:18 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0Fair!
06.02.2026 14:32 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0If this was indeed the core problem, I think there wouldโve been way more progress over the past 10 years. What I see now in paper after paper are fantasy constructs studied either in glorified convenience samples or with obviously terrible measures (or both) that correlate for trivial reasons.
06.02.2026 14:26 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0Yes, 100% this! Thanks for the tip, Iโll check out your paper.
06.02.2026 13:52 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0Thanks, but this strikes me as a very odd and unhelpful definition of โgarbageโ. If anything, I think one of the things that have hampered progress in psychology is an obsession with โtheoretical progressโ before establishing any robust phenomena that make mature theories possible in the first place
06.02.2026 13:51 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0โฆwhich includes an awful amount of active un-learning in psych. I just wonder if thatโs normal in, say, biology as well
06.02.2026 13:45 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0Of course thereโs variation. But what I have in mind are really basic things that are covered in all intro courses Iโve encountered so far. Sampling methods, validity, error inflation, you name it
06.02.2026 13:43 โ ๐ 11 ๐ 0 ๐ฌ 2 ๐ 0Is this a thing in other fields too?
The (IMO large) bad portion of the psych literature typically violates basic principles that we teach undergrads in intro methods & stats. Feeling increasingly embarrassed when I think of how awful published papers look compared to what Iโm asking of my students
Bad news: additional capacity goes to patients who look remarkably similar to those already receiving care. Hence, outcomes are nonreactive. Thus, expanding the workforce doesn't inevitably reach those with the greatest need. 7/n
05.02.2026 08:35 โ ๐ 10 ๐ 2 ๐ฌ 1 ๐ 2Literally
04.02.2026 08:41 โ ๐ 8 ๐ 0 ๐ฌ 0 ๐ 0violin plot of results by testosterone measure
violin plot of result by risk pref measure
New meta finds no correlation between testosterone and risk preference, buuuuut testosterone was mainly measured using 2D:4D (nonsense) or saliva (still bad) and we know from Frey et al. 2017 that the risk measures don't tap into a coherent preference.
www.sciencedirect.com/science/arti...