Histogram figure illustrating selection bias in estimating intervention effects. The central blue histogram shows the full population distribution of pre-scores (mean = 100, SD = 10). A green histogram (top left) shows a random sample of 1000 participants with similar mean (β100) and SD (β10), producing a raw intervention effect of ~8.7 and standardised mean difference (SMD) of ~0.86. A red histogram (top right) shows a conditionally sampled group (n = 1000, pre-score β₯ 110), with higher mean (β115) and smaller SD (β4.4). This sample yields a similar raw effect (~9) but an inflated SMD (~2.0) compared to the population. A corrected SMD brings it back closer to the true effect (~0.9). The figure illustrates how restricting samples to higher-performing individuals (e.g., βeliteβ athletes) compresses variance and artificially inflates standardised effect sizes, despite unbiased raw effects.
#SportScience, especially in elite sports, has an issue that can make effects seem more impressive than they really are... selection bias on the sample (explanationπ§΅ and a simulation pictured to illustrate below).
1/12
28.09.2025 18:58 β π 13 π 2 π¬ 1 π 1
Congrats Peder! See you soon in Eindhoven!
26.09.2025 14:03 β π 1 π 0 π¬ 1 π 0
Screenshot of first page of slidecrafting-book.com website
I'm exited to announce a new resource about making slides with quarto and revealjs. This book is the combination of all the work I have done in this area, reordered and polished up
There isn't a lot of new information yet, but this format allows me to add more easily
slidecrafting-book.com
#quarto
24.09.2025 16:12 β π 179 π 64 π¬ 11 π 6
Version 2.0.0 of quarto-revealjs-editable now lets you edit the text in divs
github.com/EmilHvitfeld...
#quarto
21.08.2025 18:30 β π 16 π 5 π¬ 0 π 2
So a spurious correlation from an observational study is βgold standard scienceβ, but testing vaccines in multiple RCTs with over 30,000 participants each is not?
23.09.2025 14:01 β π 86 π 13 π¬ 5 π 0
Figure 2 from preprint illustrating simulation workflow. From left to right, it goes from simulation grid, to generation function, to analysis function, to results table
Tidy simulation: Designing robust, reproducible, and scalable Monte Carlo simulations #StatsSky
arxiv.org/abs/2509.11741
It is not formally linked to the {tidyverse}, bit affinity is obvious.
The paper does a solid job in describing the simulation workflow, could be useful for intros to simulation
23.09.2025 09:25 β π 9 π 6 π¬ 0 π 0
Sage Journals: Discover world-class research
Subscription and open access journals from Sage, the world's leading independent academic publisher.
Excited to finally have my first paper officially published! "How Do Psychology Journals Handle Postpublication Critique?" is now online with AMPPS. Huge thanks to my supervisors @tomhardwicke.bsky.social and @simine.com and co-authors @bethclarke.bsky.social, N Moodie, S Schiavone, and R Thibault βΊοΈ
17.09.2025 23:23 β π 42 π 16 π¬ 1 π 3
37Β Performance β Model to Meaning
The new {marginaleffects} release for #RStats (0.30.0) comes with two new vignettes:
1. Speed up computation with automatic differentiation (often 10x gains) marginaleffects.com/bonus/perfor...
2. Power analyses with {marginaleffects} and {DeclareDesign}. marginaleffects.com/bonus/power....
13.09.2025 18:37 β π 145 π 34 π¬ 3 π 3
Time is running out to register for our pre-symposium workshop and Meta Research Symposium event at Paul Meehl Graduate School
Check the information and registration links below:
paulmeehlschool.github.io/workshops/
10.09.2025 13:53 β π 6 π 6 π¬ 0 π 1
A further problem is illustrated by the authors equating no treatment with placebo. For example, describing their second assumption, they state, βImplicit in this notation is that there is a single version of βno treatmentβ that is consistently defined across all subjects in the RCT and external controlsβ (p4). However, most clinical trials are add-on trials (Senn, 2002), even if not specifically identified as such. In placebo-controlled trials, one starts with standard of care and then subjects are either allocated to receive in addition the active treatment or a placebo to it. The statistical analysis plan for SUNFISH (F Hoffman-La-Roche Ltd, 2020) has a protocol summary as Appendix 1, which has this to say, βIn addition to the study drug treatment, patients may continue to receive concomitant drug medicationβ¦β (p186). This is very standard for clinical trials. It highlights that a key assumption in borrowing control data in this way is that there has been no evolution in the standard of care in the period since the trial.
One of the most common misunderstandings about the use and value of placebos in clinical RCTs, often made by both methodological experts and experienced trialists:
(from @stephensenn.bsky.social in academic.oup.com/jrsssa/advan...)
05.09.2025 05:52 β π 45 π 15 π¬ 3 π 0
Spending 3 full days coding up a simulation for a method that *should* be better, but performs identically to a common R function (that takes about 5 seconds to apply). Absolutely priceless.
#rstats
02.09.2025 15:45 β π 15 π 4 π¬ 0 π 1
I love looking at this graph, showing it to my students, and sharing it on social media, even though it unerringly brings out the trolls.
This is the efficacy curve of Pfizer's mRNA COVID vaccine.
This, people, is what ended the emergency phase of the COVID pandemic. Despite what RFK Jr says.
02.09.2025 03:31 β π 1522 π 525 π¬ 31 π 15
It was a rewarding labour of love to work on this paper with a fantastic team of authors, βUnderstanding Treatment Response Heterogeneity Using Crossover Randomized Controlled Trials: A Primer for Exercise and Nutrition Scientistsβ. @hk-ijsnem.bsky.social journals.humankinetics.com/view/journal...
28.08.2025 17:50 β π 12 π 6 π¬ 2 π 0
Models as Prediction Machines: How to Convert Confusing Coefficients into Clear Quantities
Abstract
Psychological researchers usually make sense of regression models by interpreting coefficient estimates directly. This works well enough for simple linear models, but is more challenging for more complex models with, for example, categorical variables, interactions, non-linearities, and hierarchical structures. Here, we introduce an alternative approach to making sense of statistical models. The central idea is to abstract away from the mechanics of estimation, and to treat models as βcounterfactual prediction machines,β which are subsequently queried to estimate quantities and conduct tests that matter substantively. This workflow is model-agnostic; it can be applied in a consistent fashion to draw causal or descriptive inference from a wide range of models. We illustrate how to implement this workflow with the marginaleffects package, which supports over 100 different classes of models in R and Python, and present two worked examples. These examples show how the workflow can be applied across designs (e.g., observational study, randomized experiment) to answer different research questions (e.g., associations, causal effects, effect heterogeneity) while facing various challenges (e.g., controlling for confounders in a flexible manner, modelling ordinal outcomes, and interpreting non-linear models).
Figure illustrating model predictions. On the X-axis the predictor, annual gross income in Euro. On the Y-axis the outcome, predicted life satisfaction. A solid line marks the curve of predictions on which individual data points are marked as model-implied outcomes at incomes of interest. Comparing two such predictions gives us a comparison. We can also fit a tangent to the line of predictions, which illustrates the slope at any given point of the curve.
A figure illustrating various ways to include age as a predictor in a model. On the x-axis age (predictor), on the y-axis the outcome (model-implied importance of friends, including confidence intervals).
Illustrated are
1. age as a categorical predictor, resultings in the predictions bouncing around a lot with wide confidence intervals
2. age as a linear predictor, which forces a straight line through the data points that has a very tight confidence band and
3. age splines, which lies somewhere in between as it smoothly follows the data but has more uncertainty than the straight line.
Ever stared at a table of regression coefficients & wondered what you're doing with your life?
Very excited to share this gentle introduction to another way of making sense of statistical models (w @vincentab.bsky.social)
Preprint: doi.org/10.31234/osf...
Website: j-rohrer.github.io/marginal-psy...
25.08.2025 11:49 β π 942 π 283 π¬ 49 π 19
Been working on a tutorial on synthetic data for open science for @lmu-osc.bsky.social
A draft version is now up: lmu-osc.github.io/synthetic-da...
It covers model building, evaluating synthetic data utility with density ratio estimation, and disclosure risk.
Feedback is very welcome!
21.08.2025 15:11 β π 38 π 13 π¬ 3 π 1
Happy to announce β¨quarto-revealjs-editableβ¨
This fully supersedes the imagemover extension, as I back then didn't realize the potential. You can now also move, resize, change font size and alignment for text in your slides
github.com/EmilHvitfeld...
#quarto #slidecrafting
20.08.2025 17:38 β π 119 π 38 π¬ 10 π 8
DSS Instructor Resources
Teaching/learning intro stats and R? #rstats
Explore my teaching materials, including syllabus, lecture slides, exercises, interactive graphs, and self-graded review exercises: ellaudet.github.io/dss_instructor_resources
Instructors using DSS: source files from PUP have just been updated!
20.08.2025 20:17 β π 30 π 9 π¬ 2 π 0
Meta Research Symposium 2025 PMGS
PMGS Meta Research Symposium 2025 16-17 October 2025, TU/e Eindhoven Conference website: https://paulmeehlschool.github.io/workshops/ Program Day 1 - Pre-Symposium Mini-Workshop Time Activityβ¦
The full program for the PMGS Meta Research Symposium 2025 is online: docs.google.com/document/d/1... If you are interested in causal inference, systematic review, hypothesis testing, and preregistration, join is October 17th in Eindhoven! Attendance is free!
20.08.2025 14:34 β π 21 π 13 π¬ 0 π 3
Symposium Program Announced | Paul Meehl Graduate School
We are pleased to announce that the program for the second Paul Meehl Graduate School Meta-Research Symposium is now available!...
The PMGS Meta-Research Symposium program is out!
π
Oct 16β17, Eindhoven University of Technology
π§βπ« Workshop: Aaron Peikert
π€ Keynotes: Duygu Uygun-TunΓ§ & Lisa Spitzer
π©βπ¬ Talks by early-career researchers
Check the program + register by Oct 1 π paulmeehlschool.github.io/2025-08-17-p...
18.08.2025 09:40 β π 6 π 4 π¬ 0 π 0
For those interested, here is a link to a new power paper:
Hancock, G. R., & Feng, Y. (2026). nmax and the quest to
restore caution, integrity, and practicality to the sample size planning process. Psychological Methods.
yifengquant.github.io/Publications...
19.08.2025 01:04 β π 63 π 24 π¬ 4 π 2
Excited to share my newest quarto revealjs plugin: imagemover
Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience
github.com/EmilHvitfeld...
#quarto
13.08.2025 19:20 β π 205 π 51 π¬ 8 π 10
A letter from myself and @jdwilko.bsky.social that expands on this a bit more. Thanks to Fertility and Sterility for the opportunity.
doi.org/10.1016/j.fe...
12.08.2025 09:53 β π 54 π 25 π¬ 2 π 3
Your intuitions about individual responses to training are probably wrong π
ππ§΅ 1/11
11.08.2025 10:16 β π 5 π 2 π¬ 1 π 0
New Preprint: Rethinking Type S and M Errors. We argue Type S and M errors are not useful when designing or evaluating studies, and present alternative solutions to address the issues they were supposed to address (minimum effect tests, and correcting for bias with p-uniform). osf.io/preprints/ps...
05.08.2025 14:35 β π 20 π 8 π¬ 1 π 2
Translation: The weighted average of the thing that can't be less than zero was very much significantly different from zero. Science!
23.07.2025 09:11 β π 10 π 1 π¬ 0 π 0
SOMEONE PLEASE STOP ME
23.07.2025 09:05 β π 3 π 1 π¬ 1 π 0
CLS is home to a unique series of UK national cohort studies. We conduct research and produce policy evidence to improve lives.
We are based at the IOE, UCLβs Faculty of Education and Society. We are funded by the ESRC.
https://cls.ucl.ac.uk
|| PhD Statistics | Research Software Engineer | Loves #rstats | Outdoors person | Coffee person | Photography person | Serial Hobbiest | he/him ||
Assoc Prof Computer Science and Communication Studies at Northwestern. Infovis, HCI. Author of tidybayes & ggdist R pkgs. he/him. π³οΈβπ https://mjskay.com/
Co-director https://mucollective.northwestern.edu
Co-founder https://journalovi.org
Independent R consultant. Apache Arrow PMC Member & #rstats π¦ maintainer.
Arrow course launching early 2026: https://big-data-r.thinkific.com/
More of my stuff at https://niccrane.com/
Professor of Medical Statistics @qmul.ac.uk. I study how to design evaluations of health technologies and health service interventions. Sometimes I just make patterns.
The Centre for Evaluation and Methods (CEM), based in the QMUL WIPH, specialises in evaluating the clinical effectiveness, cost-effectiveness and implementation of healthcare innovations, and methodology to support these evaluations
A national service delivered via 8 RSS Hubs, with Specialist Centres in public health and social care research, providing expert research and methods advice on the design and delivery of research.
Website: www.nihr.ac.uk/rss
Senior lecturer at the Department of Multilingualism at https://unifr.ch/. Master's student statistics at https://unibe.ch/. Saxophonist with https://beatmoustache.ch/. Aficionado of Danish ska.
Blog & website: https://janhove.github.io
Professor of Biostatistics
Vanderbilt University School of Medicine
Expert Biostatistics Advisor
FDA Center for Drug Evaluation and Research
https://hbiostat.org https://fharrell.com
Anthropologist - Bayesian modeling - science reform - cat and cooking content too - Director @ MPI for evolutionary anthropology https://www.eva.mpg.de/ecology/staff/richard-mcelreath/
Data Elixir is a weekly newsletter with curated data science picks from around the web. Subscribe at dataelixir.com and follow us here for selections between issues. Covering machine learning, data visualization, analytics, and strategy.
Clinical psychologist and psychotherapist, part-time researcher with a focus on suicide prevention and psychopharmacology.
https://ploederlm.github.io/publications/
https://scholar.google.at/citations?user=76cO6AEAAAAJ&hl=de
Nature, espresso, cycling.
https://emilhvitfeldt.com/
making modeling easier in #rstats with tidymodels at @posit.co
writing about feature engineering (https://feaz-book.com/) and Slidecrafting. He/Him
Director, Qualitative Data Repository (personal account).
Data, Zotero, Social Science Methods
https://sebastiankarcher.com
Wastewater Integrated Surveillance for Public Health in Europe. An initiative coming from HADEA. An EU Joint Action committed to enhance Wastewater Surveillance.
https://www.eu-wish.eu/
π pipe dreams #RStats #Python π @Posit.co (formerly RStudio) || βΆ βΆ βΆ βΆ ||
The Centre for Logic and Philosophy of Science (CLPS) at the Institute of Philosophy (@kuleuvenuniversity.bsky.social) focuses on #logic and #philsci, with a concentration on the philosophies of the special sciences β’ https://hiw.kuleuven.be/clps #philsky
An #openaccess journal from #TheBMJ for specialist research that promotes multidisciplinary collaboration to improve the health of patients
PhD Candidate, conducting meta-research in biomedicine, University of Sydney.
BA in psychology, University of Melbourne.
Other interests include dance, cultural astronomy, physiology, languages, and foraging.
Professor of Biostatistics. University of Melbourne & Murdoch Childrenβs Research Institute. Research in causal inference and missing data methods + child, lifecourse and social epidemiology