Donald Szlosek

@dszlosek.bsky.social

Biostatistician @IDEXX formerly at harvardmed, @BIDMChealth, @nasa. Big data, clinical trials, and medical diagnostics. Mainer. Opinions are my own. he/him

812 Followers  |  4,424 Following  |  168 Posts  |  Joined: 13.11.2024

Latest posts by dszlosek.bsky.social on Bluesky

"Uncooperative statistician": the term used (typically by a senior clinician) to describe a well-trained and knowledgeable statistician who refuses to conduct flawed or fraudulent research.

03.10.2025 12:59 | 👍 52    🔁 14    💬 3    📌 4

If you've ever wanted to learn how to make beautiful websites with #QuartoPub and #rstats , check out this workshop I'm giving in a couple weeks! It'll be a blast (and we're covering Quarto's brand new _brand dot yaml system!)

03.10.2025 18:36 | 👍 85    🔁 28    💬 3    📌 1
[Chart: "What Americans die from and the causes of death the US media reports on"]

Causes of death in the US in 2023: Heart disease (29%), Cancer (26%), Accidents (9.5%), Stroke (6.9%), Lower respiratory diseases (6.2%), Alzheimer's disease (4.8%), Diabetes (4.0%), Kidney failure (2.4%), Liver disease (2.2%), Suicide (2.1%), COVID-19 (2.1%), Influenza/Pneumonia, Homicide (<1%), Terrorism (<0.001%).

Media coverage of these causes of death in 2023 in The New York Times, The Washington Post and Fox News, as ranges across the outlets: Homicide (42% to 52%), Terrorism (11% to 18%), Drug overdose (7.5% to 9.8%), COVID-19 (5.3% to 7.9%), Accidents (5.9% to 9.7%), Suicide (3.3% to 4.1%), Cancer (3.8% to 4.7%), Heart disease (2.8% and 2.9%, two values shown).

Note: Based on the share of causes of death in the US and the share of mentions for each of the causes in the New York Times, the Washington Post and Fox News. All values are normalized to 100%, so the shares are relative to all deaths caused by the 12 most common causes + drug overdoses, homicides and terrorism. These causes account for more than 75% of deaths in the US.
A "media mention" is a published article in one of the outlets which mentions the cause (e.g. "influenza") or related keywords (e.g. "flu") at least twice.
Data sources: Media mentions from Media Cloud (2025); deaths data from the US CDC (2025) and Global Terrorism Index.
CC BY

Nice chart from @ourworldindata.org showing the contrast between what Americans die of (heart disease and cancer) v what the US media reports on (homicide and terrorism). This naturally leads to it being trickier to build a fact based world view
ourworldindata.org/does-the-new...

06.10.2025 18:19 | 👍 120    🔁 66    💬 5    📌 3

If I'm doing a lot of written work, I would get tired of writing the sigma notation or the triangular numbers you suggested. I see this as an easy and fast way of saving my wrists from cramping!

07.10.2025 01:09 | 👍 0    🔁 0    💬 0    📌 0

In Ch 19 (nyu-cdsc.github.io/learningr/as...) of his 2nd edition, Kruschke used *residual* SD as a standardizer for group differences from a multilevel ANCOVA. Is there any precedent for using a *residual* SD as a standardizer for a standardized mean difference effect size? #RStats

06.10.2025 16:06 | 👍 14    🔁 5    💬 3    📌 0
What is the term for a factorial type operation, but with summation instead of products? (Pardon if this seems a bit beginner, this is my first post in math - trying to improve my knowledge while tackling Project Euler problems) I'm aware of Sigma notation, but is there a function/nam...

great discussion: math.stackexchange.com/questions/60...

06.10.2025 15:14 | 👍 3    🔁 0    💬 0    📌 0
Post image 06.10.2025 15:14 | 👍 0    🔁 0    💬 0    📌 0
Post image

I always wondered if there was a shorthand for summation similar to factorials #math #mathsky #statssky #statistics

06.10.2025 15:13 | 👍 8    🔁 1    💬 3    📌 0
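
The shorthand asked about in this post corresponds to the triangular numbers mentioned in the reply above: the sum 1 + 2 + ... + n has the closed form n(n+1)/2. A minimal Python sketch follows; the function name `triangular` is invented here for illustration and does not come from the post or the linked discussion.

```python
def triangular(n: int) -> int:
    """Return 1 + 2 + ... + n using the closed form n(n+1)/2."""
    if n < 0:
        raise ValueError("n must be non-negative")
    # n * (n + 1) is always even, so integer division is exact.
    return n * (n + 1) // 2


# The closed form agrees with the explicit sum.
print(triangular(10))          # 55
print(sum(range(1, 11)))       # 55
```

The closed form avoids looping, which is the same convenience the factorial notation gives for products.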

Thinking about odds ratios...

An odds is a ratio of events to non-events. For example, if the event is survival, the odds of survival is the number of survivors per death. If the event is getting a disease, the odds is the number of diseased individuals per healthy individual.

24.04.2025 15:51 | 👍 31    🔁 6    💬 1    📌 4
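
The definition in this post can be made concrete with a small calculation. The sketch below is only an illustration: the counts (80 survivors and 20 deaths in one group, 60 and 40 in another) and the function names `odds` and `odds_ratio` are hypothetical, not data or code from the post.

```python
def odds(events: int, non_events: int) -> float:
    """Odds = number of events per non-event."""
    if non_events == 0:
        raise ValueError("odds are undefined when there are no non-events")
    return events / non_events


def odds_ratio(a_events: int, a_non: int, b_events: int, b_non: int) -> float:
    """Ratio of the odds in group A to the odds in group B."""
    return odds(a_events, a_non) / odds(b_events, b_non)


# Hypothetical group A: 80 survivors, 20 deaths -> 4 survivors per death.
print(odds(80, 20))                 # 4.0
# Hypothetical group B: 60 survivors, 40 deaths -> 1.5 survivors per death.
print(odds(60, 40))                 # 1.5
# Odds ratio A vs B: 4.0 / 1.5, about 2.67.
print(odds_ratio(80, 20, 60, 40))
```

Note that the odds of 4.0 in group A correspond to a survival probability of 0.8, so odds and probabilities are different scales for the same counts.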

For me this is a hard red line in psychological science. If you advocate the use of "silicon samples" you do not understand what it is we're supposed to be doing (and likely don't understand LLMs, or are a grifter). Luckily I haven't seen much of this among people I'd consider my peer group.

04.10.2025 08:27 | 👍 58    🔁 13    💬 1    📌 2
The best evidence Tylenol causes autism isn't great On Monday, RFK Jr announced Tylenol 'causes' autism referencing three studies as evidence. Let's dive in.

If you've been following the RFK Jr autism news, then you've probably heard that there's a systematic review "proving" Tylenol causes autism.

Here's my review of that paper👇🏼

open.substack.com/pub/epiellie...

25.09.2025 20:20 | 👍 579    🔁 171    💬 43    📌 14

Which one should I do next? The big Swedish study that RFK & his buddies pretend doesn't exist? Or one of the other 2 studies he mentioned at the press conference?

Vote by commenting!

27.09.2025 18:11 | 👍 32    🔁 8    💬 6    📌 0

It is *impossible* to "adjust for socioeconomic status" in a regression model. Discuss.

And good morning! 🌞

26.09.2025 05:50 | 👍 54    🔁 8    💬 12    📌 5

Just posted an updated/revised version of this "Statistical Methods in Public Policy Research" chapter, now under review post-R&R 🤞

I'm kinda partial and unbiased here, but I really really like this piece!

HTML/PDF: stats.andrewheiss.com/snoopy-spring/
SocArXiv: doi.org/10.31235/osf...

26.09.2025 15:41 | 👍 65    🔁 12    💬 3    📌 0
Post image

The single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices.

Encoding matrices as graphs is a cheat code, making complex behavior simple to study. #Statistics #Mathemathis #Math

Excellent example from @tivadardanka.bsky.social

26.09.2025 16:36 | 👍 6    🔁 0    💬 0    📌 0
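
One way to see the matrix/graph correspondence in this post is to read each nonzero entry as a weighted edge. The sketch below assumes the convention that entry (i, j) is an edge from node i to node j (the referenced example may use the opposite direction), and the helper names `matrix_to_edges` and `matmul` are made up for illustration.

```python
def matrix_to_edges(matrix):
    """Read each nonzero entry matrix[i][j] as a weighted edge i -> j."""
    return [
        (i, j, weight)
        for i, row in enumerate(matrix)
        for j, weight in enumerate(row)
        if weight != 0
    ]


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


# A 3-node directed cycle: 0 -> 1 -> 2 -> 0.
A = [
    [0, 1, 0],
    [0, 0, 1],
    [1, 0, 0],
]

print(matrix_to_edges(A))   # [(0, 1, 1), (1, 2, 1), (2, 0, 1)]

# Under this convention, entry (i, j) of A*A counts two-step walks from i to j.
print(matmul(A, A))         # [[0, 0, 1], [1, 0, 0], [0, 1, 0]]
```

Squaring the matrix moves every node two steps around the cycle, which is the kind of behavior that is easier to read off the graph than off the entries.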
R+AI 2025

rconsortium.github.io/RplusAI_webs...

26.09.2025 15:07 | 👍 0    🔁 0    💬 0    📌 0
Post image

R+AI - Join us at R+AI 2025, our inaugural conference dedicated to the open-source R community and every facet of artificial intelligence - 100% online

26.09.2025 15:06 | 👍 2    🔁 0    💬 1    📌 0

Data Science programs often put too little emphasis on causal inference, and it's hurting their graduates on the job market! The econometrics people are coming for your jobs lol

26.09.2025 02:04 | 👍 61    🔁 10    💬 8    📌 3
Statistical Society of Australia - World Statistics Day Webinar – "Data we can trust"

Will I be celebrating my birthday on October 20th? No!

Will I be celebrating World #Statistics Day by joining several much cooler panelists to discuss the topic "data we can trust"? Absolutely!

πŸ—“οΈ October 20th, 1pm AEDT
πŸ“ Online webinar
πŸ”— statsoc.org.au/event-6365055

#statssky #databs #datascience

26.09.2025 02:59 β€” πŸ‘ 14    πŸ” 2    πŸ’¬ 0    πŸ“Œ 1

Tell me something you do when you code that other people would tell you that you shouldn't do.

Tell me the rules you break!

I'll go first: I work in untitled files in the wrong project directories all the time. Like, all the time. Yes, I do tend to lose things 😂 #databs #rstats #python

24.09.2025 05:52 | 👍 147    🔁 26    💬 78    📌 13

Did you know there are two contests happening right now? ✨

📊 Table Contest: use any #RStats or #Python package.
📈 Plotnine Contest: use the Plotnine #Python package.

Show off your skills, share your work, and earn some well-deserved open-source cred!

Learn more: posit.co/blog/announc...

24.09.2025 18:39 | 👍 9    🔁 3    💬 0    📌 0
Post image

Anyone else ever feel like this? #StatsSky #Statistics

23.09.2025 19:45 | 👍 18    🔁 1    💬 1    📌 2

A nice discussion on stats.exchange on central limit theorem including how Socrates would have handled it #Statistics #StatsSky: stats.stackexchange.com/questions/47...

23.09.2025 15:56 | 👍 1    🔁 0    💬 0    📌 0
Jason Brinkley talks about his job search in AMSTAT News.

@amstatnews.bsky.social covers @drjasonbrinkley.bsky.social 's recent job search. It is a great piece, #statssky #hpss #geronsky and #AIsky.

23.09.2025 12:15 | 👍 4    🔁 3    💬 2    📌 0
It's JAMA time, baby! Junk science presented as public health research | Statistical Modeling, Causal Inference, and Social Science

It's JAMA time, baby! Junk science presented as public health research
statmodeling.stat.columbia.edu/2025/09/22/i...

22.09.2025 15:53 | 👍 7    🔁 1    💬 0    📌 0
The Essentials of R for Statistical Analysis - CSCU The aim of this workshop is to teach essential concepts and skills for conducting statistical analyses in R so that participants have the tools they need to work with and analyze their own datasets. T...

Tomorrow (9/23) is the final session of our Essentials of R for Statistical Analysis workshop series. Open to all @cornelluniversity.bsky.social. Join us at 2pm!
cscu.cornell.edu/workshop/the...
@cornellgrad.bsky.social @research-and-innovation.cornell.edu

22.09.2025 14:53 | 👍 2    🔁 1    💬 0    📌 0

I wish more meta analyses reported on the change in primary endpoint between trial registration and publication.

22.09.2025 12:12 | 👍 2    🔁 0    💬 0    📌 0

A common challenge in measurement in any context. E.g. some hospitals have worse outcomes because they take the hardest cases *because* they are the best hospitals. It's a different kind of problem from that of e.g. university rankings, which just flat out measure the wrong things.

21.09.2025 17:51 | 👍 22    🔁 8    💬 1    📌 0
From Zero to a Dockerized Development Environment in Minutes with GitHub Repository Templates An effective approach for setting up a development environment

The following tutorial focuses on efficiently setting up a new Dockerized development environment with minimal time using GitHub repository templates with VScode and the Dev Containers extension.

The tutorial is available on Medium:

medium.com/p/6193f6d4ecb4

#docker #github #mlops #vscode

21.09.2025 14:00 | 👍 8    🔁 1    💬 1    📌 0

I was messing around with a few beta regression packages around that time. Needed survey weights so landed on glmmTMB

21.09.2025 04:12 | 👍 0    🔁 0    💬 0    📌 0
