Tom Carpenter, PhD

@tcarpenter.bsky.social

🧪 Data science, survey science, social science
💻 Director of Data Science @ Microsoft Garage [Posts do not represent my employer]
🧮 Stats, R, Python
Science, Research: measurement, social biases, emotion. Ex-academic but scientist at heart

3,477 Followers  |  3,850 Following  |  478 Posts  |  Joined: 07.05.2023  |  2.0062

Latest posts by tcarpenter.bsky.social on Bluesky

45. Academia doesn't reward building useful tools nearly as much as it should

28.07.2025 00:44 - 👍 28    🔁 4    💬 2    📌 1

14. We mostly evaluate latent variable models with the equivalent of Rorschach tests

27.07.2025 16:20 - 👍 8    🔁 1    💬 1    📌 0

5. You should use a precision-recall curve for a binary classifier, not an ROC curve

27.07.2025 13:42 - 👍 23    🔁 2    💬 1    📌 0
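
The post does not give its reasoning, so the following is only a sketch of one common argument for preferring precision-recall summaries: under heavy class imbalance, ROC AUC can look strong while precision stays poor. The data are synthetic and the helper names (`roc_auc`, `average_precision`) are hypothetical, written in plain Python rather than taken from any library.

```python
def roc_auc(labels, scores):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values observed at the rank of each positive."""
    ranked = sorted(zip(scores, labels), reverse=True)  # highest score first
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Synthetic, deliberately imbalanced data (an assumption for illustration):
# 1,000 negatives with evenly spread integer scores, and 10 positives whose
# scores sit above 951 of the negatives and below the remaining 49.
neg_scores = [100 * i for i in range(1000)]   # 0, 100, ..., 99900
pos_scores = [95001 + i for i in range(10)]   # 95001, ..., 95010
scores = neg_scores + pos_scores
labels = [0] * len(neg_scores) + [1] * len(pos_scores)

auc = roc_auc(labels, scores)
ap = average_precision(labels, scores)
print(f"ROC AUC = {auc:.3f}, average precision = {ap:.3f}")
```

In this constructed case the ROC AUC is 0.951 while average precision is roughly 0.10, because the 49 negatives ranked above every positive dominate the top of the list. Whether that makes ROC curves the wrong tool in general depends on the use case.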

Wow. Scientists have edited mosquito DNA to prevent the spread of malaria to humans "while supporting essential physiological functions... and negligible fitness costs" to the mosquito population.

Potentially ending the mosquito-borne spread of malaria to humans.

www.nature.com/articles/s41...

25.07.2025 15:23 - 👍 1103    🔁 313    💬 43    📌 45

… set of paths consistently supported by the data. Even getting that down is a trick. And making sense of it is fraught and doesn’t get you much further than one would get from regression. But at least then we would have some confidence we understand the correlational relationships!

23.07.2025 16:18 - 👍 0    🔁 0    💬 0    📌 0

… the model is correct and then gives you what the path would be under that specification. There’s nothing different when we go to SEM, other than that your ability to p-hack goes up exponentially. IMO this would be a great place to use machine learning approaches to train / tune models to find …

23.07.2025 16:18 - 👍 0    🔁 0    💬 1    📌 0

… all those hypotheses together (in the same way that ANOVA can test multiple comparisons at once). There’s nothing different between this and running a bunch of regressions and claiming the results support the way you specified those models. In reality, it’s the reverse. Regression assumes …

23.07.2025 16:18 - 👍 0    🔁 0    💬 1    📌 0

Yes, and I see this a lot in social too. Proper use of SEM implies a particular philosophy of hypothesis testing in regression contexts. An omitted path hypothesizes that the path is exactly 0. A non-omitted path hypothesizes that it is non-zero. Model fit is effectively the joint set of …

23.07.2025 16:18 - 👍 0    🔁 0    💬 1    📌 0

Yikes!

23.07.2025 03:52 - 👍 1    🔁 0    💬 1    📌 0

… SEM for causal discovery. However, if you have a good read on the causal process, it can be great for estimating parameters such as factor loadings or paths with latent variables.

23.07.2025 03:35 - 👍 5    🔁 0    💬 0    📌 0

This is probably not anything you don’t already know… But I did a lot of SEM work and will repeat it anyway. The model assumes you know the causal structure. Fit indices will confirm that the model is a fit to the data, but many incorrect models can fit the data. So I would not use …

23.07.2025 03:34 - 👍 9    🔁 1    💬 2    📌 1
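
The claim that many incorrect models can fit the same data can be illustrated with a textbook pair of equivalent models. This is a minimal sketch under stated assumptions, not anything from the thread itself: three standardized variables X, Y, Z, hypothetical path values `a` and `b`, and a hypothetical helper `implied_cov` that computes the model-implied covariance matrix for an acyclic path model as (I - B)^-1 Ψ (I - B)^-T.

```python
def matmul(a, b):
    """Plain-Python matrix product for small square matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def implied_cov(paths, resid_var):
    """Model-implied covariance of an acyclic path model.

    paths[i][j] is the effect of variable j on variable i. For an acyclic
    model the path matrix is nilpotent, so (I - B)^-1 = I + B + ... + B^(n-1).
    """
    n = len(paths)
    identity = [[float(i == j) for j in range(n)] for i in range(n)]
    total, power = identity, identity
    for _ in range(n - 1):
        power = matmul(power, paths)
        total = [[total[i][j] + power[i][j] for j in range(n)] for i in range(n)]
    psi = [[resid_var[i] if i == j else 0.0 for j in range(n)] for i in range(n)]
    total_t = [list(col) for col in zip(*total)]
    return matmul(matmul(total, psi), total_t)


a, b = 0.6, 0.5  # hypothetical standardized path coefficients

# Model 1: X -> Y -> Z (variable order is X, Y, Z)
forward = [[0, 0, 0], [a, 0, 0], [0, b, 0]]
cov_forward = implied_cov(forward, [1.0, 1 - a * a, 1 - b * b])

# Model 2: Z -> Y -> X, the opposite causal story
reverse = [[0, a, 0], [0, 0, b], [0, 0, 0]]
cov_reverse = implied_cov(reverse, [1 - a * a, 1 - b * b, 1.0])

max_gap = max(abs(cov_forward[i][j] - cov_reverse[i][j])
              for i in range(3) for j in range(3))
print("largest difference between implied covariances:", max_gap)
```

Both models imply the same covariance matrix (up to floating-point error), so any fit index computed from that matrix cannot tell the two causal directions apart.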

Curious how this compares to the cost of living per state

19.07.2025 20:27 - 👍 2    🔁 0    💬 0    📌 0
09.07.2025 05:30 - 👍 5    🔁 0    💬 0    📌 0
Plot that depicts the average importance people in my data assign to their friendships (y-axis, on a scale from 1 to 5, depicted with 95% confidence intervals) by their age (x-axis, from 18 to 60).

Depicted are 3 different ways to model importance of friends as a function of age.
Using age as a linear predictor: this imposes a linear trajectory which comes with very tight confidence intervals (i.e., uncertainty is low).
Using age as a categorical predictor: this imposes no trajectory whatsoever but instead simply reproduces the means by age. The confidence intervals are very wide, in particular for those ages not well represented in the data (i.e., uncertainty is high).
Age splines: This results in a smooth trajectory that follows some of the bumps in the data, but not all of them. The confidence intervals are somewhere between the linear and the categorical case (i.e., uncertainty is medium)

Let's say you want to include age as a predictor in your model. How do you do that?

Here's an illustration of three options -- it's for a paper I'm working on (so if you feel like anything could be tweaked...).

08.07.2025 12:03 - 👍 159    🔁 31    💬 33    📌 2
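
As a reading aid, the three options in the plot description can be written as three specifications of the conditional mean. This is a sketch, not the paper's notation: the basis functions B_k and the number of basis terms K are assumptions, since the post does not say which spline was used, and the block assumes the amsmath and amssymb packages.

```latex
\begin{align*}
\text{Linear:} \quad
  & \mathbb{E}[y \mid \text{age}] = \beta_0 + \beta_1\,\text{age} \\
\text{Categorical:} \quad
  & \mathbb{E}[y \mid \text{age}] = \sum_{a=18}^{60} \mu_a\,\mathbf{1}[\text{age} = a] \\
\text{Spline:} \quad
  & \mathbb{E}[y \mid \text{age}] = \beta_0 + \sum_{k=1}^{K} \beta_k\,B_k(\text{age})
\end{align*}
```

The linear model spends 2 parameters on age, the categorical model up to 43 (one mean per year of age from 18 to 60), and the spline K + 1. That ordering matches the tight, wide, and intermediate confidence intervals the description reports.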

There should be a corner at Home Depot where a guy with a table saw will slice you off custom lengths of hot dog from an infinite hot dog coming out of the wall

05.07.2025 00:15 - 👍 4023    🔁 610    💬 89    📌 52
Adele shattering a glass in her hand

There were two girls at Wawa just now talking about funny movies and one said, “Have you ever seen the movie Office Space? It’s an old people movie but it’s funny”

25.06.2025 01:10 - 👍 6995    🔁 379    💬 421    📌 63

Counterpoint: I think the ability to chat with an article or a body of literature, and to find patterns in our own work that we perhaps missed, has a lot of potential to augment our scientific work.

24.06.2025 19:09 - 👍 0    🔁 0    💬 1    📌 0
Post image 24.06.2025 03:02 - 👍 0    🔁 0    💬 0    📌 0
Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek
This study examines information suppression mechanisms in DeepSeek, an open-source large language model (LLM) developed in China. We propose an auditing framework and use it to analyze the model's res...

🤖 Thrilled to share our latest work ☄️

Have you ever wondered what LLMs know but they are not saying?

We built an auditing framework to study information suppression in LLMs, and demonstrated it to quantify and characterize censorship in DeepSeek.

Read more:

arxiv.org/abs/2506.12349

22.06.2025 23:52 - 👍 7    🔁 2    💬 1    📌 0

Pleased to share our ICML Spotlight with @eberleoliver.bsky.social, Thomas McGee, Hamza Giaffar, @taylorwwebb.bsky.social.

Position: We Need An Algorithmic Understanding of Generative AI

What algorithms do LLMs actually learn and use to solve problems? 🧵 1/n
openreview.net/forum?id=eax...

20.06.2025 15:48 - 👍 148    🔁 37    💬 3    📌 3

building intuition around systems of matrix data and how we manipulate them should be right after (or right before) basic calc (integrals, derivatives, partial derivatives)

18.06.2025 05:29 - 👍 1    🔁 0    💬 2    📌 0

Why?

18.06.2025 03:50 - 👍 0    🔁 0    💬 1    📌 0

Someone please explain why linear algebra isn’t taught more in high school in the United States? Seems like maybe if you’re lucky you get a few lectures on matrices and that’s it.

18.06.2025 03:46 - 👍 6    🔁 0    💬 2    📌 0

Is it 2015 or 2025?

17.06.2025 15:26 - 👍 2    🔁 0    💬 0    📌 0

“Data available upon request”

17.06.2025 15:25 - 👍 7    🔁 0    💬 1    📌 0
Electricity Prices Are Surging. The G.O.P. Megabill Could Push Them Higher.

Under the Trump agenda, energy will cost more. And when energy costs more, everything costs more.
www.nytimes.com/2025/06/04/c...

08.06.2025 17:39 - 👍 23    🔁 12    💬 1    📌 1

Many academics are taught “do more” instead of “prioritize”. They are taught the academic superhero myth, that “truly great/smart” scholars can handle it. So they sabotage their own success in service to the cult of personality

05.06.2025 14:00 - 👍 7    🔁 2    💬 0    📌 0
Stability and Change in Subjective, Psychological, and Social Well-Being: A Latent State-Trait Analysis of Mental Health Continuum-Short Form in Korea and the Netherlands - PubMed
Mental well-being consists of hedonic/subjective, psychological, and social dimensions. Research has yet to determine how much of the variance in these three dimensions is stable or variable over time...

Indeed!

pubmed.ncbi.nlm.nih.gov/35917203/

05.06.2025 13:56 - 👍 1    🔁 0    💬 0    📌 0

Galaxy brain: what does anything measure?

05.06.2025 13:35 - 👍 1    🔁 0    💬 1    📌 0

I love this technique because it gives a cool way to isolate change and stability components of within-person measurement using latent variables. The stable portion of IATs is far more predictive of other individual-difference measures than one would think given traditional scoring/analyses

04.06.2025 22:59 - 👍 7    🔁 1    💬 2    📌 0
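
Assuming the technique referred to here is latent state-trait modeling, as the PubMed article linked in this thread suggests, the separation of stability and change can be sketched with the textbook decomposition below. The symbols are illustrative and are not taken from the post or the cited paper, which may use a different parameterization. The block assumes the amsmath package.

```latex
\begin{align*}
Y_{it} &= T_i + S_{it} + \varepsilon_{it} \\
\operatorname{Var}(Y_{it}) &= \operatorname{Var}(T_i) + \operatorname{Var}(S_{it})
  + \operatorname{Var}(\varepsilon_{it}) \\
\text{Consistency} &= \frac{\operatorname{Var}(T_i)}{\operatorname{Var}(Y_{it})},
\qquad
\text{Occasion specificity} = \frac{\operatorname{Var}(S_{it})}{\operatorname{Var}(Y_{it})}
\end{align*}
```

Here Y_it is person i's score at occasion t, T_i the stable trait component, S_{it} the occasion-specific state residual, and ε_it measurement error, with the three components assumed mutually uncorrelated. The "stable portion" in the post corresponds to T_i in this notation.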