Ask copilot or chat gpt. It can output a csv no sweat.
06.12.2025 15:52 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0@tcarpenter.bsky.social
๐งช Data science, survey science, social science ๐ป Director of Data Science @ Microsoft Garage [Posts do not represent my employer] ๐งฎ Stats, R, python ๐ Science, Research: measurement, social biases, emotion. Ex-academic but scientist at heart
Ask copilot or chat gpt. It can output a csv no sweat.
06.12.2025 15:52 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Possibly. We might also admit some things are not very knowableโ without other โways of knowingโ
28.11.2025 21:49 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0I often hear researchers feeling bad about this. But itโs called prioritization. The key is to do it intentionally and strategically. You have finite time and canโt do everything.
28.11.2025 20:39 โ ๐ 4 ๐ 1 ๐ฌ 1 ๐ 0On the one hand, people need better education about AI
On the other hand, do we even understand what it means to be conscious in the first place?
Ah, but read what I said. Iโm not saying itโs impossible to do valid online research. I am saying that volume will need to shrink. Itโs going to be harder to stay on top of this. Some folks will give up or mess it up. There will be an arms race. Reviewers will start raising flags. Etc.
19.11.2025 15:50 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0The volume of social psych papers that ran on prolific and mturk is going to need to shrink soon, and that will have a big impact on a lot of small labs
19.11.2025 08:19 โ ๐ 5 ๐ 2 ๐ฌ 1 ๐ 0A huge pain, but doable with undergrad participants essentially for free (or at least historically so).
Iโm not saying what is rightโฆ Iโm saying what is driving behavior
If we want this to change, then we need to make it feasible
TMP factor: Time, money, pain
Whether a study gets done depends a lot on how difficult it is. The question is one of ROI. I suspect many researchers are thinking to themselves, โthatโs a great thing, but itโs not something I can doโ
My 11-year-old sitting with her pile of Halloween candy, sorting it into a bar graph
We have progressed from data collection to data analysis.
01.11.2025 00:31 โ ๐ 34956 ๐ 4152 ๐ฌ 996 ๐ 367ladies and gentlemen...we got him
30.10.2025 19:10 โ ๐ 18350 ๐ 4070 ๐ฌ 178 ๐ 186* hoppening
10.10.2025 04:20 โ ๐ 1018 ๐ 148 ๐ฌ 12 ๐ 3But where does 1a say anything about personhood? Iโm reading it and it seems clear that you canโt restrict speechโand that would obviously apply to one or more people organized under a LLC. IMO the bigger question is โwhen is money free speech, vs when is it corruptionโ
04.09.2025 04:24 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0Slide titled: "Assumptions of the model and model checking" with a scatterplot with axes how much people should worry vs how much people do worry.
this slide is from a colleague's introductory stats course, I think it fits many statisticians' experiences
18.08.2025 00:31 โ ๐ 93 ๐ 24 ๐ฌ 8 ๐ 245. Academia doesn't reward building useful tools nearly as much as it should
28.07.2025 00:44 โ ๐ 35 ๐ 4 ๐ฌ 2 ๐ 114. We mostly evaluate latent variable models with the equivalent of Rorschach tests
27.07.2025 16:20 โ ๐ 11 ๐ 1 ๐ฌ 1 ๐ 05. You should use a precision-recall curve for a binary classifier, not an ROC curve
27.07.2025 13:42 โ ๐ 23 ๐ 2 ๐ฌ 1 ๐ 1Wow. Scientists have edited mosquito DNA to prevent the spread of malaria to humans "while supporting essential physiological functions... and negligible fitness costs" to the mosquito population.
Potentially ending the mosquito-born spread of malaria to humans.
www.nature.com/articles/s41...
โฆ set of paths consistently supported by the data. Even getting that down is a trick. And making sense of it is fraught and doesnโt get you much further than one would get from regression. But at least then we would have some confidence we understand the correlational relationships!
23.07.2025 16:18 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0โฆ the model is correct and then gives you what the path would be under that specification. Thereโs nothing different when we go to SEM other than your ability to p-hack goes up exponentially. IMO this would be a great place to use machine learning approaches to train / tune models to find โฆ
23.07.2025 16:18 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0โฆ all those hypotheses together (in the same way that ANOVA contest many multiple comparisons at once). Thereโs nothing different between this and running a bunch of regressions and claiming the results support the way you specified those models. In reality, itโs the reverse. Regression assumes โฆ
23.07.2025 16:18 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Yes and see this a lot in social too. Proper use of SEM implies a particular philosophy of hypothesis testing in regression contexts. An omitted path is hypothesizing that path is exactly 0. A non-omitted path hypothesizing it is non-zero. Model fit is effectively the joint set of โฆ
23.07.2025 16:18 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Yikes!
23.07.2025 03:52 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0โฆ SEM for causal discovery. However, if you have a good read on the causal process, it can be great for estimating parameters such as factor, loadings or paths with latent variables
23.07.2025 03:35 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0This is probably not anything you donโt already know โฆ. But I did a lot of SEM work and will repeat it anyway. The model assumes you know the causal structure. Fit indices will confirm that the model is a fit to the data, but many incorrect models can fit the data. So I would not use โฆ
23.07.2025 03:34 โ ๐ 9 ๐ 1 ๐ฌ 2 ๐ 1Curious how this compares to the cost of living per state
19.07.2025 20:27 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0Plot that depicts the average importance people in my data assign to their friendships (y-axis, on a scale from 1 to 5, depicted with 95% confidence intervals) by their age (x-axis, from 18 to 60). Depicted are 3 different ways to model importance of friends as a function of age. Using age as a linear predictor: this imposes a linear trajectory which comes with very tight confidence intervals (i.e., uncertainty is low). Using age as a categorical predictor: this imposes no trajectory whatsoever but instead simply reproduces the means by age. The confidence intervals are very wide, in particular for those ages not well represented in the data (i.e., uncertainty is high). Age splines: This results in a smooth trajectory that follows some of the bumps in the data, but not all of them. The confidence intervals are somewhere between the linear and the categorical case (i.e., uncertainty is medium)
Let's say you want to include age as a predictor in your model. How do you do that?
Here's an illustration of three options -- it's for a paper I'm working on (so if you feel like anything could be tweaked...).
There should be a corner at Home Depot where a guy with a table saw will slice you off custom lengths of hot dog from an infinite hot dog coming out of the wall
05.07.2025 00:15 โ ๐ 4157 ๐ 617 ๐ฌ 87 ๐ 55Adele shattering a glass in her hand
There were two girls at Wawa just now talking about funny movies and one said, โHave you ever seen the movie Office Space? Itโs an old people movie but itโs funnyโ
25.06.2025 01:10 โ ๐ 6911 ๐ 373 ๐ฌ 416 ๐ 62Counterpoint: the ability to chat with an article or literature and find patterns in our own work that perhaps we missed I think has a lot of potential to augment our scientific work
24.06.2025 19:09 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0