Many errors remain to be found in clinical trials. Patients deserve reliable results. Kudos to these authors for their persistent work to correct the record.
02.12.2025 10:12
@jamiecummins.bsky.social
Currently a visiting researcher at Uni of Oxford. Normally at Uni of Bern. Meta-scientist building tools to help other scientists. NLP, simulation, & LLMs. Creator and developer of RegCheck (https://regcheck.app). 1/4 of @error.reviews. 🇮🇪
A thread about being wrong:
5 years ago, we wrote a paper about how newly enfranchised 16-year-olds vote in Austria. But we were wrong.
This year, @elisabethgraf.bsky.social, @schnizzl.bsky.social, Sylvia Kritzinger and I are setting the record straight: authors.elsevier.com/c/1juT5xRaZk...
"In 2019 we notified journals about serious integrity concerns in 172 clinical trials. Over five years later, only 22 have been retracted. The 135 unretracted trials have 1989 citations in systematic reviews, clinical guidelines, and consensus statements"
[paraphrased]
www.bmj.com/content/390/...
If you'd like to learn more about how OpenSAFELY works - and how we solved the privacy and efficiency challenges, to make national GP data securely accessible - here's a 5 minute video!
www.youtube.com/watch?v=GRjR...
🚨 SynthNet is out 🚨
Researchers propose new constructs and measures faster than anyone can track. We (@anniria.bsky.social @ruben.the100.ci) built a search engine to check what already exists and help identify redundancies; indexing 74,000 scales from ~31,500 instruments in APA PsycTests. 🧵 1/3
I was thrilled to have been invited by @sakshighai.bsky.social to speak to folk at LSE on Wednesday about methodological and inferential issues that have cropped up in social science attempts to study large language models!
28.11.2025 13:00
Congratulations to @simine.com for winning the Einstein Foundation Individual Award!
A well-deserved recognition for her seminal efforts to improve scientific rigor, which include instituting detailed checks for errors and computational reproducibility at Psychological Science.
Individual: @simine.com, psychologist at @unimelb.bsky.social & editor-in-chief of Psychological Science, is recognized for pioneering methodological rigor, reproducibility & collaborative research, driving initiatives such as @improvingpsych.org & the journal Collabra @ucpress.bsky.social. (2/5)
24.11.2025 09:59
The speaker at the lectern
Title slide
Next: Jack Wilkinson @jdwilko.bsky.social with 'Problematic clinical trials and the threat to evidence synthesis'
Systematic reviews are considered the cornerstone of medicine. But some of the eligible trials might be problematic, and they could still get included.
#IRICSydney
I think this is an overly pessimistic take from the @bmj.com.
Sharing data does not inherently increase trust; rather, it enables verification, which allows for trust calibration.
This example is a win. Serious issues were rapidly detected that would not have been without mandatory data sharing.
With every LLM since GPT-4, I've tried a game: ask it to commit a 20 Questions guess to a cipher, we play 20 Questions, and then we see if what it claims to have been its original choice is consistent with its cipher.
ChatGPT-5.1 Thinking is the first model to do this successfully!
Synchronous Robustness Reports could explore implications of different analytical choices, but they could still suffer from bias. Hardwicke argues that preregistration is crucial to prevent it.
@tomhardwicke.bsky.social
Are methodological and causal inference errors creating a false impression that the gut microbiome causes autism? In this strong analysis, Mitchell, Dahly, and Bishop question the evidence.
They show that triangulation in science requires multiple robust lines of research.
Yes, like a Netflix documentary included IN EVERY SOCIAL PSYCHOLOGY TEXTBOOK
13.11.2025 16:11
There is a lot of fuss today over whether chatbots can replace human participants in social sciences research when the solution is obvious: ask chatbots to simulate the views of social scientists and survey them on attitudes towards chatbots as substitutes for human subjects.
10.11.2025 22:45
Delighted to support MU Psych Soc's invited lecture on Forensic Metascience by departmental alum, Dr Jamie Cummins @jamiecummins.bsky.social whose work in this area seeks to enhance rigour & accuracy in scientific reporting.
Sincere thanks to Dr Cummins. #MUPsychologyAt25
Super interesting, looking forward to reading this later. You may find this of interest: arxiv.org/abs/2509.13397
07.11.2025 11:20
LLMs are now widely used in social science as stand-ins for humans, assuming they can produce realistic, human-like text
But... can they? We don't actually know.
In our new study, we develop a Computational Turing Test.
And our findings are striking:
LLMs may be far less human-like than we think. 🧵
It was such an honour and privilege to be back at my alma mater 9 years (!!!) after finishing my undergraduate degree to give a talk as part of the psych department's 25-year anniversary!
07.11.2025 10:58
Lovely to welcome back Dr @jamiecummins.bsky.social for tonight's @mupsychology.bsky.social talk as part of our #MUpsychologyAt25 events @maynoothuniversity.ie
06.11.2025 18:48
My master's thesis file name on my old university's thesis archive site still makes me chuckle.
30.10.2025 12:21
example #2345432 that nobody really knows what they mean by "AI"
30.10.2025 12:21
This year Demis Hassabis predicted AI could cure all disease in a decade.
But other scientists like Claus Wilke & Derek Lowe say biology is far more complex, or progress will be limited by clinical trials & economics.
In a new 4hr podcast episode of *Hard Drugs*, we answer: Will AI solve medicine?
Me with some garden hoses connected in an X -> Z <- Y fashion. If I shut the valve at Z, water from X spills out at Y
I built a DAG diagram with garden hoses for teaching.
Pictured: a collider bias diagram, inspired by a blocked pipe situation I experienced (which I credit with giving me the intuition though it also ruined my belongings in the flooded cellar).
"Traumatized Mr. Incredible" meme with "Data and code available", "After looking at data & code"
27.10.2025 15:15
The 2011 Presidential Debate where Sean Gallagher loses the election
part 1 #aras25
Can AI simulations of human research participants advance cognitive science? In @cp-trendscognsci.bsky.social, @lmesseri.bsky.social & I analyze this vision. We show how "AI Surrogates" entrench practices that limit the generalizability of cognitive science while aspiring to do the opposite. 1/
21.10.2025 20:24
New hobby:
Remaking article abstracts as movie trailers to expose hype and fearmongering.
"Silicon samples" - using LLMs to generate fake survey responses instead of recruiting humans. Sounds efficient until you realize small model tweaks completely flip your results. Shortcuts in research usually aren't.
09.10.2025 13:05
Psychologists running empirical studies to rediscover engineering design choices is such a strange genre of papers. By all means, run studies on LLM judgments -- but what else than lexical co-occurrence and statistical priors would they be based on??
17.10.2025 10:59