We spent months training grad student RAs and GPT-5 mini still beat them by a lot
11.02.2026 17:12
@ef110econ.bsky.social
Applying behavioural science to help make people happier, healthier, and wealthier with @irrationallabs. All views are personal. Retweets are not endorsements.
We have a new pre-print!
We find that conversing with a disagreeing LLM helped improve people's inaccurate predictions!
osf.io/preprints/ps...
Let me tell you all about it:
Yeah no shade your way. I retweet in the same way.
11.02.2026 17:17
Given your other posts lately on blue Monday and stuff, it was interesting to see this without the caveat of "hey this may or may not be a real thing".
But yeah realistically it's just me reading another blog post at 4am while watching the baby and getting cranky.
As I said originally, it doesn't seem crazy that there could be some small effect. But this post is not evidence for one in my view.
11.02.2026 13:42
More thoughts: 6) no statistical tests here. 7) no time series modelling. 8) effect for any actual business would be tiny!! 9) February Mondays include Presidents' Day in the US, how is that accounted for? 10) no comparison to other holiday Mondays. 11) no comparison to other Mondays for last minute PTO.
11.02.2026 13:42
I feel so many ways about this. 1) seems highly believable. 2) seems like one of those cutesy findings that with more poking into data could disappear. 3) this company only covers 34,000 very non-representative companies. 4) effect size is small. 5) this company wants a splashy marketing story
11.02.2026 13:33
Introducing "Pretend Battleship": you're told where all the ships are but then have to play like you never got that information. Could you do it? And what would your performance reveal about your understanding of your own mind? A joy to be part of this creative project led by @matanmazor.bsky.social
10.02.2026 20:40
Large Language Models have shown both remarkable reasoning ability and significant reasoning failures.
Research by @psong1.bsky.social et al has categorized this phenomenon in a clear taxonomy to explore how and why the performance is so variable:
buff.ly/hI7DYYN
Thrilled to share our latest paper, out now in Science Advances! We explored the development of cooperative behaviors – fairness, trustworthiness, forgiveness, & honesty – across five societies, culturally contextualizing them & seeing how they correlate. (1/5) www.science.org/doi/full/10....
07.02.2026 15:09
Looks very promising! Thanks for sharing it here
06.02.2026 05:08
Forthcoming in AER: Insights: "The Lasting Effects of Working while in School: A Long-Term Follow-Up" by Mery Ferrando, Noemí Katzkowicz, Thomas Le Barbanchon, and Diego Ubfal.
30.01.2026 09:52
Users of survey data, lovers of DAGs, and general methodological enthusiasts, gather round!
I'm so excited to share this new paper, joint work with my brilliant colleagues @rjsilverwood.bsky.social, @pwgtennant.bsky.social, and Liam Wright.
🧵
Very important work
19.01.2026 17:06
🥳 Stoked to share that our paper with Christina Strobel, "A Taxonomy of Al Experiments," has been accepted at the Journal of Behavioral and Experimental Economics!
Huge thanks to Christina for collab and to the reviewers for extremely helpful feedback!
papers.ssrn.com/sol3/papers....
#EconSky
Forthcoming in AEJ: Economic Policy: "When the Effects of Informational Interventions Are Driven by Salience – Evidence from School Parents in Brazil" by Guilherme Lichand, Nina Cunha, Ricardo A. Madeira, and Eric Bettinger. www.aeaweb.org/articles?id=...
14.01.2026 14:37
I need everyone in the political communications world who found the moral foundations reframing approach promising (which includes me) to read this paper
It doesn't replicate in new research. It just doesn't work.
www.tandfonline.com/doi/full/10....
Excited to announce that this year's Advances with Field Experiments conference will take place at the University of Chicago on September 17-18, 2026.
@johnlist.bsky.social and I will send out a call for abstracts early in the Spring.
bfi.uchicago.edu/events/event...
@katymilkman.bsky.social
Spotify limit old music so you listen to new things
09.01.2026 16:01
The AEA has posted eight "Recent Developments" lectures exploring highly topical issues in economics, presented by the best scholars in the field:
www.aeaweb.org/conference/w...
Well worth a watch!
"Survey incentives can worsen bias! Randomized incentives help detect & account for nonresponse bias. Methods using both incentives & reminders outperform existing approaches."
New paper by Dutz, Huitfeldt, Lacouture, Mogstad, Torgovitsky & van Dijk
www.restud.com/selection-in...
#EconSky #REStud
Published @cp-trendscognsci.bsky.social with @drewlinsley.bsky.social & @tonyfeng.bsky.social: As vision models scale to human/superhuman accuracy, they're becoming worse models of primate vision – benchmark engineering isn't neuroscience. @carneyinstitute.bsky.social @browncopsy.bsky.social
05.01.2026 15:33
Had an interesting, hard interview with @adamconover.net on his podcast. I think he is a great example of a smart AI skeptic.
My main messages were that AI is a really big deal, it has good & bad impacts, and that, by sitting things out, skeptics can't guide use. open.spotify.com/episode/5cFK...
This is a very special conference which you should absolutely consider attending if your research is at the intersection of behavioral science and AI
See you in Berlin!
Non-paywalled version:
osf.io/preprints/ps...
Thrilled to announce the Handbook of Computational Social Science is officially out! 956 pages, 118 authors, and truly global, interdisciplinary perspectives. Deep thanks to the contributors and anonymous reviewers who shaped this over 4 years. Buy your copy now!
@elgarpublishing.bsky.social
Post: metr.org/blog/2025-03...
20.12.2025 02:48
There will soon be proctoring center franchises in which people will take tests, do job interviews, and complete paid surveys without access to AI.
They'll be strategically placed so that most people will live within a short distance of a proctoring center.
WTP: Epistemic, Behavioral, and Philosophical Challenges.
papers.ssrn.com/sol3/papers....
🚨 Now out in Psych Science 🚨
We report an adversarial collaboration (with @donandrewmoore.bsky.social) testing whether overconfidence is genuinely a trait
The paper was led by Jabin Binnendyk & Sophia Li (who is fantastic and on the job market!) Free copy here: journals.sagepub.com/eprint/7JIYS...