This is one of the most outstanding examples of circuit understanding I've seen in a long time. The unification of theory and experiment is beautiful.
When Malcolm presented this in my lab, the audience was cheering at the end, and one person shouted (non-ironically) "You did it!"
19.09.2025 13:37 β π 105 π 21 π¬ 4 π 0
Now out in Cognition, work with the great @gershbrain.bsky.social @tobigerstenberg.bsky.social on formalizing self-handicapping as rational signaling!
π authors.elsevier.com/a/1lo8f2Hx2-...
19.09.2025 03:46 β π 33 π 13 π¬ 1 π 1
Interesting! Weβre trying to figure out _why_ LLMs donβt quite rely on counterfactual reasoning when judging responsibility. It could beβas you suggestedβthat theyβre worse at counterfactual simulations, or that they simply donβt think counterfactuals are relevant here. Excited to dig further π
24.07.2025 12:07 β π 1 π 0 π¬ 1 π 0
Come by our poster at CogSci (Poster Session 2, P2-X-215), Friday 8/1 at 10:30am!
24.07.2025 00:23 β π 1 π 0 π¬ 0 π 0
Our results shed light on how we can make LLMs more human-like and how to study the mechanisms underlying complex behavior in LLMs. Co-led by me and @ebig.bsky.social, with the great @tobigerstenberg.bsky.social @tomerullman.bsky.social @gershbrain.bsky.social (4/4)
24.07.2025 00:23 β π 1 π 0 π¬ 2 π 0
LLM and human data are highly correlated, BUT they are best explained by different factors! LLMs evaluate collaborators based on force (how much output they contribute), whereas humans evaluate collaborators based on their actual and counterfactual eο¬ort. (3/4)
24.07.2025 00:20 β π 4 π 0 π¬ 1 π 1
We adapted materials from human studies on responsibility and reward attributions and compared LLMsβ responses to human data and seven cognitive models. (2/4)
24.07.2025 00:20 β π 3 π 0 π¬ 1 π 0
Our latest on the cognitive science of LLMs! To be presented @CogSciβ¬2025 π
LLMs are increasingly involved in human collaborations. How do LLMs assign responsibility and reward to collaborators? Is it similar to how humans do it? π€π§
π gershmanlab.com/pubs/XiangBi... (1/4)
24.07.2025 00:19 β π 38 π 9 π¬ 2 π 1
OSF
@gershbrain.bsky.social, @yangxiang.bsky.social, and I have a new project out in preprint form!
osf.io/preprints/ps...
Here are the main takeaways: (1/6)
23.03.2025 00:59 β π 13 π 4 π¬ 4 π 0
OSF
By offering a systematic explanation of self-handicapping, we hope to lay the groundwork for developing effective interventions that target academic self-handicapping, helping people to realize their full potential. A preprint of the paper is available on PsyArxiv: osf.io/preprints/ps... (5/5)
25.11.2024 03:26 β π 1 π 0 π¬ 0 π 0
We tested the theory's predictions in two experiments, showing that self-handicapping occurs more often when itβs unlikely to affect the outcome and when it increases a naive observer's perceived competence. With sophisticated observers, itβs less effective when followed by failure. (4/5)
25.11.2024 03:25 β π 0 π 0 π¬ 0 π 0
Theory schematic
We developed a signaling theory of self-handicapping, involving a naive observer who evaluates the actorβs competence, an actor who seeks to impress the naive observer through strategic self-handicapping, and a sophisticated observer who considers the actorβs decision whether to self-handicap. (3/5)
25.11.2024 03:24 β π 0 π 0 π¬ 0 π 0
Self-handicapping is a strategy where people deliberately impede their performance to protect perceived competence in case of failure, or enhance it in case of success. Despite much prior research, it is unclear why, when, and how self-handicapping occurs. (2/5)
25.11.2024 03:23 β π 0 π 0 π¬ 0 π 0
Really excited about this project, and thanks so much to my wonderful collaborators @gershbrain.bsky.social @tobigerstenberg.bsky.social for making this happen! Some main takeaways in thread π§΅ (1/5)
25.11.2024 03:22 β π 13 π 1 π¬ 4 π 1
Prof. of Computational Cognitive Science at TU Darmstadt & PI of the Human and Machine Cognition lab at the University of TΓΌbingen | hmc-lab.com
Inspired by Cognitive Science and Philosophy.
https://www.juniorokoroafor.com/
Cognitive science PhD student at Stanford, studying iterated learning and reasoning.
Cognitive Neuroscientist @ Harvard, AI Researcher @ Motional
Models of human & robot decision making in complex environments, including video games and urban driving. https://www.momchiltomov.com/
Postdoc in the Kuhl Lab at the University of Oregon, PhD from UT Austin. Episodic Memory | Computational Neuroscience | Cognitive Neuroscience | Machine Learning. πΏ -> π -> π -> π¦, he/him
soroushmirjalili.com
Associate Prof @warwickpsych.bsky.social . Works on language, prediction, joint action and is curious about curiosity. She/her
phd student @ uc irvine cog sci w/ Megan Peters.
π§ structure learning, metacognition, perception, comp cog neuro.
https://tinyurl.com/rochellekaper
she/her
PhD student @University of Washington #huskypsych| Formerly @stanfordpsych & @VanderbiltU | Driven by a thirst for knowledge and kept in check by my catπββ¬ | She/Her
Cognitive scientist studying the development of the social mind. Assistant professor at UCSB. π¨π¦π³οΈβπ (he/him)
bmwoo.github.io
Studying multi-agent collaboration π€π§©π€
PhD Candidate at Princeton CS with Tom Griffiths & Natalia VΓ©lez @cocoscilab.bsky.social @velezcolab.bsky.social
Prev: Cornell CS, MIT BCS
Philosophy Professor at the University of Puget Sound. Working on philosophy of mind, AI, science.
Postdoctoral research associate at UCL
https://tianweigong.github.io/
Cognitive scientist at the University of Edinburgh. Causality, computation, evolution.
Lab: https://quillienlab.github.io/
cognitive scientist. postdoc at center for humans and machines, MPI for human development, Berlin. Interested in moral psychology, human-AI interaction, (experimental) philosophy and other things.
neeleengelmann.com
Cognitive development researcher at OSU, foodie, music lover/deadhead, techie, and concerned about the nation. He/him
https://scholar.google.com/citations?user=XEkYhiAAAAAJ&hl=en
https://u.osu.edu/madlab
CogSci, Philosophy & AI, Postdoc at Max Planck Institute Berlin.
Associate Professor, Dept of Psychology, UC Berkeley.
PI of @shenhavlab.bsky.social
https://www.shenhavlab.org/
Computational cognitive neuroscience. Perception, representation, inference and decision-making. Postdoc at Harvard with Sam Gershman.
Cognitive scientist working at the intersection of moral cognition and AI safety. Currently: Google Deepmind. Soon: Assistant Prof at NYU Psychology. More at sites.google.com/site/sydneymlevine.