π₯ New position piece! π₯ In this paper we lay out our vision for AI Alignment as guided by "Resource Rational Contractualism" (RRC).
But wait -- what's that? A π§΅.
π₯ New position piece! π₯ In this paper we lay out our vision for AI Alignment as guided by "Resource Rational Contractualism" (RRC).
But wait -- what's that? A π§΅.
π¨New paper! We know models learn distinct in-context learning strategies, but *why*? Why generalize instead of memorize to lower loss? And why is generalization transient?
Our work explains this & *predicts Transformer behavior throughout training* without its weights! π§΅
1/
Excited to share a new CogSci paper co-led with @benpry.bsky.social!
Once a cornerstone for studying human reasoning, the think-aloud method declined in popularity as manual coding limited its scale. We introduce a method to automate analysis of verbal reports and scale think-aloud studies. (1/8)π§΅
π€π€Most AI systems assume thereβs just one right answerβbut many tasks have reasonable disagreement. How can we better model human variation? πβ¨
We propose modeling at the individual-level using open-ended, textual value profiles! π£οΈπ
arxiv.org/abs/2503.15484
1/13 New Paper!! We try to understand why some LMs self-improve their reasoning while others hit a wall. The key? Cognitive behaviors! Read our paper on how the right cognitive behaviors can make all the difference in a model's ability to improve with RL! π§΅
04.03.2025 18:15 β π 57 π 17 π¬ 2 π 3There are also many papers on information diffusionβ¦..
26.11.2024 20:30 β π 8 π 0 π¬ 0 π 0Noah way!
23.11.2024 21:44 β π 4 π 0 π¬ 0 π 0So whereβs your pack of AI researchers that defy categorization?
23.11.2024 21:01 β π 5 π 0 π¬ 2 π 0But @roydanroy.bsky.social where do I belong?? A crisis of affective self theory.
23.11.2024 16:54 β π 2 π 0 π¬ 1 π 0@tomerullman.bsky.social I made one for your lab!
21.11.2023 14:35 β π 15 π 0 π¬ 0 π 0This seems like a good first post: what happens to psychology in the LLM future? I ponderβ¦ open.substack.com/pub/noahgood...
13.11.2023 23:07 β π 48 π 11 π¬ 0 π 4