π¨Our preprint is online!π¨
www.biorxiv.org/content/10.1...
How do #dopamine neurons perform the key calculations in reinforcement #learning?
Read on to find out more! π§΅
@mhburrell.bsky.social
π¨Our preprint is online!π¨
www.biorxiv.org/content/10.1...
How do #dopamine neurons perform the key calculations in reinforcement #learning?
Read on to find out more! π§΅
Read more about it and see our smiling faces: https://www.mcb.harvard.edu/department/news/decoding-learning-how-cues-and-rewards-shape-behavior-and-dopamine-signals/ @harvardmcb.bsky.social (12/12)
18.03.2025 15:04 β π 6 π 1 π¬ 0 π 0This work is a product of a tremendous team: Lechen (Selina) Qian, Jay A Hennig (@jhennig.bsky.social), Sara Matias (@saramatias.bsky.social), Venki Murthy (@neurovenki.bsky.social), Sam Gershman (@gershbrain.bsky.social) and Naoshige Uchida (@naoshigeuchida.bsky.social)
(11/12)
We thank the reviewers for their comments, who helped us refine our explanations of the various models and why they succeed or fail.
(10/12)
In total, we show how TD learning can be used as a comprehensive explanation of the effects of contingency on associative learning and discuss how this guides our future study of how the brain learns causality. (9/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0Finally, we showed a novel model that relies on retrospective contingency, ANCCR (doi:10.1126/science.abq6740), does not explain our results under any parameter combination. (8/12)
18.03.2025 15:04 β π 3 π 0 π¬ 1 π 0Moreover, working with @jhennig.bsky.social, we showed small RNNS develop similar state space representations and explain our results in the same manner. (7/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0We sought to identify a TD model to explain these changes. While several classic implementations of TD did not working (e.g. CSC & microstimuli), with a state representation that incorporated the animalβs learned knowledge of the task structure, TD was able to explain all our results. (6/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0Another group, received the same increase in the number of rewards, but new rewards were preceded by a novel cue. This important control reveals that a classic definition of contingency βπ does not adequately describe the pattern of changes in dopamine and behavior. (5/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0We performed a Pavlovian contingency degradation task to examine how behavior and dopamine activities are modulated in contingency learning. In this task, mice were first trained in a simple conditioning task. Then one group of mice received both cued and uncued rewards. (4/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0Contingency, the degree to which a stimulus predicts an outcome, is a critical factor in shaping animal behavior during associative learning. But the neural mechanisms linking contingency to behavior have been elusive. (3/12)
18.03.2025 15:04 β π 2 π 0 π¬ 1 π 0In short, we found that we can explain the effects of contingency on both an animalβs behavior and ventral striatum dopamine response using temporal difference learning when equipped with appropriate state space representations. (2/12)
18.03.2025 15:04 β π 3 π 0 π¬ 1 π 0Iβm happy to share our latest work (co-lead by Selina Qian) has today been published in its final form in @:natureneuro.bsky.social: Read here: https://www.nature.com/articles/s41593-025-01915-4
(1/12)