Marcel Binz

@marcelbinz.bsky.social

Natural and artificial general intelligence. https://marcelbinz.github.io/

498 Followers  |  545 Following  |  18 Posts  |  Joined: 20.11.2024

Latest posts by marcelbinz.bsky.social on Bluesky

Interdisciplinary approaches connecting cognitive science and machine learning to study and evaluate metacognition are especially welcome.
sites.google.com/view/metacog...

01.10.2025 00:36 | 👍 3    🔁 0    💬 0    📌 0

We are organizing a workshop on Metacognition in Generative AI at @euripsconf.bsky.social in Copenhagen later this year.
Submission deadline for short papers is on October 17th.

01.10.2025 00:36 | 👍 7    🔁 1    💬 1    📌 0

check out our GAC on benchmarks at CCN happening later today!

13.08.2025 09:34 | 👍 4    🔁 0    💬 1    📌 0
Scientists Use A.I. To Mimic the Mind, Warts and All To better understand human cognition, scientists trained a large language model on 10 million psychology experiment questions. It now answers questions much like we do.

What happens when you train AI on psychological experiments? It behaves a lot like a human mind. Here's my story on Centaur, and the debate about what AI has to offer to cognitive science. Gift link nyti.ms/3ZYqXcg 🧪

02.07.2025 15:23 | 👍 77    🔁 12    💬 7    📌 3

Huge thanks to the team and collaborators who made this possible.

02.07.2025 15:33 | 👍 1    🔁 0    💬 0    📌 0
Centaur - a Hugging Face Space by marcelbinz This application generates text based on the input you provide. You can enter a prompt in the text box, and the app will produce a response or continuation. The text should be phrased in natural la...

You can also explore the model via our @hf.co space: huggingface.co/spaces/marce...

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0
Centaur

More information on the project landing page: marcelbinz.github.io/centaur

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0
Automated scientific minimization of regret We introduce automated scientific minimization of regret (ASMR) -- a framework for automated computational cognitive science. Building on the principles of scientific regret minimization, ASMR leverag...

We also present a case study showing how Centaur can support scientific discovery.
An updated version of this approach is available in our new preprint: arxiv.org/abs/2505.17661

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0

Centaur can also be adapted to predict secondary measurements like neural activity and response times -- despite never being trained to do so.

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0

We find that Centaur generalizes to unseen experiments and accurately predicts human behavior under modified cover stories, problem structures, and even in entirely novel domains.

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0

Centaur was trained on Psych-101, a new dataset with trial-by-trial data from 160 psychological experiments, containing over 60,000 participants and 10,000,000 choices.

02.07.2025 15:33 | 👍 1    🔁 0    💬 1    📌 0
A foundation model to predict and capture human cognition - Nature A computational model called Centaur, developed by fine-tuning a language model on a huge dataset called Psych-101, can predict and simulate human behaviour in experiments expressible in natural language...

Paper available at www.nature.com/articles/s41...

02.07.2025 15:33 | 👍 3    🔁 0    💬 2    📌 0

Excited to see our Centaur project out in @nature.com.
TL;DR: Centaur is a computational model that predicts and simulates human behavior for any experiment described in natural language.

02.07.2025 15:33 | 👍 42    🔁 12    💬 6    📌 2
Automated scientific minimization of regret We introduce automated scientific minimization of regret (ASMR) -- a framework for automated computational cognitive science. Building on the principles of scientific regret minimization, ASMR leverag...

New short-form preprint in which we use Centaur to identify gaps in interpretable cognitive models and revise them accordingly using Qwen3 -- fully automated and without a human-in-the-loop.

arxiv.org/abs/2505.17661

01.06.2025 11:32 | 👍 9    🔁 3    💬 0    📌 1
registration | IICCSSS International Interdisciplinary Computational Cognitive Science Summer School

Registration for IICCSSS 2025 in Darmstadt is open! 🥳 Sign up now for a week of exciting talks, hands-on projects and inspiring discussions! www.iiccsss.org/registration/
As always, IICCSSS is free, and open to all students who are excited about computational cognitive science 💡🧠

09.05.2025 20:25 | 👍 4    🔁 3    💬 0    📌 0

#AIinScience: Ethical & Practical Challenges

🎤 Interview with Dr. Marcel Binz, #HelmholtzMunich, on how Large Language Models are transforming scientific methods & the need to reshape the scientific mindset:

👉 t1p.de/p82vj

@marcelbinz.bsky.social @ericschulz.bsky.social @zeynepakata.bsky.social

24.04.2025 09:16 | 👍 4    🔁 2    💬 0    📌 0

Hi Yifei, sadly we don't have any intern positions available right now (and we are in general constrained to hiring interns who are enrolled at German universities).

09.04.2025 19:52 | 👍 0    🔁 0    💬 1    📌 0

We are looking for two PhD students at our institute in Munich.

Both positions are open-topic, so anything between cognitive science and machine learning is possible.

More information: hcai-munich.com/PhDHCAI.pdf

Feel free to share broadly!

09.04.2025 12:11 | 👍 6    🔁 5    💬 1    📌 1

Ever wondered why only some memories 🧠 come easily? Our latest work (osf.io/preprints/ps...) led by S. Haridi, with @ericschulz.bsky.social, shows that targeted memory retrieval speeds up with precise semantic and temporal retrieval cues. Hence, crafting cues can give you instant access to memories ⚡

03.02.2025 13:41 | 👍 12    🔁 7    💬 0    📌 0

In previous work we found that VLMs fall short of human visual cognition. To make them better, we fine-tuned them on visual cognition tasks. We find that while this improves performance on the fine-tuning task, it does not lead to models that generalize to other related tasks:

25.02.2025 10:45 | 👍 8    🔁 5    💬 1    📌 2
Towards Automation of Cognitive Modeling using Large Language Models Computational cognitive models, which formalize theories of cognition, enable researchers to quantify cognitive processes and arbitrate between competing theories by fitting models to behavioral data....

About a month late posting this, but here's a new project with @ericschulz.bsky.social, @akjagadish.bsky.social, @marvinmathony.bsky.social and Tobias Ludwig

We are using LLMs to propose cognitive models in learning and decision making data. Presenting this work at RLDM!

arxiv.org/abs/2502.00879

26.02.2025 10:08 | 👍 21    🔁 8    💬 0    📌 4
Participate Website for CogSci PhD symposium in Tübingen in 2025 on the topic “Understanding context in cognition”, funded by the German Cognitive Science Society

The German Cognitive Science Society is organizing a PhD symposium in Tuebingen in April.
If you are a PhD student in the vicinity, you should definitely register (by February 28th) -- it will be fun!

cogsciprag.github.io/context-in-c...

07.02.2025 15:24 | 👍 3    🔁 2    💬 0    📌 0

In our latest article, published in @pnas.org and led by @marcelbinz.bsky.social and Stephan Alaniz, we got together four diverse groups of scientists to reflect on how LLMs should affect science. From treating them like co-authors to using other tools instead, many interesting arguments emerged.

29.01.2025 09:11 | 👍 13    🔁 6    💬 1    📌 1
GitHub - marcelbinz/Psych-201 Contribute to marcelbinz/Psych-201 development by creating an account on GitHub.

We are currently building the largest, cross-domain data set of human behavior as part of an open collaborative project. Contributions of any form are welcome, but especially experiments with meta-data from developmental, cross-cultural, or clinical studies.

More details: github.com/marcelbinz/P...

27.01.2025 12:40 | 👍 34    🔁 15    💬 2    📌 1
Visual cognition in multimodal large language models - Nature Machine Intelligence Modern vision-based language models face challenges with complex physical interactions, causal reasoning and intuitive psychology. Schulze Buschoff and colleagues demonstrate that while some models ex...

Have we built machines that learn and think like people?
In our new paper, we find that vision large language models still fall short when it comes to cognitive abilities in the domains of causal reasoning, intuitive physics, and theory of mind.

www.nature.com/articles/s42...

15.01.2025 11:50 | 👍 26    🔁 5    💬 1    📌 2
Seminar Series – Centre for Cognition, Computation and Modelling

and mark in your calendars the following dates & speakers:

David Danks, Jan. 7
Dimitri Coelho Mollo, Jan 14
Raphael Milliere, Jan 21
Ben Bergen, Feb 4
David Garcia, Feb 18
Jay McClelland, Mar 4
Chris Summerfield, Mar 18
Marcel Binz, April 1st
Tom Griffiths, April 29
Thomas Icard, May 13

17.12.2024 09:39 | 👍 1    🔁 2    💬 0    📌 0

Preprint alert! We explore 3 exploration tasks, testing if they measure a stable construct & its link to real-world exploration. We find improved robustness of latent factors compared to single-task estimates.
With Mirko Thalmann & @ericschulz.bsky.social
🔗 https://osf.io/preprints/psyarxiv/tzuey

10.12.2024 09:21 | 👍 29    🔁 12    💬 2    📌 1

If you are at NeurIPS and interested in human alignment, representations, or cognitive modeling, don't miss Can's poster tomorrow!

10.12.2024 15:43 | 👍 7    🔁 1    💬 0    📌 0