π₯³ I am incredibly humbled and grateful to share that our work, "Aligning machine and human visual representations across abstraction levels," has been published today in @nature.com β¬οΈ
12.11.2025 17:05 β π 21 π 5 π¬ 0 π 0π₯³ I am incredibly humbled and grateful to share that our work, "Aligning machine and human visual representations across abstraction levels," has been published today in @nature.com β¬οΈ
12.11.2025 17:05 β π 21 π 5 π¬ 0 π 0No two humans behave exactly alike. But what about neural networks? We found early evidence that human-like individual differences in behavior emerge from networks trained with different initializations. Hereβs a peek at our resultsβto be presented at UniReps & DBM @NeurIPS. Full paper on the way!
26.10.2025 23:39 β π 11 π 3 π¬ 2 π 1
Presenting our #NeurIPS2025 work on modelβbehavior alignment today.
Could we even recognize the βrightβ model of behavior under flexible evaluation?
Come chat about DNNs & human visual preception!
Hall C-E #2010
Friday (today!) 4:30 β 7:30 PM
neurips.cc/virtual/2025...
Kudos to our NeurIPS 2025 reviewers for thoughtful, human-generated reviews. Iβll be presenting poster #2010 in San Diego on Fri, 5 Dec from 4:30β7:30 p.m. PT. Come say hi!
arXiv : arxiv.org/abs/2510.23321
Code and data: github.com/brainsandmachines/oddoneout_model_recovery
Our work reveals a sharp trade-off between predictive accuracy and model identifiability. Flexible mappings maximize predictivity, but blur the distinction between competing computational hypotheses.
20.11.2025 14:05 β π 3 π 1 π¬ 1 π 0Further analyses showed that linear probing was the culprit. The linear fit warps each model's original feature space, erasing its unique signature and making all aligned models converge toward a human-like representation.
20.11.2025 14:05 β π 3 π 0 π¬ 1 π 0The key dependent measure is how often the data-generating model actually achieves the highest prediction accuracy. The surprising result: even with massive datasets (millions of trials), the best-performing model is often not the right one.
20.11.2025 14:05 β π 1 π 0 π¬ 1 π 0Each simulation worked like this: (1) pick one model from 20 candidate NNs and fit it to human responses; (2) sample a synthetic dataset from that model using NEW triplets; (3) test all 20 models on this generated data, measuring cross-validated prediction accuracy.
20.11.2025 14:05 β π 1 π 0 π¬ 1 π 0We ran model recovery simulations using models fitted to the massive THINGS odd-one-out data shared by @martinhebart.bsky.social , @cibaker.bsky.social et al. Each simulation tested whether a neural network model would βwinβ the model comparison if it had generated the behavioral data.
20.11.2025 14:05 β π 1 π 0 π¬ 1 π 0In our new NeurIPS 2025 paper, we ask: does better predictive accuracy necessarily mean better mechanistic correspondence between neural networks and human representations? neurips.cc/virtual/2025...
20.11.2025 14:05 β π 3 π 0 π¬ 1 π 0They also showed that if we nudge the NN representations toward human judgments by linearly transforming the representation space itself crossvalidated prediction accuracy is boosted almost to the reliability bound. arxiv.org/abs/2211.01201
20.11.2025 14:05 β π 2 π 0 π¬ 1 π 0@lukasmut.bsky.social , @lorenzlinhardt.bsky.social et al, showed that neural network representations can be strong predictors of human odd-one-out judgments: the image humans select as βoddβ among three is often the one whose activation pattern differs most from the other two.
20.11.2025 14:05 β π 3 π 0 π¬ 1 π 0Excited to share my first paper: ModelβBehavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isnβt the Right One (NeurIPS 2025). link below.
20.11.2025 14:05 β π 18 π 4 π¬ 1 π 2