thanks a lot Martin!
05.10.2025 16:41 β π 0 π 0 π¬ 0 π 0@alexeykoshevoy.bsky.social
PhD student at Laboratoire de Psychologie Cognitive, AMU & Institut Jean Nicod, ENS-PSL | Interested in how communication shapes languages https://alexeykosh.github.io
thanks a lot Martin!
05.10.2025 16:41 β π 0 π 0 π¬ 0 π 0thanks a lot Oleg!!
30.09.2025 13:09 β π 1 π 0 π¬ 0 π 0thanks Mason!!
26.09.2025 11:13 β π 0 π 0 π¬ 0 π 0thanks a lot Natalia!
25.09.2025 23:07 β π 0 π 0 π¬ 0 π 0thanks Jakub! everyone has enjoyed our paper
25.09.2025 17:07 β π 2 π 0 π¬ 1 π 0Huge congratulations to @alexeykoshevoy.bsky.social l who defended his PhD thesis today!! with his co-supervisor @sblldtrch.bsky.social and jury members @simonkirby.bsky.social @gboleda.bsky.social Paula Rubio Fernandez & Benjamin Spector.
25.09.2025 16:42 β π 17 π 2 π¬ 5 π 0We present our new preprint titled "Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation". We quantify LLM hacking risk through systematic replication of 37 diverse computational social science annotation tasks. For these tasks, we use a combined set of 2,361 realistic hypotheses that researchers might test using these annotations. Then, we collect 13 million LLM annotations across plausible LLM configurations. These annotations feed into 1.4 million regressions testing the hypotheses. For a hypothesis with no true effect (ground truth $p > 0.05$), different LLM configurations yield conflicting conclusions. Checkmarks indicate correct statistical conclusions matching ground truth; crosses indicate LLM hacking -- incorrect conclusions due to annotation errors. Across all experiments, LLM hacking occurs in 31-50\% of cases even with highly capable models. Since minor configuration changes can flip scientific conclusions, from correct to incorrect, LLM hacking can be exploited to present anything as statistically significant.
π¨ New paper alert π¨ Using LLMs as data annotators, you can produce any scientific result you want. We call this **LLM Hacking**.
Paper: arxiv.org/pdf/2509.08825
Image of labubu doll labeled labubu next to image of spiky labubu doll labeled lakiki
::slowly stands while clapping::
10.09.2025 23:07 β π 1014 π 257 π¬ 6 π 11Congrats, well deserved!
04.09.2025 14:56 β π 1 π 0 π¬ 0 π 0New paper! β‘ With Gabriel Aguirre and Marcelo SΓ‘nchez, looking at patterns of blowgun types and use across societies of the world. We find areal patterns, similarities mediated by cultural connections, and specific types characterizing distinct branches of the Austronesian language tree. π―
27.08.2025 21:37 β π 22 π 8 π¬ 0 π 0I am on a 6 hour train journey without air conditioning, but itβs worth it because I am heading to #SLE2025! This is my first linguistics conference in a while.
25.08.2025 14:32 β π 5 π 0 π¬ 0 π 0Models as Prediction Machines: How to Convert Confusing Coefficients into Clear Quantities Abstract Psychological researchers usually make sense of regression models by interpreting coefficient estimates directly. This works well enough for simple linear models, but is more challenging for more complex models with, for example, categorical variables, interactions, non-linearities, and hierarchical structures. Here, we introduce an alternative approach to making sense of statistical models. The central idea is to abstract away from the mechanics of estimation, and to treat models as βcounterfactual prediction machines,β which are subsequently queried to estimate quantities and conduct tests that matter substantively. This workflow is model-agnostic; it can be applied in a consistent fashion to draw causal or descriptive inference from a wide range of models. We illustrate how to implement this workflow with the marginaleffects package, which supports over 100 different classes of models in R and Python, and present two worked examples. These examples show how the workflow can be applied across designs (e.g., observational study, randomized experiment) to answer different research questions (e.g., associations, causal effects, effect heterogeneity) while facing various challenges (e.g., controlling for confounders in a flexible manner, modelling ordinal outcomes, and interpreting non-linear models).
Figure illustrating model predictions. On the X-axis the predictor, annual gross income in Euro. On the Y-axis the outcome, predicted life satisfaction. A solid line marks the curve of predictions on which individual data points are marked as model-implied outcomes at incomes of interest. Comparing two such predictions gives us a comparison. We can also fit a tangent to the line of predictions, which illustrates the slope at any given point of the curve.
A figure illustrating various ways to include age as a predictor in a model. On the x-axis age (predictor), on the y-axis the outcome (model-implied importance of friends, including confidence intervals). Illustrated are 1. age as a categorical predictor, resultings in the predictions bouncing around a lot with wide confidence intervals 2. age as a linear predictor, which forces a straight line through the data points that has a very tight confidence band and 3. age splines, which lies somewhere in between as it smoothly follows the data but has more uncertainty than the straight line.
Ever stared at a table of regression coefficients & wondered what you're doing with your life?
Very excited to share this gentle introduction to another way of making sense of statistical models (w @vincentab.bsky.social)
Preprint: doi.org/10.31234/osf...
Website: j-rohrer.github.io/marginal-psy...
Congrats!
07.07.2025 14:59 β π 1 π 0 π¬ 0 π 0Experimentology cover: title and curves for distributions.
Experimentology is out today!!! A group of us wrote a free online textbook for experimental methods, available at experimentology.io - the idea was to integrate open science into all aspects of the experimental workflow from planning to design, analysis, and writing.
01.07.2025 18:25 β π 534 π 228 π¬ 10 π 15Want to easily scrape data from news media sites?
There's an R package for that!
paperboy
"paperboy offers writers of web scrap[ers] a clear path to publish their code & earn co-authorship on the package, while deliver[ing] news media data from many websites in a consistent format."
For some reason bluesky doesnβt work on Firefox anymore, event after I updated it. Is it the case for anyone else?
11.06.2025 09:50 β π 0 π 0 π¬ 0 π 0:: (not really) Trigger warning::
Stay in your lane, or pay a career penalty
www.nature.com/articles/s41...
Language depends on copying (e.g. of words, signs). And language in turn is needed for many other things.
When and why did our ancestors gain this ability to copy? Our (Ron, Elisa & me) archaeological reanalysis says: in the last million years. Just out:
dx.doi.org/10.1093/oxfo...
π¨New publication alert!
"Corpus-based approaches to evolutionary dynamics in language" (w/ @stefanhartmann.bsky.social) out now in the in the Oxford Handbook of Approaches to Language Evolution. Kudos to @limorraviv.bsky.social & @cedricboeckx.bsky.social for putting this great volume together!
you know all gen 1 pokΓ©mon, of course you could.
reminded of brandmaier.github.io/ggx/
How are humans able to make sense of time? Not with special biology but with βtime toolsββideas, practices, and artifacts that render time more concrete.
My new paper explores this vast, varied toolkitβone that makes use of knots, nuts, hands, flowers, mountains, shadows, and much more.
(link π)
Out now in @pnas.org! πΉIs a rose by any other name still as roselike?πΉ
We study the prevalence of iconicity (does a word look/sound like what it means?) and systematicity (are pronunciation/meaning relationships shared across multiple words?) in large datasets of ASL, English, and Spanish.
π§΅1/N
Knowledge transmission, culture and the consequences of social disruption in wild elephants royalsocietypublishing.org/doi/10.1098/...
05.05.2025 12:28 β π 17 π 11 π¬ 0 π 1"Emissions linked to Chevron, the highest-emitting investor-owned company in our data, for example, very likely caused between US $791βbillion and $3.6βtrillion in heat-related losses over the period 1991β2020"
www.nature.com/articles/s41...
Apply for our PhD position in language acquisition / computational linguistics in Groningen until 24 April! Job ad is here:
www.rug.nl/about-ug/wor...
Psych-DS is (1) spellcheck for your datasets and (2) a pathway to standardizing data in our academic fields that *everyone* can learn.
And it's live RIGHT NOW!
psych-ds.github.io
(This is the announcement post I've been leading up to)
This looks useful -- A Dataset on Linguistic Connectivity Across and Within Countries
#linguistics #languages
www.nature.com/articles/s41...
New paper on misperceptions out in PNAS @pnas.org
www.pnas.org/doi/10.1073/...
Why do people overestimate the size of politically relevant groups (immigrant, LGBTQ, Jewish) and quantities (% of budget spent on foreign aid, % of refugees that are criminals)?π§΅π
Game changing study in @science.org by @berthetmelissa.bsky.social and co.
www.science.org/doi/10.1126/...