Stefan F. Schouten @stefanfs.me

Latest posts by stefanfs.me on Bluesky

📢 Our paper on 'Truth-value Judgment in LLMs' was accepted to @colmweb.org #COLM2025!

In this paper, we investigate how LLMs keep track of the truth of sentences when reasoning.

14.07.2025 14:54 — 👍 4 🔁 1 💬 1 📌 0

Truth-value judgment in language models: 'truth directions' are context sensitive Recent work has demonstrated that the latent spaces of large language models (LLMs) contain directions predictive of the truth of sentences. Multiple methods recover such directions and build probes t...

📄 Paper: arxiv.org/abs/2404.18865

Thanks to my coauthors @pbloem.sigmoid.social.ap.brid.gy, @ilia-markov.bsky.social and Piek Vossen

14.07.2025 14:54 — 👍 0 🔁 1 💬 1 📌 0

Finally, when intervening on hidden states, we find that the truth-value directions identified are causal mediators in the inference process.

14.07.2025 14:54 — 👍 0 🔁 0 💬 1 📌 0

Even directions identified from single sentences show some sensitivity to the context, but sensitivity increases when probes are based on examples where sentences appear in inferential contexts.

14.07.2025 14:54 — 👍 0 🔁 0 💬 1 📌 0

Regardless of probing method and dataset, truth-value directions are found to be sensitive to context. However, we also find they are sensitive to the presence of irrelevant information.

14.07.2025 14:54 — 👍 0 🔁 0 💬 1 📌 0

We use probing techniques that identify directions in the model's latent space which encode if sentences are more or less likely to be true. By manipulating inputs and hidden states we evaluate whether probabilities update in appropriate ways.

14.07.2025 14:54 — 👍 0 🔁 0 💬 1 📌 0

📢 Our paper on 'Truth-value Judgment in LLMs' was accepted to @colmweb.org #COLM2025!

In this paper, we investigate how LLMs keep track of the truth of sentences when reasoning.

14.07.2025 14:54 — 👍 4 🔁 1 💬 1 📌 0

Even truth-value directions identified from individual sentences still show some sensitivity to context, although the sensitivity increases when probes are based on sentences appearing in an inferential context.

14.07.2025 14:39 — 👍 0 🔁 0 💬 0 📌 0

We find that regardless of probing method and dataset, models are found to incorporate in-context information when assigning truth-values to sentences. However, we also find they are sensitive to irrelevant information.

14.07.2025 14:39 — 👍 0 🔁 0 💬 1 📌 0

We use probing techniques that identify directions in the model's latent space used to represent sentences as more or less likely to be true. By manipulating both inputs as well as hidden states, we test if probabilities update as expected.

14.07.2025 14:39 — 👍 0 🔁 0 💬 1 📌 0

@stefanfs.me is following 20 prominent accounts

Grace
@gracekind.net

A latent space odyssey gracekind.net

Institute of Science and Technology Austria (ISTA)
@istaresearch

The Institute of Science and Technology Austria (ISTA) is dedicated to cutting-edge basic research and graduate education. www.ista.ac.at

Eugene Vinitsky 🍒
@eugenevinitsky

Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU https://emerge-lab.github.io https://www.admonymous.co/eugenevinitsky

Yuval Pinter: home, then laundry
@uvp

Karaoke enthusiast 🇮🇱 en/he/him

EurIPS Conference
@euripsconf

EurIPS is a community-organized, NeurIPS-endorsed conference in Copenhagen where you can present papers accepted at @neuripsconf.bsky.social eurips.cc

Science HR
@sciencehr

A career network featuring science jobs in academia and industry. Visit our platform at www.science.hr

Piek
@piekvossen

Approaching the end of life.

Grant Peace
@thx

BlueSky only. Tech enthusiast (optimist). Futurist and early-adopter of everything.

Bridgy Fed for the fediverse
@ap.brid.gy

Bridgy Fed (https://fed.brid.gy/) bot user for the fediverse. To bridge your Bluesky account to the fediverse, follow this account. To ask a fediverse user to bridge their account, send their address (eg @user@instance) to this account in a chat message.…

Kendall Clark
@kendallclark

A person who works on AI, knowledge graphs, LLM. ⭐️🐶

larrytheliquid
@larrytheliquid

Formal Methods / Programming Language Theory / Neuro-Symbolic AI Founder at colimit.ai / @colimit.bsky.social

@theonly92

Coding, AI, assistant professor

Verna Dankers
@vernadankers

#NLProc PhD student in #Edinburgh 🏴󠁧󠁢󠁳󠁣󠁴󠁿 Incoming postdoc at ‪#Mila‬ 🇨🇦 interpretability x memorisation x (non-)compositionality. she/her 👩‍💻 🇳🇱

Jessica Hullman
@jessicahullman

Ginni Rometty Prof @NorthwesternCS | Fellow @NU_IPR | Uncertainty + decisions | Humans + AI/ML | Blog @statmodeling

Iwan Williams
@iwan-williams

Philosophy postdoc @ CPAI, University of Copenhagen. Researching artificial intelligence, mental representation, representational formats, concepts. PhD 2021 @ Monash University 🏠 https://iwanrwilliams.wordpress.com

Cameron Buckner
@cameronbuckner

All views my own, links posted w/o endorsement AI, philosophy of science, philosophy of mind, animal cognition Http://cameronbuckner.net

Isabelle Augenstein
@iaugenstein

Professor at the University of Copenhagen. Explainable AI, Natural Language Processing, ML. Head of copenlu.bsky.social lab. #NLProc #NLP #XAI http://isabelleaugenstein.github.io/

Tal Linzen
@tallinzen

NYU professor, Google research scientist. Good at LaTeX.

@eth-ai-center

Welcome to ETH AI Center! We are ethz.ch/en 's central hub leading the way towards trustworthy, accessible and inclusive #artificialintelligence ai.ethz.ch

Levi Lelis
@programsynthesis

Associate Professor - University of Alberta Canada CIFAR AI Chair with Amii Machine Learning and Program Synthesis he/him; ele/dele 🇨🇦 🇧🇷 https://www.cs.ualberta.ca/~santanad