Tim Baumgärtner timbmg - Bluesky Statics

💡 TIL @overleaf.com is basically a git repo.

In my research workflow, I directly added it as submodule to my code repo. Now I can produce figures and tables, and have them magically uploaded to Overleaf just by pushing the repo.

No more renaming, keeping versions straight, and manual uploading 😇

10.11.2025 10:44 — 👍 0 🔁 0 💬 0 📌 0

💡 TIL, it's super easy to fetch data from Google Sheets into Pandas. Makes it really convenient to annotate some data.

Previously, I was always downloading CSVs, losing track of file versions, and loading and merging them sluggishly in Python.

👉 find the code here: gist.github.com/timbmg/6c2d6...

04.11.2025 16:00 — 👍 0 🔁 0 💬 0 📌 0

🔍 𝗪𝗮𝗻𝘁 𝘁𝗼 𝗲𝘃𝗮𝗹𝘂𝗮𝘁𝗲 𝗺𝗼𝗱𝗲𝗹𝘀 𝗼𝗻 𝘀𝗰𝗶𝗲𝗻𝘁𝗶𝗳𝗶𝗰 𝗤𝗔, 𝗯𝘂𝘁 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮𝘀𝗲𝘁 𝗹𝗮𝗰𝗸𝘀 𝗿𝗲𝗮𝗹-𝘄𝗼𝗿𝗹𝗱 𝗾𝘂𝗲𝘀𝘁𝗶𝗼𝗻𝘀 𝗮𝘀𝗸𝗲𝗱 𝗯𝘆 𝗲𝘅𝗽𝗲𝗿𝘁𝘀?

🚀 PeerQA is the solution: a dataset with questions from peer reviews and answers from the original authors. (1/🧵)

#NLProc

25.04.2025 07:46 — 👍 2 🔁 1 💬 1 📌 0

Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps When prompted to think step-by-step, language models (LMs) produce a chain of thought (CoT), a sequence of reasoning steps that the model supposedly used to produce its prediction. However, despite mu...

🚨🚨 New preprint 🚨🚨

Ever wonder whether verbalized CoTs correspond to the internal reasoning process of the model?

We propose a novel parametric faithfulness approach, which erases information contained in CoT steps from the model parameters to assess CoT faithfulness.

arxiv.org/abs/2502.14829

21.02.2025 12:42 — 👍 48 🔁 13 💬 2 📌 3

Towards real-world fact-checking with large language models Misinformation poses a growing threat to our society. It has a severe impact on public health by promoting fake cures fear and distrust. Current research

𝗙𝗮𝗰𝘁-𝗖𝗵𝗲𝗰𝗸𝗶𝗻𝗴 𝗶𝗻 𝘁𝗵𝗲 𝗔𝗴𝗲 𝗼𝗳 𝗔𝗜 – 𝗔 𝗧𝗮𝗹𝗸 𝗯𝘆 𝗜𝗿𝘆𝗻𝗮 𝗚𝘂𝗿𝗲𝘃𝘆𝗰𝗵 @𝗔𝗜 𝗳𝗼𝗿 𝗚𝗼𝗼𝗱

Misinformation is a new weapon disrupting public debates, scientific discussions, and political decisions. How can we identify and counter misleading content?
(1/🧵)

18.02.2025 08:27 — 👍 3 🔁 1 💬 1 📌 0

🤔 An Energy Star for AI? Introducing AI Energy Score: First-ever rating system comparing 166 AI models' energy consumption!

From LLaMa to Gemma, get transparent ⭐️1-5 efficiency ratings.

Incredible work led by @sashamtl.bsky.social

huggingface.co/blog/sasha/a...

11.02.2025 09:44 — 👍 25 🔁 8 💬 0 📌 0

Excited to share that our Paper "PeerQA: A Scientific Question Answering Dataset from Peer Reviews" as been accepted to #NAACL2025 Looking forward to presenting it in Albuquerque 🏜️!

27.01.2025 14:13 — 👍 5 🔁 0 💬 0 📌 0

Perspectives on Intelligence: Community Survey Research survey exploring how NLP/ML/CogSci researchers define and use the concept of intelligence.

What do YOU mean by "intelligence", and does ChatGPT fit your definition?
We collected the major criteria used in CogSci and other fields, and designed a survey to find out!

Access link: www.survey-xact.dk/collect
Code: 4S7V-SN4M-S536
Time: 5-10 mins

04.12.2024 07:48 — 👍 32 🔁 13 💬 2 📌 10

Posts by Tim Baumgärtner (@timbmg.bsky.social)