Tim Baumgärtner

@timbmg.bsky.social

👨‍💻 NLP PhD Student @ukplab.bsky.social

81 Followers  |  262 Following  |  1 Posts  |  Joined: 21.11.2024  |  1.4241

Latest posts by timbmg.bsky.social on Bluesky

๐Ÿ” ๐—ช๐—ฎ๐—ป๐˜ ๐˜๐—ผ ๐—ฒ๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฎ๐˜๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐—ผ๐—ป ๐˜€๐—ฐ๐—ถ๐—ฒ๐—ป๐˜๐—ถ๐—ณ๐—ถ๐—ฐ ๐—ค๐—”, ๐—ฏ๐˜‚๐˜ ๐˜†๐—ผ๐˜‚๐—ฟ ๐—ฑ๐—ฎ๐˜๐—ฎ๐˜€๐—ฒ๐˜ ๐—น๐—ฎ๐—ฐ๐—ธ๐˜€ ๐—ฟ๐—ฒ๐—ฎ๐—น-๐˜„๐—ผ๐—ฟ๐—น๐—ฑ ๐—พ๐˜‚๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป๐˜€ ๐—ฎ๐˜€๐—ธ๐—ฒ๐—ฑ ๐—ฏ๐˜† ๐—ฒ๐˜…๐—ฝ๐—ฒ๐—ฟ๐˜๐˜€?

🚀 PeerQA is the solution: a dataset with questions from peer reviews and answers from the original authors. (1/🧵)

#NLProc

25.04.2025 07:46 | 👍 2    🔁 1    💬 1    📌 0
Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps
When prompted to think step-by-step, language models (LMs) produce a chain of thought (CoT), a sequence of reasoning steps that the model supposedly used to produce its prediction. However, despite mu...

🚨🚨 New preprint 🚨🚨

Ever wondered whether verbalized CoTs correspond to the model's internal reasoning process?

We propose a novel parametric faithfulness approach, which erases information contained in CoT steps from the model parameters to assess CoT faithfulness.

arxiv.org/abs/2502.14829

21.02.2025 12:42 | 👍 46    🔁 13    💬 2    📌 3
Towards real-world fact-checking with large language models
Misinformation poses a growing threat to our society. It has a severe impact on public health by promoting fake cures fear and distrust. Current research

๐—™๐—ฎ๐—ฐ๐˜-๐—–๐—ต๐—ฒ๐—ฐ๐—ธ๐—ถ๐—ป๐—ด ๐—ถ๐—ป ๐˜๐—ต๐—ฒ ๐—”๐—ด๐—ฒ ๐—ผ๐—ณ ๐—”๐—œ โ€“ ๐—” ๐—ง๐—ฎ๐—น๐—ธ ๐—ฏ๐˜† ๐—œ๐—ฟ๐˜†๐—ป๐—ฎ ๐—š๐˜‚๐—ฟ๐—ฒ๐˜ƒ๐˜†๐—ฐ๐—ต @๐—”๐—œ ๐—ณ๐—ผ๐—ฟ ๐—š๐—ผ๐—ผ๐—ฑ

Misinformation is a new weapon disrupting public debates, scientific discussions, and political decisions. How can we identify and counter misleading content?
(1/🧵)

18.02.2025 08:27 | 👍 3    🔁 1    💬 1    📌 0

🤔 An Energy Star for AI? Introducing AI Energy Score: First-ever rating system comparing 166 AI models' energy consumption!

From LLaMa to Gemma, get transparent ⭐️1-5 efficiency ratings.

Incredible work led by @sashamtl.bsky.social

huggingface.co/blog/sasha/a...

11.02.2025 09:44 | 👍 25    🔁 8    💬 0    📌 0

Excited to share that our paper "PeerQA: A Scientific Question Answering Dataset from Peer Reviews" has been accepted to #NAACL2025. Looking forward to presenting it in Albuquerque 🏜️!

27.01.2025 14:13 | 👍 5    🔁 0    💬 0    📌 0
Perspectives on Intelligence: Community Survey
Research survey exploring how NLP/ML/CogSci researchers define and use the concept of intelligence.

What do YOU mean by "intelligence", and does ChatGPT fit your definition?
We collected the major criteria used in CogSci and other fields, and designed a survey to find out!

Access link: www.survey-xact.dk/collect
Code: 4S7V-SN4M-S536
Time: 5-10 mins

04.12.2024 07:48 | 👍 32    🔁 13    💬 2    📌 10

@timbmg is following 19 prominent accounts