Atharva Kulkarni's Avatar

Atharva Kulkarni

@athrvkk.bsky.social

CS PhD at USC | Prev - CMU, Apple, IIIT Delhi | Robust, Generalizable, and Trustworthy NLP https://athrvkk.github.io/

660 Followers  |  502 Following  |  11 Posts  |  Joined: 15.11.2024  |  1.6041

Latest posts by athrvkk.bsky.social on Bluesky

πŸ™ŒπŸ₯³Had great fun doing this during my summer internship with folks from Apple (Yuan Zhang, Joel Ruben Antony Moniz, Xiou Ge, Bo-Hsiang Tseng, Dhivya Piraviperumal, Hong Yu) and USC (@swabhs.bsky.social)

Looking forward to the feedback! πŸ™‚
#LLMs #NLProc

(7/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

🚫Bottom line: There’s no single metric that captures hallucinations reliably across the board.

🎯Our work highlights the need for robust, context-aware, and generalizable hallucination detection tools as a prerequisite to meaningful mitigation.

(6/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

βœ…What works better?
Unsurprisingly, GPT4-based evaluators show the highest reliability with humans across settings 🌟
Using ensembles of multiple metrics is a promising avenue⭐️
Instruction tuning & mode-seeking decoding help reduce hallucinationsπŸ“ˆ

(5/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

Our findings highlight:
⚠️Many existing metrics show poor alignment with human judgments
⚠️The inter-metric correlation is also weak
⚠️The show limited generalization across datasets, tasks, and models
⚠️They do not consistent improvement with larger models

(4/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🧐Focusing on faithfulness and factuality errors in QA and dialogue tasks, we study diverse metrics spanning:
1. Syntactic and semantic similarity
2. Natural language inference
3. Multi-step question answering pipelines
4. Custom-trained models
5. SOTA LLMs as judge.

(3/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ€”Despite a surge in research on hallucination mitigation, few ask the critical questions:
1. Are the metrics capturing the hallucinations effectively?
2. Do they align with each other and the human notion of hallucination?
3. Do they generalize across different settings?

(2/n)

30.04.2025 18:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Hallucinations in LLMs are realβ€”and so are the problems with how we measure them πŸ“‰

Our latest work questions the generalizability of hallucination detection metrics across tasks, datasets, model sizes, training methods, and decoding strategies πŸ’₯

arxiv.org/abs/2504.18114

(1/n)

30.04.2025 18:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Reasoning about the "why" behind user behavior can improve LLM personas! βœ¨πŸ§ πŸ“ˆ

πŸ“Excited to share our new work: Improving LLM Personas via Rationalization with Psychological Scaffolds

πŸ”— arxiv.org/abs/2504.17993
🧡 (1/n)

29.04.2025 01:05 β€” πŸ‘ 14    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1
Preview
NLP grad students Join the conversation

There's too many starter packs.
πŸ‘‡ Here's a list, mostly for NLP, ML, and related areas.

01.12.2024 03:05 β€” πŸ‘ 40    πŸ” 11    πŸ’¬ 3    πŸ“Œ 2
Post image Post image

#socalnlp is the biggest it's ever been in 2024! We have 3 poster sessions up from 2! How many years until it's a two-day event?? 🀯

22.11.2024 21:50 β€” πŸ‘ 26    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

Started a SoCal AI/ML/NLP researchers starter pack! It's a bit sparse right now, and perhaps more NLP heavy, but hey, nominate yourself and others! go.bsky.app/6QckPj9

19.11.2024 15:28 β€” πŸ‘ 43    πŸ” 8    πŸ’¬ 17    πŸ“Œ 1

πŸ™‹πŸ»β€β™‚οΈπŸ™‹πŸ»β€β™‚οΈ

19.11.2024 23:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Hey John, thanks for starting this packet! Could you please add me as well?

18.11.2024 18:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Can you please add me to the pack! Looking forward to interacting with everyone!

15.11.2024 06:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Great initiative!! Can you please add me! Looking forward to interacting with everyone!!πŸ’―

15.11.2024 06:56 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@athrvkk is following 20 prominent accounts