Maitrey Mehta @my-tray - Bluesky Profile

Latest posts by my-tray.bsky.social on Bluesky

Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, Yonatan Belinkov. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.

Outstanding paper (5/7):

"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...

6/n

07.11.2025 22:32 — 👍 11 🔁 3 💬 1 📌 0

1/ 🚨NEW PAPER: "BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs", accepted to ACL Findings 2025!
We introduce the first benchmark specifically designed to help LLMs assist lawyers in writing legal briefs 🧑‍⚖️

📄 arxiv.org/abs/2506.06619
🗂️ huggingface.co/datasets/jw4...

20.06.2025 22:07 — 👍 7 🔁 4 💬 1 📌 2

What Has Been Lost with Synthetic Evaluation? Large language models (LLMs) are increasingly used for data generation. However, creating evaluation benchmarks raises the bar for this emerging paradigm. Benchmarks must target specific phenomena, pe...

𝐖𝐡𝐚𝐭 𝐇𝐚𝐬 𝐁𝐞𝐞𝐧 𝐋𝐨𝐬𝐭 𝐖𝐢𝐭𝐡 𝐒𝐲𝐧𝐭𝐡𝐞𝐭𝐢𝐜 𝐄𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧?

(arxiv.org/abs/2505.22830)

I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social

04.06.2025 22:24 — 👍 11 🔁 4 💬 1 📌 1

🙋

17.11.2024 19:55 — 👍 1 🔁 0 💬 0 📌 0

@my-tray is following 19 prominent accounts

Salem Alotaibi
@otb-ub

Ph.D. candidate in Artificial Intelligence at University of Liverpool | MSc in Advanced Computer Science from Swansea University | Lecturer at University of Bisha

Fateme Hashemi Chaleshtori
@fatemehc

PhD student at Utah NLP, Mechanistic Interpretability, Trustworthy AI, Human-centered AI

Tokenization Workshop (TokShop) @ICML2025
@tokshop

Let's Talk about Tokenization https://tokenization-workshop.github.io

Andreas Waldis
@tresiwald

Behavioral and Internal Interpretability 🔎 Incoming PostDoc Tübingen University | PhD Student at @ukplab.bsky.social, TU Darmstadt/Hochschule Luzern

Ankita
@ankitagupta

PhD@UMass

Alisa Liu
@alisawuffles

phd student at @uwcse

A Aditya Bhardwaj
@aditya-bhardwaj

PhD student at IIIT Delhi | #NLProc #AIforHealthcare #socialcomputing

Barbara Plank
@barbaraplank

Prof, Chair for AI & Computational Linguistics, Head of MaiNLP lab @mainlp.bsky.social, LMU Munich | Co-director CIS @cislmu.bsky.social | Visiting Prof ITU Copenhagen @itu.dk | ELLIS Fellow @ellis.eu | Vice-President ACL | PI MCML @munichcenterml.bsky.social

Kamala Sreepada
@ksreepada

CS Undergrad @ UMD | CLIP Lab @ UMD | Prev @ Uber

Mohit Iyyer
@miyyer

associate prof at UMD CS researching NLP & LLMs

Vilém Zouhar #EMNLP
@zouharvi

PhD student @ ETH Zürich | all aspects of NLP but mostly evaluation and MT | go vegan | https://vilda.net

Juan Diego Rodriguez
@juand-r

CS PhD student at UT Austin in #NLP. Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models! Other interests: math, philosophy, cinema. https://www.juandiego-rodriguez.com/

Rachit Bansal
@brachit

CS PhD @Harvard • Pre-doc @GoogleDeepMind • Anything `science', ~cosmos, and Oxford commas

Lucas Resck
@lucasresck

PhD student in NLP at Cambridge | ELLIS PhD student https://lucasresck.github.io/

Shramay Palta
@shramaypalta

https://shramay-palta.github.io CS PhD student. #NLProc at CLIP UMD | Commonsense + xNLP, AI, CompLing | ex Research Intern @msftresearch.bsky.social

Mina Lee
@mnlee

Assistant Professor @ UChicago CS/DSI (NLP & HCI) | Writing with AI ✍️ https://minalee-research.github.io/

Alon Jacoby
@alon-j

PhD student @ Penn alonj.github.io

Andrea Santilli
@asantilli

PhD student in NLP at Sapienza | Prev: Apple MLR, @colt-upf.bsky.social, HF Bigscience, PiSchool, HumanCentricArt #NLProc www.santilli.xyz

Donghee Choi
@donghee

Assistant Professor at Pusan National University (https://sites.google.com/view/pnu-clink). Former Research Associate at Imperial College London. Bio/Clinical NLP, Food/Financial AI