Maitrey Mehta

@my-tray.bsky.social

Ph.D. Student at Utah NLP | Low-resource NLP | Multilinguality

461 Followers  |  322 Following  |  1 Post  |  Joined: 13.08.2024  |  2.175

Latest posts by my-tray.bsky.social on Bluesky

Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, Yonatan Belinkov. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.

Outstanding paper (5/7):

"Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps"
by Martin Tutek, Fateme Hashemi Chaleshtori, Ana Marasovic, and Yonatan Belinkov
aclanthology.org/2025.emnlp-m...

6/n

07.11.2025 22:32  |  👍 11  |  🔁 3  |  💬 1  |  📌 0

1/ 🚨 NEW PAPER: "BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs", accepted to ACL Findings 2025!
We introduce the first benchmark specifically designed to help LLMs assist lawyers in writing legal briefs 🧑‍⚖️

📄 arxiv.org/abs/2506.06619
🗂️ huggingface.co/datasets/jw4...

20.06.2025 22:07  |  👍 7  |  🔁 4  |  💬 1  |  📌 2
What Has Been Lost with Synthetic Evaluation? Large language models (LLMs) are increasingly used for data generation. However, creating evaluation benchmarks raises the bar for this emerging paradigm. Benchmarks must target specific phenomena, pe...

What Has Been Lost With Synthetic Evaluation?

(arxiv.org/abs/2505.22830)

I'm happy to announce that the preprint release of my first project is online! Developed with the amazing support of @lasha.bsky.social & @anamarasovic.bsky.social

04.06.2025 22:24  |  👍 11  |  🔁 4  |  💬 1  |  📌 1

🙋

17.11.2024 19:55  |  👍 1  |  🔁 0  |  💬 0  |  📌 0

@my-tray is following 19 prominent accounts