๐๐'๐ง๐ ๐๐๐ง๐๐ฃ๐ ๐ฃ๐๐ฌ ๐๐๐๐ช๐ก๐ฉ๐ฎ ๐ข๐๐ข๐๐๐ง๐จ!
KSoC: utah.peopleadmin.com/postings/190... (AI broadly)
Education + AI:
- utah.peopleadmin.com/postings/189...
- utah.peopleadmin.com/postings/190...
Computer Vision:
- utah.peopleadmin.com/postings/183...
07.11.2025 23:35 โ ๐ 16 ๐ 10 ๐ฌ 1 ๐ 0
So thankful for this amazing team and all I learned through the process! Proud of how it all came together๐๐
๐: aclanthology.org/2025.emnlp-m...
08.11.2025 01:57 โ ๐ 9 ๐ 0 ๐ฌ 0 ๐ 0
Very honored to be one out of seven outstanding papers at this years' EMNLP :)
Huge thanks to my amazing collaborators @fatemehc.bsky.social @anamarasovic.bsky.social @boknilev.bsky.social , this would not have been possible without them!
07.11.2025 08:58 โ ๐ 23 ๐ 6 ๐ฌ 2 ๐ 2
Thrilled that FUR was accepted to @emnlpmeeting.bsky.social Main๐
In case you canโt wait so long to hear about it in person, it will also be presented as an oral at @interplay-workshop.bsky.social @colmweb.org ๐ฅณ
FUR is a parametric test assessing whether CoTs faithfully verbalize latent reasoning.
21.08.2025 15:21 โ ๐ 13 ๐ 3 ๐ฌ 1 ๐ 1
9/ We hope BriefMe encourages more Legal NLP development that directly aids legal professionals!
Check out our paper for the full methodology, human evaluation details, and comprehensive benchmarks.
What other legal NLP applications can we design using BriefMe? ๐ค
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
8/ โ๏ธ BriefMe extends Legal NLP by introducing a dataset of legal briefs, a type of legal document that hasn't been overlooked before. We've designed tasks that attorneys actually need in their daily work, opening up new research directions to be explored to assist professionals.
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
7/ However, LLMs struggle with these complex tasks:
- Realistic argument completion: Llama-3.1-70B finds missing arguments only 18% of the time
- Case retrieval: Best method finds correct precedents in top-5 results just 31.4% of the time
Lots of room for improvement! ๐
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
6/ Surprising finding: GPT-4o outperforms human-written headings!
๐ค GPT-4o: 4.3/5 avg. LLM-as-judge rating for both arg. summ. & comp.
๐คต Lawyers: 4.0/5 (summ.) and 3.9/5 (comp.) avg. rating
LLMs excel at summarization and guided completion tasks, requiring only minor edits.
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
5/ Evaluating generated text is challenging: traditional metrics (BLEU/ROUGE/...) are not aligned with human preferences. Instead, we built an LLM-as-judge using o3-mini, instructed with expert-written guidelines for brief headings, proving more reliable than human raters!
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
4/ Our novel argument completion task tests if LLMs can identify WHERE exactly a missing argument should go in a brief's logical flow and WHAT that argument should be.
๐งฉ This realistic version is especially challenging: models must spot gaps in the ToCs with no guidance.
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
3/ We built BriefMe from Supreme Court briefs with 3 key tasks:
- Argument summarization
- Realistic/Guided Argument completion: filling in missing arguments within the Table of Contents (ToC)
- Case retrieval
Each assesses different practical aspects of legal reasoning.
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
2/ Legal briefs are documents where attorneys present their arguments to judges, making the case for their client's position by interpreting the law and citing relevant precedents.
Most legal NLP work focuses on judicial opinions, but we target the attorney's perspective instead ๐๏ธ
20.06.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
1/ ๐จNEW PAPER: "BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs", accepted to ACL Findings 2025!
We introduce the first benchmark specifically designed to help LLMs assist lawyers in writing legal briefs ๐งโโ๏ธ
๐ arxiv.org/abs/2506.06619
๐๏ธ huggingface.co/datasets/jw4...
20.06.2025 22:07 โ ๐ 7 ๐ 4 ๐ฌ 1 ๐ 2
GitHub - technion-cs-nlp/parametric-faithfulness
Contribute to technion-cs-nlp/parametric-faithfulness development by creating an account on GitHub.
It has been amazing to work with @fatemehc.bsky.social, @anamarasovic.bsky.social and Yonatan Belinkov on this incredibly important topic.
I look forward to further works on the parametric faithfulness route!
Codebase (& data): github.com/technion-cs-...
21.02.2025 12:42 โ ๐ 6 ๐ 2 ๐ฌ 0 ๐ 0
PhD student explainable AI @ ML Group TU Berlin, BIFOLD
Explainable AI research from the machine learning group of Prof. Klaus-Robert Mรผller at @tuberlin.bsky.social & @bifold.berlin
Safe and robust AI/ML, computational sustainability. Former President AAAI and IMLS. Distinguished Professor Emeritus, Oregon State University. https://web.engr.oregonstate.edu/~tgd/
Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning.
Lover of Linux ๐ง, coffee โ, and retro gaming. Big fan of open-source. #gohabsgo ๐จ๐ฆ
For more info: https://linktr.ee/sharky6000
Ph.D. Student at Utah NLP | Low-resource NLP | Multilinguality
Assistant Prof at University of Utah Fall 2025. NLP+CV+RL. RS at Google DeepMind. PhD from CMU MLD, undergrad Georgia Tech. Sometimes researcher, frequent shitposter.
PhD student in Interpretable Machine Learning at @tuberlin.bsky.social & @bifold.berlin
https://web.ml.tu-berlin.de/author/laura-kopf/
PhD student @LIG | Causal abstraction, interpretability & LLMs
Senior Lecturer and Researcher @LMU_Muenchen working on #ExplainableAI / #interpretableML and #OpenML
PhD candidate for Interpretable AI @ Fraunhofer HHI Berlin
The largest workshop on analysing and interpreting neural networks for NLP.
BlackboxNLP will be held at EMNLP 2025 in Suzhou, China
blackboxnlp.github.io
Postdoc @ TakeLab, UniZG | previously: Technion; TU Darmstadt | PhD @ TakeLab, UniZG
Faithful explainability, controllability & safety of LLMs.
๐ On the academic job market ๐
https://mttk.github.io/
The 2025 Conference on Language Modeling will take place at the Palais des Congrรจs in Montreal, Canada from October 7-10, 2025
NLP PhD @ USC
Everything language, humans, felines and music
(and an infrequent shit-poster)
brihijoshi.github.io
Communication Team @ ARR
PhD Candidate @ TU Munich - Legal NLP (2022-), MTech + BTech - CS,IIT KGP(2015-20),
Intern@JPMC (2024-25), Amazon (2024), Adobe (2023), Microsoft (2019);
Microsoft (2020-22),
CS Phd student in Northwestern University.
looking for 25 research intern in US
Research interests: LLM, GNN
CogSci MA Student @Unibogazici, interested in commonsense reasoning in LLMs. #NLP
Working on ethics and bias in NLP @CardiffNLP #NLP #NLProc
#NLP / #NLProc , #dataScience, #AI / #ArtificialIntelligence, #linguistics (#syntax, #semantics, โฆ), occasional #parenting, #gardening, & what not. PhD. Adjunct prof once in a full red moon. Industry / technical mentor. Not my opinion, never my employerโs