๐ADVSCORE won an Outstanding Paper Award at #NAACL2025
๐จ Don't miss out on our poster presentation *today at 2 pm* by Yoo Yeon (first author).
๐Poster Session 5 - HC: Human-centered NLP
๐ผ Highly recommend talking to her if you are hiring and/or interested in Human-focused Al dev and evals!
01.05.2025 12:38 โ ๐ 7 ๐ 1 ๐ฌ 1 ๐ 0
A screenshot of a paper showing the title - "Pairscale: Analyzing Attitude Change in Online Communities" by Rupak Sarkar, Patrick Wu, Kristina Miler, Alexander Hoyle and Philip Resnik.
Are you tired of using traditional stance detection to measure the polarity of text? Our #NAACL25 paper proposes an approach that uses pairwise comparisons to order texts on a continuous scale, capturing both implicit and explicit evidence in language.
๐Today in Hall 3 from 4-5:30pm
Come say hi!
01.05.2025 15:08 โ ๐ 6 ๐ 1 ๐ฌ 0 ๐ 0
This helps us build groups of examples that evaluate the same pieces of knowledge, allowing us to measure under what *contexts* an LLM can correctly draw a particular inference ("inferential consistency"). We find that LLMs still exhibit room for improvement on this front. (5/n)
29.04.2025 20:41 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
We propose a method to pinpoint the particular pieces of knowledge a defeasible reasoning example aims to evaluate by identifying the atom(s) that are most critical in determining the overall label of a defeasible NLI example. (4/n)
29.04.2025 20:41 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
We also explore how atomic hypothesis decomposition can help us better understand the complexities of defeasible reasoning, a softer inference task that requires models to weigh the effects of multiple, sometimes competing, pieces of evidence on a hypothesis. (3/n)
29.04.2025 20:41 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
For example, after decomposing hypothesis from an NLI premise-hypothesis pair into atoms, we can measure whether its judgment on the overall pair is consistent with its set of judgments on each premise-atom sub-problem in a logical way. (2/n)
29.04.2025 20:40 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I'll be presenting this work with @rachelrudinger at #NAACL2025 tomorrow (Wednesday 4/30) in Albuquerque during Session C (Oral/Poster 2) at 2pm! ๐ฌ
Decomposing hypotheses in traditional NLI and defeasible NLI helps us measure various forms of consistency of LLMs. Come join us!
29.04.2025 20:40 โ ๐ 8 ๐ 3 ๐ฌ 5 ๐ 1
Ah, thanks Joe!! :) And a huge thank you to you for all your early feedback -- it definitely helped the way we framed the concept of atomic inference.
19.02.2025 15:48 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Professor for Natural Language Processing (@utn_nuremberg), CoNLL co-chair 2025, organizer of LSDSem & UnImplicit workshops, expert in misunderstandings.
Assistant Professor at @cs.ubc.caโฌ and โช@vectorinstitute.aiโฌ working on Natural Language Processing. Book: https://lostinautomatictranslation.com/
CS Ph.D. candidate @ USC, https://billzhu.me
Researcher in NLP/IR at the University of Sheffield, Research interests include Conversational AI, RAG and other topics among NLP/IR.
NLP. NMT. Main author of Marian NMT. Research Scientist at Microsoft Translator.
https://marian-nmt.github.io
Principal Scientist at Indeed. PhD Student at UT Austin. AI, Deep Learning, PGMs, and NLP.
NLP research - PhD student at UW
Assistant professor in NLP @UniMelb
AI Safety Fellow @Anthropic | PhD at University of Edinburgh | LLM Hallucinations | Clinical NLP | Opinions are my own.
Personal page: https://aryopg.github.io
่ฎธๆนๅญ๐ฉ๐ปโ๐ปphd student @ nyu, interested in natural language processing
๐: carriex.github.io
Professor of philosophy UTAustin. Philosophical logic, formal epistemology, philosophy of language, Wang Yangming.
www.harveylederman.com
Safe and robust AI/ML, computational sustainability. Former President AAAI and IMLS. Distinguished Professor Emeritus, Oregon State University. https://web.engr.oregonstate.edu/~tgd/
Research Scientist at Google Research
https://www.cs.unc.edu/~somnath/
PhD student @ UNC NLP with @mohitbansal working on grounded reasoning + code generation | currently interning at Ai2 (PRIOR) | formerly NEC Laboratories America | BS + MS @ Northeastern
zaidkhan.me
Researcher at @allen_ai (Ai2) || Research on NLP, LLMs, Reasoning, Agents, AI4Code, AI4Math || Prev: Microsoft AI, Univ. Of Illinois (UIUC), Max Planck (MPI), IIT-Bombay, BITS-Pilani
Web: https://shashankgupta.info/