The project that started my whitespace obsession... #EMNLP2025
While we've all been worrying about tokenizers, lurking in the background has been the preprocessing *before* tokenization. Poems break standard HTML-to-text linearization systems, and we find that multimodal models aren't a solution.
31.10.2025 16:53 โ ๐ 83 ๐ 14 ๐ฌ 3 ๐ 3
๐๐'๐ง๐ ๐๐๐ง๐๐ฃ๐ ๐ฃ๐๐ฌ ๐๐๐๐ช๐ก๐ฉ๐ฎ ๐ข๐๐ข๐๐๐ง๐จ!
KSoC: utah.peopleadmin.com/postings/190... (AI broadly)
Education + AI:
- utah.peopleadmin.com/postings/189...
- utah.peopleadmin.com/postings/190...
Computer Vision:
- utah.peopleadmin.com/postings/183...
07.11.2025 23:35 โ ๐ 16 ๐ 10 ๐ฌ 1 ๐ 0
Which, whose, and how much knowledge do LLMs represent?
I'm excited to share our preprint answering these questions:
"Epistemic Diversity and Knowledge Collapse in Large Language Models"
๐Paper: arxiv.org/pdf/2510.04226
๐ปCode: github.com/dwright37/ll...
1/10
13.10.2025 11:25 โ ๐ 89 ๐ 26 ๐ฌ 2 ๐ 1
Go check Alex's poster today (Wed) in Suzhou! #EMNLP2025
I'm still so proud of our work (led by @lasha.bsky.social) on CondaQA, so we had to ask what would happen if we tried to create high-quality reasoning-over-text benchmarks now that LLMs are available. Turns out, we'd make an easier benchmark!
04.11.2025 22:44 โ ๐ 8 ๐ 1 ๐ฌ 0 ๐ 0
I'll be in Suzhou ๐จ๐ณ at #EMNLP this week presenting "What has been Lost with Synthetic Evaluation?" done with @anamarasovic.bsky.social & @lasha.bsky.social! ๐
๐Findings Session 1 - Hall C
๐
Wed, November 5, 13:00 - 14:00
arxiv.org/abs/2505.22830
03.11.2025 11:03 โ ๐ 11 ๐ 2 ๐ฌ 0 ๐ 1
Screenshot of paper title: Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
AI always calling your ideas โfantasticโ can feel inauthentic, but what are sycophancyโs deeper harms? We find that in the common use case of seeking AI advice on interpersonal situationsโspecifically conflictsโsycophancy makes people feel more right & less willing to apologize.
03.10.2025 22:53 โ ๐ 115 ๐ 46 ๐ฌ 2 ๐ 7
Paper title: Language models align with brain regions that represent concepts across modalities.
Authors: Maria Ryskina, Greta Tuckute, Alexander Fung, Ashley Malkin, Evelina Fedorenko.
Affiliations: Maria is affiliated with the Vector Institute for AI, but the work was done at MIT. All other authors are affiliated with MIT.
Email address: maria.ryskina@vectorinstitute.ai.
Interested in language models, brains, and concepts? Check out our COLM 2025 ๐ฆ Spotlight paper!
(And if youโre at COLM, come hear about it on Tuesday โ sessions Spotlight 2 & Poster 2)!
04.10.2025 02:15 โ ๐ 26 ๐ 5 ๐ฌ 1 ๐ 1
Figure showing the four phases of convergence in LM training
LLMs are trained to mimic a โtrueโ distributionโtheir reducing cross-entropy then confirms they get closer to this target while training. Do similar models approach this target distribution in similar ways, though? ๐ค Not really! Our new paper studies this, finding 4-convergence phases in training ๐งต
01.10.2025 18:08 โ ๐ 24 ๐ 4 ๐ฌ 1 ๐ 1
Abhilasha Ravichander
My group at the Max Planck Institute for Software Systems will be recruiting PhD students. The application deadline is December 31, 2025 (see lasharavichander.github.io/contact.html).
Please feel free to advertise other opportunities or add resources in this thread!
01.10.2025 20:37 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
CS PhD Statements of Purpose
cs-sop.org is a platform intended to help CS PhD applicants. It hosts a database of example statements of purpose (SoP) shared by previous applicants to Computer Science PhD programs.
It is PhD application season again ๐ For those looking to do a PhD in AI, these are some useful resources ๐ค:
1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]
01.10.2025 20:37 โ ๐ 9 ๐ 4 ๐ฌ 1 ๐ 0
Happy to see this work accepted to #EMNLP2025! ๐๐๐
20.08.2025 20:49 โ ๐ 12 ๐ 1 ๐ฌ 0 ๐ 0
I am recruiting emergency reviewers for *SEM 2025 (The 14th Joint Conference on Lexical and Computational Semantics). Please DM me if you might be able to contribute a review within the next few days ๐
12.08.2025 00:44 โ ๐ 0 ๐ 1 ๐ฌ 0 ๐ 0
Weโre thrilled to congratulate Dr. Abhilasha Ravichander (@lasha.bsky.social) and her team for receiving the Outstanding Paper Award at #acl2025 for their work titled "HALoGEN: Fantastic LLM Hallucinations and Where to Find Them"! ๐โจ
#ACL #LLMs #Hallucination #WiAIR #WomenInAI
08.08.2025 16:49 โ ๐ 16 ๐ 2 ๐ฌ 2 ๐ 0
See you soon on the other side!! Best of luck with wrapping up
06.08.2025 17:52 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Status Update: I'm in the middle of my move from Denmark to Colorado! If I seem to be missing for the 1-2 weeks, that is the main reason. Picture me lost amidst suitcases, boxes, moving pods, and far too many books.
Copenhagen friends, I'm here for a couple more days! Please stop by P1 to say bye ๐ฅบ
06.08.2025 16:11 โ ๐ 47 ๐ 1 ๐ฌ 6 ๐ 1
Speaker announcement: the new episode of the Women in AI Research WiAIR podcast is out on August 6th. Our guest is Dr. Abhilasha Ravichander, a postdoc at University of Washington and Assistant Professor at Max Planck Institute for Software Systems.
๐๏ธ New Women in AI Research #WiAIR episode coming Aug 6!
We talk to @lasha.bsky.social about LLM Hallucination, her award-winning HALoGEN benchmark, and how we can better evaluate hallucinations in language models.
๐ Whatโs inside:
1/
01.08.2025 14:35 โ ๐ 2 ๐ 1 ๐ฌ 1 ๐ 0
Open Call for Expressions of Interest in Max Planck Directorships:
Expressions of interest can be submitted until 31 October 2025.
Director at Max Planck - a unique position! The Open Call for Expressions of Interest in Max Planck Directorships is open now and can be submitted by the 31st of October 2025. โก๏ธ mpg.de/directors - Please share the Open Call among potential candidates.
01.08.2025 09:06 โ ๐ 162 ๐ 196 ๐ฌ 1 ๐ 15
Thank you so much, excited to move to Germany! I will be based in Kaiserslautern.
01.08.2025 06:22 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Super super thrilled that HALoGEN, our study of LLM hallucinations and their potential origins in training data, received an โจOutstanding Paper Awardโจ at ACL!
Joint work w/i Shrusti Ghela*, David Wadden, and Yejin Choi
bsky.app/profile/lash...
30.07.2025 19:53 โ ๐ 34 ๐ 3 ๐ฌ 0 ๐ 0
Thank you Asia, super looking forward to collaborating!!
28.07.2025 21:28 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Thank you Lucy!!
27.07.2025 13:36 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Thank you Ana, and thank you again for your generous and thoughtful mentorship!๐ซถ
27.07.2025 13:07 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Join Abhilasha's lab, she is an awesome researcher and mentor! I can attest, being her collaborator was great fun ๐คฉ
24.07.2025 13:25 โ ๐ 2 ๐ 1 ๐ฌ 0 ๐ 0
People applying for NLP PhDs, work with Abhilasha -- she is awesome!!
23.07.2025 07:58 โ ๐ 5 ๐ 2 ๐ฌ 1 ๐ 0
Thank you so much Chenhao!!
23.07.2025 22:07 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Thank you so much, Leonie!! Excited for life in Germany (and thanks also for being a duolingo buddy for my German ๐คฃ)
23.07.2025 22:01 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Thank you so much Marzena!!
23.07.2025 21:59 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Phd student @ University of Mannheim | Social NLP | she/her
Digital Humanities | NLP | Computational Theology
Now: Institute for Digital Humanities, University of Gรถttingen.
Before: Digital Academy, Gรถttingen Academy of Sciences and Humanities.
Before Before: The list is long...
Professor of Data Science
Lead of @ds-hamburg.bsky.social
Researching Safe Generative AI
Data Science research group on AI ethics and multicultural NLP, led by @a-lauscher.bsky.social.
Machine Learning Research Scientist, MScAC | ML Researcher at Vector Institute | Co-Host of Women in AI Research (WiAIR) Podcast | 5+ Years of Industry Experience | University of Toronto Alumnus
Language models and interpretable machine learning. Postdoc @ Uni Tรผbingen.
https://sbordt.github.io/
Machine learning researcher, working on causal inference and healthcare applications
Alexander von Humboldt Professor for AI and Chair for Societal Computing at Saarland University. Co-Director at https://i2sc.net.
Assistant Professor @ UW iSchool. Interested in computational social science, social networks & causal inference.
http://martinsaveski.com
Incoming Assistant Professor @ University of Cambridge.
Responsible AI. Human-AI Collaboration. Interactive Evaluation.
umangsbhatt.github.io
Human-Centric Machine Learning at the Max Planck Institute for Software Systems
Postdoc @vectorinstitute.ai | organizer @queerinai.com | previously MIT, CMU LTI | ๐ rodent enthusiast | she/they
๐ https://ryskina.github.io/
Assistant Professor at @cs.ubc.caโฌ and โช@vectorinstitute.aiโฌ working on Natural Language Processing. Book: https://lostinautomatictranslation.com/
Organized and sponsored by SIGLEX, the Special Interest Group of the ACL, *SEM brings together researchers interested in the semantics of natural languages and its computational modeling.
*SEM 2026: https://starsem2026.github.io
Prof, Chair for AI & Computational Linguistics,
Head of MaiNLP lab @mainlp.bsky.social, LMU Munich
Co-director CIS @cislmu.bsky.social
Visiting Prof ITU Copenhagen @itu.dk
ELLIS Fellow @ellis.eu
Vice-President ACL
PI MCML @munichcenterml.bsky.social
4th year PhD student in UMD CS advised by Philip Resnik. I have also been a research intern at MSR (2024) and Adobe Research (2022).
PhD Student @nyudatascience.bsky.social, working with He He on NLP and Human-AI Collaboration.
Also hanging out @ai2.bsky.social
Website - https://vishakhpk.github.io/
NLP & ML research @cohereforai.bsky.social ๐จ๐ฆ