In honor of some new people coming from AI twitter, I finally updated my post to recommend For You over Discover.
16.10.2025 11:43
@kesnet50.bsky.social
Final year Ph.D. candidate in NLP, CV at JHU. Researching reasoning systems, multimodality, and AI for science. On the job market for full-time industry positions! #NLProc https://katesanders9.github.io/
A company that believed it was on the verge of AGI or ASI wouldn’t capitulate to the government because it wouldn’t care about government contracts. They would soon BE the economy and the government would soon be capitulating to them.
11.10.2025 20:57
Astronaut meme: "Wait, it's all perception?" "Always has been"
09.10.2025 20:04
Keynote spotlight #4: the second day of COLM will close with @ghadfield.bsky.social from JHU talking about human society alignment, and lessons for AI alignment
22.09.2025 14:23
Congratulations to Alane Suhr '22, a #CornellTech Ph.D. #alumni advised by associate professor Yoav Artzi, for receiving the prestigious 2022 @aaai.org / @acmsigai.bsky.social Doctoral Dissertation Award!
Read more about the award here: aaai.org/about-aaai/a...
@yoavartzi.com
Line chart showing that there's been a rapid escalation in how quickly the world installs a gigawatt of solar power capacity.
Time for the world to install a gigawatt of solar power capacity
2004: A year
2010: ~ a month
2015: ~ a week
Now: A day
ourworldindata.org/data-insight... 🧪
Urban Stats 28.0.0
The mapper is now completely redesigned by me and @spudwaffle.bsky.social, allowing for much prettier looking maps and way more customization alongside significantly more options for geographies!
See below for some of the examples of the maps you can create!
When reading AI reasoning text (aka CoT), we (humans) form a narrative about the underlying computation process, which we take as a transparent explanation of model behavior. But what if our narratives are wrong? We measure that and find it usually is.
Now on arXiv: arxiv.org/abs/2508.16599
Paper: arxiv.org/pdf/2505.22037
Project page: aka.ms/jailbreak-d...
Dataset: huggingface.co/datasets/ja...
So, what's the future of AI safety benchmarks? Jack's solution is "renewable benchmarks" that allow us to refresh and expand benchmarks with a single click!!
x.com/jackjingyuz...
In our forthcoming paper, John Hummel and I ask what it would mean for a neural computing architecture such as a brain to implement a symbol system, and the related question of what makes it difficult for them to do so, with an eye toward the differences between humans, animals, and ANNs.
22.08.2025 18:25
This paper is making the rounds: arxiv.org/abs/2506.21734
A tiny (27M) brain-inspired model trained just on 1000 samples outperforming o3-mini-high on reasoning tasks.
#MLSky
Interested in large-scale GPU optimization? Interested in how modern neural networks are being deployed to solve classical optimization problems?
Writing a paper on these topics? Submit to the ScaleOPT workshop at NeurIPS!
www.cvxgrp.org/scaleopt/#su...
I'm recruiting MLEs @ #ACL2025!
Reach out if you know folks interested in legal NLP, structured prediction, and full-time work in a startup environment in NYC
I'll also always chat about:
• population-level inference on corpora
• broad-coverage semantics
• which café has the best Sachertorte in Vienna
My students and I are presenting three papers on Monday at #ACL2025 and this thread will recap them (including their videos).
28.07.2025 08:35
“Wikipedia is this economic anomaly. In many ways, it’s sort of magical that people will just volunteer without explicit economic incentives to create artifacts that are meant to share knowledge with everyone in the world”
26.07.2025 14:46
Taking off for Vienna #ACL2025! 🇦🇹 Excited to talk with people about transparent reasoning, multimodality, and fact verification. Stop by our multimodal RAG workshop on Friday
Please reach out if you want to grab coffee!
The #ACL2025 #ACL2025NLP feed is up and running! It matches both hashtags and any posts from or mentions of @aclmeeting.bsky.social
Pin it to your home and enjoy!
bsky.app/profile/did:...
Juxtastat DAU update! Crazy how we've been >1000 every day for over a year now!
Thank you all for all your support, and make sure to keep spreading the word!
🥳 ❤️ The ACL 2025 Proceedings are live on the ACL Anthology 🥰!
We’re thrilled to pre-celebrate the incredible research ✨ that will be presented starting Monday next week in Vienna 🇦🇹!
Start exploring aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology
This New Yorker piece is the most hopeful I've felt about the world in a long time.
I had no idea solar was booming like this. And if you live in the same world as me, dominated by oil & gas guys maintaining that solar and wind are inefficient gimmicks, you might not've known some of this either.
When LLMs solve tasks with a mid-to-low resource input or target language, their output quality is poor. We know that. But can we put our finger on what breaks inside the LLM? We introduce the translation barrier hypothesis for failed multilingual generation with LLMs. arxiv.org/abs/2506.22724
04.07.2025 17:04
I wrote something up for AI people who want to get into bluesky and either couldn't assemble an exciting feed or gave up doomscrolling when their Following feed switched to talking politics 24/7.
26.04.2025 01:31
Explore Wikipedia through a data map. Pages are grouped by semantic similarity, for topic clusters.
Hover to see details, zoom to explore more fine-grained topics, click to go to a page. Search by page name to find interesting starting points for exploration.
lmcinnes.github.io/datamapplot_...
William Walden, Kathryn Ricci, Miriam Wanner, Zhengping Jiang, Chandler May, Rongkun Zhou, Benjamin Van Durme
How Grounded is Wikipedia? A Study on Structured Evidential Support
https://arxiv.org/abs/2506.12637
What happens when an LLM is asked to use information that contradicts its knowledge? We explore knowledge conflict in a new preprint.
TLDR: Performance drops, and this could affect the overall performance of LLMs in model-based evaluation. 🧵⬇️ 1/8
#NLProc #LLM #AIResearch
Learn about the groundbreaking work being presented by JHU researchers at @cvprconference.bsky.social’s #CVPR2025! Check out the full list here www.cs.jhu.edu/news/johns-h... or browse through the thread below! 🧵 (1/14)
10.06.2025 19:45
We know that speech LID systems flunk on accented speech. But why? And what can we do about it?
Our work arxiv.org/abs/2506.00628 (Interspeech '25) finds that *accent-language confusion* is an important culprit, ties it to the length of feature that the model relies on, and proposes a fix.
image of nyc
Excited to announce that I'm working on a project at AWS in New York this summer! Reach out if you're in the area and want to grab coffee
12.05.2025 19:00
@npr.org for every minute spent talking to a non-autistic person about autistic people's needs, you should be giving at least 2x that amount to actually autistic people – & not just in written stories, but on-air.
Apply said rubric to any other group under attack right now for their lived realities.