Kate Sanders's Avatar

Kate Sanders

@kesnet50.bsky.social

LLM post-training, reasoning, and multimodality. Ph.D. @ JHU CLSP, incoming researcher at Microsoft Copilot Tuning. #NLProc https://katesanders9.github.io/

807 Followers  |  395 Following  |  23 Posts  |  Joined: 18.11.2024
Posts Following

Posts by Kate Sanders (@kesnet50.bsky.social)

This year's shared task allows you to submit for the retrieval track, generation track, or full RAG track on a challenging new collection of unedited ("raw") videos.

Research Papers (Apr. 1)
Shared Task (Apr. 20)

17.02.2026 05:11 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
MAGMaR Workshop MAGMaR

πŸ“Ή + 🧠 + πŸ“ = πŸ”₯ First call for MAGMaR 2026, the 2nd workshop on multimodal augmented generation via multimodal retrieval! If #RAG isn't hard enough for you, try multilingually and multimodally. Collocated with
@aclmeeting
in San Diego in July.

nlp.jhu.edu/magmar/

17.02.2026 05:11 β€” πŸ‘ 1    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Bonsai poster

Bonsai poster

Will be presenting Bonsai on Thursday, 1/22 at the morning talks session and noon poster session:
arxiv.org/pdf/2504.03640

16.01.2026 16:24 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I will be at AAAI 2026 in Singapore next week! ✈️ I'm looking forward to seeing everyone's cool projects and discussing reasoning, post-training, and multimodality. Please reach out if you will be there and would like to connect.

16.01.2026 16:21 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Bye bye 2025, a divisive year,
with many divisors: 3, 5, 9, 15, 25, 27, 45, 75, 81, 135, 225, 405, 675.

Happy 2026 = 2*1013
Just two primes

Cheers!

31.12.2025 16:32 β€” πŸ‘ 22    πŸ” 6    πŸ’¬ 1    πŸ“Œ 0
Eleven, eleven, eleven, eleven..

Eleven, eleven, eleven, eleven..

Thinking about my favorite amp today πŸ˜”β€οΈ

15.12.2025 16:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - kr-ramesh/synthtexteval: SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data Across Domains (EMNLP 2025 System Demonstration) SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data Across Domains (EMNLP 2025 System Demonstration) - kr-ramesh/synthtexteval

πŸš€ SynthTextEval, our open-source toolkit for generating and evaluating synthetic text data for high-stakes domains, will be featured at EMNLP 2025 as a system demonstration!

GitHub: github.com/kr-ramesh/sy...
Paper πŸ“: aclanthology.org/2025.emnlp-d...

#EMNLP2025 #EMNLP #SyntheticData

07.11.2025 00:53 β€” πŸ‘ 13    πŸ” 3    πŸ’¬ 1    πŸ“Œ 2

In honor of some new people coming from AI twitter, I finally updated my post to recommend For You over Discover.

16.10.2025 11:43 β€” πŸ‘ 33    πŸ” 5    πŸ’¬ 2    πŸ“Œ 0

A company that believed it was in the verge of AGI or ASI wouldn’t capitulate to the government because it wouldn’t care about government contracts. They would soon BE the economy and the government would soon be capitulating to them.

11.10.2025 20:57 β€” πŸ‘ 20    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Astronaut meme: "Wait, it's all perception?" "Always has been"

Astronaut meme: "Wait, it's all perception?" "Always has been"

09.10.2025 20:04 β€” πŸ‘ 34    πŸ” 6    πŸ’¬ 1    πŸ“Œ 2
Post image

Keynote spotlight #4: the second day of COLM will close with @ghadfield.bsky.social from JHU talking about human society alignment, and lessons for AI alignment

22.09.2025 14:23 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 0    πŸ“Œ 2
AAAI/ACM SIGAI Doctoral Dissertation Award - AAAI The AAAI/ACM SIGAI Doctoral Dissertation Award recognizes and encourages superior research and writing by doctoral candidates in AI.

Congratulations to Alane Suhr '22, a #CornellTech Ph.D. #alumni advised by associate professor Yoav Artzi, for receiving the prestigious 2022 @aaai.org / @acmsigai.bsky.social Doctoral Dissertation Award!

Read more about the award here: aaai.org/about-aaai/a...

@yoavartzi.com

19.09.2025 18:38 β€” πŸ‘ 8    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Line chart showing that there's been a rapid escalation in how quickly the world installs a gigawatt of solar power capacity.

Line chart showing that there's been a rapid escalation in how quickly the world installs a gigawatt of solar power capacity.

Time for the world to install a gigawatt of solar power capacity
2004: A year
2010: ~ a month
2015: ~ a week
Now: A day
ourworldindata.org/data-insight... πŸ§ͺ

15.09.2025 05:01 β€” πŸ‘ 2455    πŸ” 902    πŸ’¬ 33    πŸ“Œ 87
Post image Post image Post image

🚨 Urban Stats 28.0.0 🚨

The mapper is now completely redesigned by me and @spudwaffle.bsky.social, allowing for much prettier looking maps and way more customization alongside significantly more options for geographies!

See below for some of the examples of the maps you can create!

31.08.2025 18:58 β€” πŸ‘ 35    πŸ” 7    πŸ’¬ 2    πŸ“Œ 3
Preview
Humans Perceive Wrong Narratives from AI Reasoning Texts A new generation of AI models generates step-by-step reasoning text before producing an answer. This text appears to offer a human-readable window into their computation process, and is increasingly r...

When reading AI reasoning text (aka CoT), we (humans) form a narrative about the underlying computation process, which we take as a transparent explanation of model behavior. But what if our narratives are wrong? We measure that and find it usually is.

Now on arXiv: arxiv.org/abs/2508.16599

27.08.2025 21:30 β€” πŸ‘ 84    πŸ” 22    πŸ’¬ 4    πŸ“Œ 2
Preview
jackzhang/JBDistill-Bench Β· Datasets at Hugging Face

Paper: arxiv.org/pdf/2505.22037
πŸ”— Project page: aka.ms/jailbreak-d...
πŸ“Š Dataset: huggingface.co/datasets/ja...

26.08.2025 21:15 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

So, what's the future of AI safety benchmarks? Jack's solution is "renewable benchmarks" that allows us to refresh and expand benchmarks with a single click!!
x.com/jackjingyuz...

26.08.2025 21:15 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
From Basic Affordances to Symbolic Thought: A Computational Phylogenesis of Biological Intelligence What is it about human brains that allows us to reason symbolically whereas most other animals cannot? There is evidence that dynamic binding, the ability to combine neurons into groups on the fly, is...

In our forthcoming paper, John Hummel and I ask what it would mean for a neural computing architecture such as a brain to implement a symbol system, and the related question of what makes it difficult for them to do so, with an eye toward the differences between humans, animals, and ANNs.

22.08.2025 18:25 β€” πŸ‘ 35    πŸ” 13    πŸ’¬ 1    πŸ“Œ 2
Post image

This paper is making the rounds: arxiv.org/abs/2506.21734

A tiny (27M) brain-inspired model trained just on 1000 samples outperforming o3-mini-high on reasoning tasks.

#MLSky πŸ§ πŸ€–

03.08.2025 02:01 β€” πŸ‘ 131    πŸ” 26    πŸ’¬ 4    πŸ“Œ 1
ScaleOPT

Interested in large-scale GPU optimization? Interested in how modern neural networks are being deployed to solve classical optimization problems?

Writing a paper on these topics? Submit to the ScaleOPT workshop at NeurIPS!

www.cvxgrp.org/scaleopt/#su...

16.07.2025 00:27 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

I'm recruiting MLEs @ #ACL2025!

Reach out if you know folks interested in legal NLP, structured prediction, and full-time at a startup environment in NYC

I'll also always chat about:
β€’ population-level inference on corpora
β€’ broad-coverage semantics
β€’ which cafΓ© has the best Sachertorte in Vienna

27.07.2025 21:48 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

My students and I are presenting three papers on Monday at #ACL2025 and this thread will recap them (including their videos).

28.07.2025 08:35 β€” πŸ‘ 7    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

β€œWikipedia is this economic anomaly. In many ways, it’s sort of magical that people will just volunteer without explicit economic incentives to create artifacts that are meant to share knowledge with everyone in the world”

26.07.2025 14:46 β€” πŸ‘ 3034    πŸ” 490    πŸ’¬ 87    πŸ“Œ 202

Taking off for Vienna #ACL2025! πŸ‡¦πŸ‡Ή Excited to talk with people about transparent reasoning, multimodality, and fact verification. Stop by our multimodal RAG workshop on Friday πŸ”₯πŸ”₯πŸ”₯

Please reach out if you want to grab coffee!

26.07.2025 18:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The #ACL2025 #ACL2025NLP feed is up and running! It matches both hashtags and any posts from or mentions of @aclmeeting.bsky.social

Pin it to your home πŸ“Œ and enjoy!

bsky.app/profile/did:...

17.07.2025 11:15 β€” πŸ‘ 48    πŸ” 14    πŸ’¬ 2    πŸ“Œ 0
Post image

Juxtastat DAU update! Crazy how we've been >1000 every day for over a year now!

Thank you all for all your support, and make sure to keep spreading the word!

25.07.2025 19:18 β€” πŸ‘ 16    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Annual Meeting of the Association for Computational Linguistics (2025) - ACL Anthology pdf bibProceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)Wanxiang Che | Joyce Nabende | Ekaterina Shutova | Mohammad Taher Pilehvar

πŸ₯³ πŸŽ‰ ❀️ The ACL 2025 Proceedings are live on the ACL Anthology πŸ₯° !
We’re thrilled to pre-celebrate the incredible research πŸ“š ✨ that will be presented starting Monday next week in Vienna πŸ‡¦πŸ‡Ή !
Start exploring πŸ‘‰ aclanthology.org/events/acl-2...
#NLProc #ACL2025NLP #ACLAnthology

22.07.2025 20:00 β€” πŸ‘ 57    πŸ” 19    πŸ’¬ 0    πŸ“Œ 1

This New Yorker piece is the most hopeful I've felt about the world in a long time.

I had no idea solar was booming like this. And if you live in the same world as me, dominated by oil & gas guys maintaining that solar and wind are inefficient gimmicks, you might not've known some of this either.

10.07.2025 16:27 β€” πŸ‘ 794    πŸ” 406    πŸ’¬ 14    πŸ“Œ 3
Post image

πŸ”ˆWhen LLMs solve tasks with a mid-to-low resource input or target language, their output quality is poor. We know that. But can we put our finger on what breaks inside the LLM? We introduce the πŸ’₯ translation barrier hypothesis πŸ’₯ for failed multilingual generation with LLMs. arxiv.org/abs/2506.22724

04.07.2025 17:04 β€” πŸ‘ 26    πŸ” 7    πŸ’¬ 2    πŸ“Œ 1
Preview
The AI Researcher's Guide to a Non-Boring Bluesky Feed | Naomi Saphra How to migrate to bsky without a boring feed.

I wrote something up for AI people who want to get into bluesky and either couldn't assemble an exciting feed or gave up doomscrolling when their Following feed switched to talking politics 24/7.

26.04.2025 01:31 β€” πŸ‘ 341    πŸ” 92    πŸ’¬ 23    πŸ“Œ 21