Valentina Pyatkin

@valentinapy.bsky.social

Postdoc in AI at the Allen Institute for AI & the University of Washington. 🌐 https://valentinapy.github.io

5,747 Followers  |  584 Following  |  77 Posts  |  Joined: 08.09.2023

Latest posts by valentinapy.bsky.social on Bluesky

Post image

Excited to have the Big Picture workshop back for another iteration this year at @aclmeeting.bsky.social
Submit your big picture ideas, consolidation work, phd thesis distillation, etc. by March 5th!

www.bigpictureworkshop.com
w/ Allyson Ettinger, @norakassner.bsky.social, @sebruder.bsky.social

03.02.2026 14:44 - 👍 9    🔁 3    💬 0    📌 0

🚨 New Study 🚨

@arxiv.bsky.social has recently decided to prohibit any 'position' paper from being submitted to its CS servers.
Why? Because of the "AI slop", and allegedly higher ratios of LLM-generated content in review papers, compared to non-review papers.

29.01.2026 14:00 - 👍 29    🔁 9    💬 2    📌 2
Looking forward to learning new things in 2026?
We've got you covered with 17 amazing talks exploring how AI reshapes the way we work!

Get your conference pass
380.-
available until January 31

Front Conference Zurich is coming up soon! On Friday, February 27, an amazing group of speakers will explore how AI is reshaping the way we work, from creativity and product design to engineering and collaboration

🀩 Our lineup: frontconference.com/schedule
🎟️ Your ticket: frontconference.com/tickets

17.01.2026 11:36 β€” πŸ‘ 2    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Post image Post image Post image

We're at #NeurIPS2025 with papers, posters, workshops, fireside chats, & talks across the conference. Come learn about our latest research + see live demos!

02.12.2025 18:05 - 👍 8    🔁 2    💬 1    📌 0
Preview
Olmo 3 is a fully open LLM
Olmo is the LLM series from Ai2, the Allen Institute for AI. Unlike most open weight models these are notable for including the full training data, training process and checkpoints along …

Olmo 3 is notable as a "fully open" LLM - all of the training data is published, plus complete details on how the training process was run. I tried out the 32B thinking model and the 7B instruct models, + thoughts on why transparent training data is so important simonwillison.net/2025/Nov/22/...

23.11.2025 00:17 - 👍 191    🔁 33    💬 2    📌 3

Olmo 3 is out! 🤩
I am particularly excited about Olmo 3 models' precise instruction following abilities and their good generalization performance on IFBench!
Lucky to have been a part of the Olmo journey for three iterations already.

20.11.2025 15:12 - 👍 24    🔁 3    💬 0    📌 0
Post image

Happy Halloween!

31.10.2025 10:48 - 👍 16    🔁 2    💬 0    📌 0

There's plenty of evidence for political bias in LLMs, but very few evals reflect realistic LLM use cases, which is where bias actually matters.

IssueBench, our attempt to fix this, is accepted at TACL, and I will be at #EMNLP2025 next week to talk about it!

New results 🧵

29.10.2025 16:11 - 👍 32    🔁 11    💬 1    📌 0
Post image

I will be giving a talk at @eth-ai-center.bsky.social next week, on RLVR for verifiable instruction following, generalization, and reasoning! 📢
Join if you are in Zurich and interested in hearing about IFBench and our latest Olmo and Tülu works at @ai2.bsky.social

27.10.2025 14:22 - 👍 6    🔁 0    💬 0    📌 0
Title page of the paper: WUGNECTIVES: Novel Entity Inferences of Language Models from Discourse Connectives, with two figures at the bottom

Left: Our figure 1 -- comparing previous work, which usually predicted the connective given the arguments (grounded in the world); our work flips this premise by getting models to use their knowledge of connectives to predict something about the world.

Right: Our main results across 7 types of connective senses. Models are especially bad at Concession connectives.

"Although I hate leafy vegetables, I prefer daxes to blickets." Can you tell if daxes are leafy vegetables? LMs can't seem to! 📷

We investigate if LMs capture these inferences from connectives when they cannot rely on world knowledge.

New paper w/ Daniel, Will, @jessyjli.bsky.social

16.10.2025 15:27 - 👍 32    🔁 10    💬 2    📌 2
Post image

Next up we had @tsvetshop's Yulia Tsvetkov talk about ethics, safety, and reliability of LLMs in the health domain.

10.10.2025 16:09 - 👍 2    🔁 0    💬 0    📌 0
Post image

💡 We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and @geomblog.bsky.social) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."

10.10.2025 14:31 - 👍 15    🔁 4    💬 1    📌 0
Preview
Third Workshop on Socially Responsible Language Modelling Research (SoLaR) 2025
COLM 2025 in-person Workshop, October 10th at the Palais des Congrès in Montreal, Canada

Hi #COLM2025! 🇨🇦 I will be presenting a talk on the importance of community-driven LLM evaluations based on an opinion abstract I wrote with Jo Kavishe, @victorojewale.bsky.social and @geomblog.bsky.social tomorrow at 9:30am in 524b for solar-colm.github.io

Hope to see you there!

09.10.2025 19:32 - 👍 9    🔁 6    💬 1    📌 0

Now accepted to #neurips25 datasets & benchmarks!
See you in San Diego! 🥳

20.09.2025 06:56 - 👍 9    🔁 0    💬 0    📌 0
Post image Post image

🚀 Can open science beat closed AI? Tülu 3 makes a powerful case. In our new #WiAIRpodcast, we speak with Valentina Pyatkin (@valentinapy.bsky.social) of @ai2.bsky.social and the University of Washington about a fully open post-training recipe: models, data, code, evals, and infra. #WomenInAI 1/8 🧵

19.09.2025 16:13 - 👍 6    🔁 1    💬 1    📌 0
Post image

"LLM Post-training: Open Science That Powers Progress" 🎙️

On Sept 17, the #WiAIRpodcast speaks with @valentinapy.bsky.social (@ai2.bsky.social & University of Washington) about open science, post-training, mentorship, and visibility

#WiAIR #NLProc

12.09.2025 15:00 - 👍 6    🔁 1    💬 0    📌 0
Post image

With fresh support of $75M from NSF and $77M from NVIDIA, we're set to scale our open model ecosystem, bolster the infrastructure behind it, and fast-track reproducible AI research to unlock the next wave of scientific discovery. 💡

14.08.2025 12:16 - 👍 45    🔁 7    💬 1    📌 7
Post image

On my way to Oxford: Looking forward to speaking at OxML 2025

10.08.2025 08:09 - 👍 8    🔁 0    💬 0    📌 0
Preview
Opinion Abstracts: SoLaR Workshop @ COLM 2025
Beyond the two main tracks we are inviting short opinion abstracts (500 words maximum) on new perspectives for what socially responsible language modeling research might look like. The SoLaR organizi...

The submission deadline is August 26 2025 (AoE time), and decisions will be sent out on September 2, 2025.

Submit your abstracts here:
docs.google.com/forms/d/e/1F...

08.08.2025 12:40 - 👍 0    🔁 0    💬 0    📌 0

🔈 For the SoLaR workshop @COLM_conf we are soliciting opinion abstracts to encourage new perspectives and opinions on responsible language modeling, 1-2 of which will be selected to be presented at the workshop.

Please use the google form below to submit your opinion abstract ⬇️

08.08.2025 12:40 - 👍 8    🔁 4    💬 1    📌 0

I had a lot of fun contemplating memorization questions at the @l2m2workshop.bsky.social panel yesterday together with Niloofar Mireshghallah and Reza Shokri, moderated by @pietrolesci.bsky.social who did a fantastic job!
#ACL2025

02.08.2025 15:04 - 👍 12    🔁 2    💬 1    📌 1

I'll be at #ACL2025 🇦🇹!!
Would love to chat about all things pragmatics 🧠, redefining "helpfulness" 🤔 and enabling better cross-cultural capabilities 🗺️ 🫶

Presenting our work on culturally offensive nonverbal gestures 👇
🕛 Wed @ Poster Session 4
📍 Hall 4/5, 11:00-12:30

26.07.2025 02:46 - 👍 4    🔁 1    💬 0    📌 0

I did! very very good!!

19.07.2025 05:19 - 👍 1    🔁 0    💬 0    📌 0
Post image

🔥 tokenization panel!

18.07.2025 22:45 - 👍 7    🔁 0    💬 0    📌 0

why is vancouver sushi so good? 🤤 (vancouver food in general actually)

18.07.2025 20:27 - 👍 9    🔁 0    💬 3    📌 0
Post image Post image

This week is #ICML in Vancouver, and a number of our researchers are participating. Here's the full list of Ai2's conference engagements. We look forward to connecting with fellow attendees. 👋

14.07.2025 19:30 - 👍 3    🔁 2    💬 0    📌 0

Let me know if you want to meet up! Always happy to chat!

11.07.2025 14:09 - 👍 0    🔁 0    💬 0    📌 0
ICML Poster: Diverging Preferences: When do Annotators Disagree and do Models Know? ICML 2025

07/17, Poster: Diverging Preferences: When do Annotators Disagree and do Models Know? icml.cc/virtual/2025...

07/16, Poster: SafetyAnalyst: Interpretable, transparent, and steerable safety moderation for AI behavior
icml.cc/virtual/2025...

11.07.2025 14:09 - 👍 3    🔁 0    💬 1    📌 0

I'll be at ICML in Vancouver next week! #ICML2025
You can find me at the following:

- giving an invited talk at the "Models of Human Feedback for AI Alignment" workshop

- giving an invited talk at the "AI for Math" workshop

I'll also present these two papers ⬇️

11.07.2025 14:09 - 👍 9    🔁 2    💬 1    📌 0

In Geneva 🇨🇭 to attend the International Open-Source LLM Builders Summit and present OLMo and Tülu!

06.07.2025 17:23 - 👍 10    🔁 0    💬 0    📌 0

@valentinapy is following 20 prominent accounts