Lucy Li

@lucy3.bsky.social

Postdoc at UW NLP 🏔️. #NLProc, computational social science, cultural analytics, responsible AI. she/her. Previously at Berkeley, Ai2, MSR, Stanford. Incoming assistant prof at Wisconsin CS. lucy3.github.io

5,780 Followers  |  658 Following  |  352 Posts  |  Joined: 17.05.2023

Posts by Lucy Li (@lucy3.bsky.social)

Abstract submissions close on March 3rd!

We are also extending a ✨ call for mentored reviewers ✨: if you advise excellent graduate or postdoctoral researchers, you are welcome to recommend them to review for IC2S2 2026. Email IC2S2@uvm.edu to nominate mentored reviewers (or faculty colleagues).

23.02.2026 19:39 - 👍 10    🔁 10    💬 1    📌 2

CORRECTION, Claude Code launched in February 2025, suggesting a roughly 13% increase above expectations.

26.02.2026 00:47 - 👍 4    🔁 1    💬 1    📌 2

I remember the time to time muttering!! 😮 Curious, Chinese-speaking culture in mainland China or US or elsewhere??

25.02.2026 23:05 - 👍 1    🔁 0    💬 1    📌 0

Agents of Chaos -- what are autonomous OpenClaw agents up to? How do they interact with each other? Read our investigation of OpenClaw at
researchgate.net/publication/...
And an interactive website agentsofchaos.baulab.info
@davidbau.bsky.social @natalieshapira.bsky.social @openclaw-x.bsky.social

24.02.2026 15:04 - 👍 6    🔁 3    💬 0    📌 0

I'm hiring a postdoc at @cmu.edu (w/ far.ai & @dgrand.bsky.social + @gordpennycook.bsky.social)!

How do LLMs shape human beliefs, and what do we do about it? AI safety meets behavioral science.

Open to technical and social science backgrounds.

23.02.2026 18:46 - 👍 42    🔁 27    💬 1    📌 3
Anthropic Education Report: The AI Fluency Index Anthropic's AI Fluency Index measures 11 observable behaviors across thousands of Claude.ai conversations to understand how people develop AI collaboration skills.

New research: The AI Fluency Index.

We tracked 11 behaviors across thousands of Claude.ai conversations (for example, how often people iterate and refine their work with Claude) to measure how well people collaborate with AI.

Read more: https://www.anthropic.com/research/AI-fluency-index

23.02.2026 15:06 - 👍 15    🔁 1    💬 0    📌 3

We've alllllmost gotten all the Jan26 ARR reviews in, but I'm still trying to track down new emergency reviewers for papers on the following topics:
1) agents
2) jailbreaking
3) coding
4) RL
5) reasoning
6) LLM for finance
7) AMR
8) alignment
If you can review any (in next 24-48h) please DM me 🙏🙏🙏

20.02.2026 04:39 - 👍 3    🔁 9    💬 0    📌 0

I was taught that to have a great job talk narrative, you really only need ~3 high quality papers

20.02.2026 01:54 - 👍 5    🔁 0    💬 2    📌 0

How horrible to be a CS grad student under pressure to submit multiple first author papers to every conference deadline, whether they feel ready or not. This serves no one’s best interests in the long run (science included). But lots of students appear to be getting advice that it’s necessary to compete

20.02.2026 01:03 - 👍 71    🔁 8    💬 1    📌 2
Matching sounds to shapes: Evidence of the bouba-kiki effect in naïve baby chicks Humans across multiple languages spontaneously associate the nonwords “kiki” and “bouba” with spiky and round shapes, respectively, a phenomenon named the bouba-kiki effect. To explore the origin of t...

“Humans across multiple languages spontaneously associate the nonwords kiki & bouba with spiky & round shapes, respectively...We tested the bouba-kiki effect in baby chickens. Similar to humans, they spontaneously chose a spiky shape when hearing a kiki sound & a round shape when hearing a bouba.”😲🧪

19.02.2026 19:20 - 👍 323    🔁 118    💬 13    📌 40

I have a small project that is taking me outside of academia to dip into industry, just ever so briefly.

I engage a lot with AI. I was not at all prepared for how industry is using it. Not. at. all.

This brief little window is definitely helping me better frame my teaching in this new world.

17.02.2026 21:28 - 👍 49    🔁 6    💬 8    📌 1

My contribution to the discourse, which I've said before and will say again: DH isn't over. DH has won. 1/

17.02.2026 15:46 - 👍 72    🔁 23    💬 5    📌 11
Bellwether Postdoctoral Scholar - School of Information University of California, Berkeley is hiring. Apply now!

Postdoc positions at UC Berkeley, including with the fabulous Cultural Analytics group: aprecruit.berkeley.edu/JPF05222

16.02.2026 19:10 - 👍 40    🔁 26    💬 2    📌 1

I asked Gemini to "defend itself," and say what the big benefits of LLMs have been since 2020:

"Since 2020, the volume of digital noise has increased, and LLMs have provided the first reliable shield against it."

15.02.2026 15:18 - 👍 18    🔁 1    💬 3    📌 1
The evolution of OpenAI’s mission statement As a USA 501(c)(3) the OpenAI non-profit has to file a tax return each year with the IRS. One of the required fields on that tax return is to “Briefly …

I had some fun pulling OpenAI's mission statement out of their IRS tax filings from 2016 to 2024, loading them into a git repo with fake commit dates and then taking a look at the diffs simonwillison.net/2026/Feb/13/...

13.02.2026 23:40 - 👍 240    🔁 45    💬 7    📌 3

I doubt it. I would read the author's piece very literally. He just put this preprint on arxiv: arxiv.org/pdf/2601.19062 I think some (and my read, this includes the author) are realizing that much more than AI is disempowering us. Many of us have known this for a very long time, of course.

12.02.2026 05:32 - 👍 3    🔁 3    💬 1    📌 0

I wrote a short article on AI Model Evaluation for the Open Encyclopedia of Cognitive Science 📕👇

Hope this is helpful for anyone who wants a super broad, beginner-friendly intro to the topic!

Thanks @mcxfrank.bsky.social and @asifamajid.bsky.social for this amazing initiative!

12.02.2026 22:22 - 👍 47    🔁 19    💬 0    📌 1

Well done @zdenekkasner.bsky.social et al!

LLMs as Span Annotators: A Comparative Study of LLMs and Humans is accepted to multilingual-multicultural-evaluation.github.io 🎉

See paper arxiv.org/abs/2504.08697

29.01.2026 15:35 - 👍 8    🔁 2    💬 2    📌 1

If you think labeling text spans with LLMs is easy, you probably have not tried it yourself (we have! 🙃).

Any method you can think of – be it tagging, matching, or indexing – has flaws.

In our new preprint, we tested them all 💪 We also proposed how to improve one of them.

arxiv.org/abs/2601.16946

29.01.2026 14:20 - 👍 40    🔁 6    💬 2    📌 3

I am looking for 2 emergency reviewers for the ARR Ethics, Bias & Fairness track. Please DM me if you are available 🙏

10.02.2026 09:27 - 👍 6    🔁 6    💬 0    📌 0
Screen shot of title page of a preprint.
Title: Should generative AI be used in reflexive qualitative research?
Authors: Elida Izani Ibrahim, Laura K. Nelson, and Andrea Voyer

Recent publications arguing against the use of genAI in reflexive qual research inspired us (Elida Ibrahim and @andreavoyer.bsky.social) to write our own perspective. Not to convince anyone to use genAI but for those who might be interested and are looking for guidance.

osf.io/preprints/so...

09.02.2026 18:49 - 👍 52    🔁 21    💬 2    📌 0

Bad Bunny's historical advisor is an assistant professor at UW-Madison.

Hell of a flex for your tenure file.

09.02.2026 13:47 - 👍 1607    🔁 263    💬 18    📌 10

Excited to be co-organizing the #CHI2026 workshop on augmented reading interfaces 📚✨ Submissions are open for one more week! We want to know what you're working on!

06.02.2026 20:21 - 👍 9    🔁 2    💬 1    📌 0
First Monday @ 30 | First Monday

Really sad to hear that First Monday is shutting down after 30 years. It was one of the first journals devoted to internet research & fully open access: no fees, no paywalls, and authors retained copyright.

My very first publication was there in 2004. End of an era.

firstmonday.org/ojs/index.ph...

06.02.2026 21:28 - 👍 11    🔁 4    💬 0    📌 1

PhD admissions visits/open houses are starting to happen, and I got a comment on an old Reddit post where I was offering advice, and realized that it's actually really good advice. So here it is! (And this applies whether you've already been admitted to the program or not.) 🧵

05.02.2026 17:26 - 👍 30    🔁 8    💬 1    📌 3

I've always been a fan of what the Allen Institute is doing. New in Nature: OpenScholar, an 8B RAG model for scientific literature, outperforms GPT-4o by 6% on correctness. Experts preferred its answers over human-written ones 51%-70% of the time. www.nature.com/articles/s41... 1/3 🧵

05.02.2026 13:24 - 👍 28    🔁 6    💬 1    📌 0

I have a google doc containing a list of papers on this exact topic 😆

05.02.2026 18:29 - 👍 2    🔁 0    💬 0    📌 0
Title card of our paper: "Which course? Discourse! Teaching Discourse and Generation in the Era of LLMs" by Junyi Jessy Li, Yang Janet Liu, Valentina Pyatkin, and William Sheffield.

Nearly 2 years ago, @jessyjli.bsky.social, @janetlauyeung.bsky.social, @valentinapy.bsky.social, and I decided that it's time to bring discourse structure to the center of NLP teaching.

05.02.2026 03:53 - 👍 11    🔁 3    💬 2    📌 0

When I go to the dentist / doctor, they ask me if I'm in school or going to school and my answer has always been yes

02.02.2026 21:54 - 👍 17    🔁 0    💬 0    📌 1

🎭 How do LLMs (mis)represent culture?
🧮 How often?
🧠 Misrepresentations = missing knowledge? spoiler: NO!

At #CHI2026 we are bringing ✨TALES✨ a participatory evaluation of cultural (mis)reps & knowledge in multilingual LLM-stories for India

📜 arxiv.org/abs/2511.21322

1/10

02.02.2026 21:38 - 👍 45    🔁 21    💬 1    📌 2