Laure Thompson's Avatar

Laure Thompson

@laurejt.bsky.social

Research software engineer @princetoncdh.bsky.social. natural language processing, machine learning, cultural analytics https://laurejt.github.io/

527 Followers  |  341 Following  |  30 Posts  |  Joined: 06.08.2023  |  1.9788

Latest posts by laurejt.bsky.social on Bluesky

πŸ“£ New preprint! We know humans are biased against AI-creativity. But what about LLMs, now often judging creativity in various contexts? Do they replicate, transform, or amplify this bias? We tested it. Turns out: AI is 2.5X more biased against its own work than humans. arxiv.org/pdf/2510.08831 🧡

13.10.2025 13:56 β€” πŸ‘ 67    πŸ” 25    πŸ’¬ 3    πŸ“Œ 6
Grad App Aid β€” Queer in AI

We are launching our Graduate School Application Financial Aid Program (www.queerinai.com/grad-app-aid) for 2025-2026. We’ll give up to $750 per person to LGBTQIA+ STEM scholars applying to graduate programs. Apply at openreview.net/group?id=Que.... 1/5

09.10.2025 00:37 β€” πŸ‘ 7    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0
Preview
Collaborative Research Supports groups of two or more scholars engaging in significant and sustained research in the humanities.

NEH Collaborative research grant program now explicitly prohibits digital projects. Output should be a book or journal issue. FFS
www.neh.gov/grants/resea...

25.09.2025 17:06 β€” πŸ‘ 28    πŸ” 17    πŸ’¬ 9    πŸ“Œ 12
Preview
Software Engineer (Full-stack) - ITHAKA ITHAKA’s mission is to expand access to knowledge and education around the world. Our services β€” Artstor, JSTOR, Portico, and Ithaka S+R β€” enable people everywhere to learn, to grow, and to overcome b...

I'm hiring a mid-level full-stack SWE! Our team at @jstor.bsky.social Labs is looking for yet another product-minded engineer to join our team. We come from all kinds of backgrounds, tech and non-tech alike.

Please apply or send to your awesome friends, and DM me with ?s: grnh.se/19o370345us

22.09.2025 19:48 β€” πŸ‘ 8    πŸ” 12    πŸ’¬ 0    πŸ“Œ 0
DHQ: Digital Humanities Quarterly: News

Excited to be co-editing a special issue of @dhquarterly.bsky.social on Artificial Intelligence for Digital Humanities: Research problems and critical approaches
dhq.digitalhumanities.org/news/news.html

We're inviting abstracts now - please feel free to reach out with any questions!

09.09.2025 20:28 β€” πŸ‘ 21    πŸ” 10    πŸ’¬ 0    πŸ“Œ 1

@greenleejw.bsky.social you've made some maps for novels, right?

28.08.2025 19:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Just curious: are you primarily looking for maps that were part of the first publication? Or any that developed in subsequent editions or when a series/world was more developed?

28.08.2025 18:44 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Adding my reply to the main thread since I posted to @dmimno.bsky.social 's quoted repost:

A recent, funky map I came across was in Christelle Dabos's The Memory of Babel.

28.08.2025 18:40 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A recent, funky map I came across was in Christelle Dabos's The Memory of Babel.

28.08.2025 18:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Please share! We're trying to crowd-source a dataset of post-1929 novels with maps.

28.08.2025 14:55 β€” πŸ‘ 45    πŸ” 31    πŸ’¬ 9    πŸ“Œ 1

Check out Rebecca Hicke @dmimno.bsky.social piece on LLMs! β€œLanguage models have the ability to identify the characteristics of much shorter literary passages than was thought feasible with traditional stylometry. We evaluate authorship and genre detection for a new corpus of literary novels…”

19.08.2025 18:17 β€” πŸ‘ 6    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
A dot plot titled '"nthropomorphized Animals in Popular Children's Books (*Animals That Appear in 10+ Books)" showing the proportion of animals depicted with gendered pronouns. Animals toward the left side are more often represented as male (he/him), and those toward the right are more often represented as female (she/her). Birds, ducks, and cats lean female. Bears, monkeys, dogs, elephants, foxes, wolves, and frogs lean male. Each animal is represented by a colorful, illustrated face.

A dot plot titled '"nthropomorphized Animals in Popular Children's Books (*Animals That Appear in 10+ Books)" showing the proportion of animals depicted with gendered pronouns. Animals toward the left side are more often represented as male (he/him), and those toward the right are more often represented as female (she/her). Birds, ducks, and cats lean female. Bears, monkeys, dogs, elephants, foxes, wolves, and frogs lean male. Each animal is represented by a colorful, illustrated face.

Screenshot of Publishers Weekly article titled "The Sneaky Gender Bias in Picture Books: Animal Characters" that includes photo of the author, a woman with brown hair and glasses. Text reads: "Melanie Walsh is an assistant professor in the Information School and an adjunct assistant professor in the English department at the University of Washington. She uses data to analyze contemporary culture, especially literature and publishing. She is currently at work on a book, When Postwar American Fiction Went Viral: Protest, Profit, and Popular Readers in the 21st Century, which follows the surprising social media afterlives of five iconic American authors. Here she shares her investigations into the subtle gender imbalance often at play in picture books featuring animal characters.

I recently published a data analysis with The Pudding, a digital publication known for data-driven storytelling, about animal characters in picture books. We read approximately 300 popular English-language picture books from the past 70+ years and noted the gender of any anthropomorphized animal character that was important to the story.

We found that male animal characters were twice as common as female characters across all the books. Some strong animal stereotypes also emerged: frogs and dogs were boys; birds and cats were girls. Even more surprising, according to our data: this disparity is not obviously improving, even over the last 25 years."

Screenshot of Publishers Weekly article titled "The Sneaky Gender Bias in Picture Books: Animal Characters" that includes photo of the author, a woman with brown hair and glasses. Text reads: "Melanie Walsh is an assistant professor in the Information School and an adjunct assistant professor in the English department at the University of Washington. She uses data to analyze contemporary culture, especially literature and publishing. She is currently at work on a book, When Postwar American Fiction Went Viral: Protest, Profit, and Popular Readers in the 21st Century, which follows the surprising social media afterlives of five iconic American authors. Here she shares her investigations into the subtle gender imbalance often at play in picture books featuring animal characters. I recently published a data analysis with The Pudding, a digital publication known for data-driven storytelling, about animal characters in picture books. We read approximately 300 popular English-language picture books from the past 70+ years and noted the gender of any anthropomorphized animal character that was important to the story. We found that male animal characters were twice as common as female characters across all the books. Some strong animal stereotypes also emerged: frogs and dogs were boys; birds and cats were girls. Even more surprising, according to our data: this disparity is not obviously improving, even over the last 25 years."

For PW, I wrote about the persistent gender gap in fictional animal charactersβ€”a pattern I noticed while analyzing 100s of picture books with @puddingviz.bsky.social.

It's a more interesting (and pervasive) problem than I first thought.

#kidlit #booksky

πŸ”—: www.publishersweekly.com/pw/by-topic/...

05.08.2025 23:29 β€” πŸ‘ 96    πŸ” 29    πŸ’¬ 6    πŸ“Œ 5
Q&A with Ted Chiang πŸ€–

The @princetoncdh.bsky.social newsletter is always a treat (and you should subscribe if you dont) but having an exclusive interview with Ted Chiang sort of takes the cake!
mailchi.mp/princeton/au...

14.08.2025 13:18 β€” πŸ‘ 13    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0

If you’re an incoming Princeton 🐯 this fall, consider joining FRS159 - a unique chance to dive into technology, culture and African Languages. More about the course in the post below.

12.08.2025 16:30 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I'm so sorry πŸ’”

10.08.2025 16:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congrats!!!

29.07.2025 22:33 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congrats!!!

05.07.2025 13:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Octavia Butler’s Parable of the Sower was published in 1993 and starts in 2024β€”a 31-year leap. Are creators imagining futures that are closer or further away?

Explore a *new* dataset of 2.5k narrative works set in the future, each tagged with its release year and setting.

doi.org/10.18737/552...

25.06.2025 09:01 β€” πŸ‘ 49    πŸ” 26    πŸ’¬ 2    πŸ“Œ 6

I just pulled out my copy of Newton's Principia, a book that was originally written in Latin, and noticed that the translation I am using was completed with support from the National Endowment for the Humanities, which has basically been destroyed

Science depends on the humanities πŸ§ͺ

17.06.2025 15:40 β€” πŸ‘ 1493    πŸ” 348    πŸ’¬ 15    πŸ“Œ 11
Preview
Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge The measurement tasks involved in evaluating generative AI (GenAI) systems lack sufficient scientific rigor, leading to what has been described as "a tangle of sloppy tests [and] apples-to-oranges com...

Check out the camera-ready version of our ICML position paper ("Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge") to learn more!!! arxiv.org/abs/2502.00561

(6/6)

15.06.2025 00:20 β€” πŸ‘ 43    πŸ” 11    πŸ’¬ 3    πŸ“Œ 0
Post image

Today we released Institutional Books 1.0, a 242B token dataset from Harvard Library's collections, refined for accuracy and usability. 🧡

12.06.2025 21:12 β€” πŸ‘ 75    πŸ” 29    πŸ’¬ 9    πŸ“Œ 6

The NEH has been a singular force for bringing people together. In this moment of hate, fear and isolation, the NEH helped us learn about our past and about each other.

Today we are losing almost all of the NEH public servants. We are all in their debt. A national shame. A profound cultural loss.

10.06.2025 15:41 β€” πŸ‘ 142    πŸ” 69    πŸ’¬ 1    πŸ“Œ 2
Post image

My course proposal for Cultural Analytics @berkeleyischool.bsky.social has been approved for Fall 2025! This is the fullest expression of my vision for CA: a radical interdisciplinary experiment for rethinking knowledge production at the intersection of the humanities and machine learning. (1/9)

04.06.2025 17:19 β€” πŸ‘ 30    πŸ” 11    πŸ’¬ 2    πŸ“Œ 0
Screenshot of first page of the FAccT '25 paper "Algorithms in the Stacks: Investigating automated, for-profit diversity audits in public libraries" by Melanie Walsh, Connor Franklin Rey, Chang Ge, Tina Nowak, and Sabina Tomkins.

Abstract: Algorithmic systems are increasingly being adopted by cultural heritage institutions like libraries. In this study, we investigate U.S. public libraries' adoption of one specific automated tool -- automated collection diversity audits -- which we see as an illuminating case study for broader trends. Typically developed and sold by commercial book distributors, automated diversity audits aim to evaluate how well library collections reflect demographic and thematic diversity. We investigate how these audits function, whether library workers find them useful, and what is at stake when sensitive, normative decisions about representation are outsourced to automated commercial systems. Our analysis draws on an anonymous survey of U.S. public librarians (n=99), interviews with 14 librarians, a sample of purchasing records, and vendor documentation. We find that many library workers view these tools as convenient, time-saving solutions for assessing and diversifying collections under real and increasing constraints. Yet at the same time, the audits often flatten complex identities into standardized categories, fail to reflect local community needs, and further entrench libraries' infrastructural dependence on vendors. We conclude with recommendations for improving collection diversity audits and reflect on the broader implications for public libraries operating at the intersection of AI adoption, escalating anti-DEI backlash, and politically motivated defunding.

Screenshot of first page of the FAccT '25 paper "Algorithms in the Stacks: Investigating automated, for-profit diversity audits in public libraries" by Melanie Walsh, Connor Franklin Rey, Chang Ge, Tina Nowak, and Sabina Tomkins. Abstract: Algorithmic systems are increasingly being adopted by cultural heritage institutions like libraries. In this study, we investigate U.S. public libraries' adoption of one specific automated tool -- automated collection diversity audits -- which we see as an illuminating case study for broader trends. Typically developed and sold by commercial book distributors, automated diversity audits aim to evaluate how well library collections reflect demographic and thematic diversity. We investigate how these audits function, whether library workers find them useful, and what is at stake when sensitive, normative decisions about representation are outsourced to automated commercial systems. Our analysis draws on an anonymous survey of U.S. public librarians (n=99), interviews with 14 librarians, a sample of purchasing records, and vendor documentation. We find that many library workers view these tools as convenient, time-saving solutions for assessing and diversifying collections under real and increasing constraints. Yet at the same time, the audits often flatten complex identities into standardized categories, fail to reflect local community needs, and further entrench libraries' infrastructural dependence on vendors. We conclude with recommendations for improving collection diversity audits and reflect on the broader implications for public libraries operating at the intersection of AI adoption, escalating anti-DEI backlash, and politically motivated defunding.

Many libraries now use automated tools to measure diversity in their collections.

We examined how these tools work and whether library workers find them useful. A complex case study of libraries navigating automation, DEI, & shrinking public funding.

Our new FAccT paper: arxiv.org/abs/2505.14890

26.05.2025 12:10 β€” πŸ‘ 81    πŸ” 32    πŸ’¬ 2    πŸ“Œ 1
Preview
College English majors can't read They have one job and they can't do it

Worth a read. Not about AI. Many thoughts come to mind about literacy (and political literacy), grading standards, perceptions of studying literature as "easy" compared to science (it's not easy), how to translate expertise so people can understand.

kittenbeloved.substack.com/p/college-en...

22.05.2025 09:06 β€” πŸ‘ 19    πŸ” 4    πŸ’¬ 1    πŸ“Œ 2
Preview
Extracting memorized pieces of (copyrighted) books from open-weight language models Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expr...

Llama 3.1 70B contains copies of nearly the entirety of some books. Harry Potter is just one of them. I don’t know if this means it’s an infringing copy. But the first question to answer is if it’s a copy at all/in the first place. That’s what our new results suggest:

arxiv.org/abs/2505.12546

21.05.2025 11:20 β€” πŸ‘ 53    πŸ” 24    πŸ’¬ 4    πŸ“Œ 4
Preview
Chicago Sun-Times Prints AI-Generated Summer Reading List With Books That Don't Exist "I can't believe I missed it because it's so obvious. No excuses," the writer said. "I'm completely embarrassed."

I spoke to the person who AI-generated the Chicago Sun-Times reading list. Says he's very embarrassed. This was part of a generic package inserted into newspapers and other publications, so likely to run elsewhere. He didn't know it'd be in Chicago Sun-Times

www.404media.co/chicago-sun-...

20.05.2025 14:47 β€” πŸ‘ 1984    πŸ” 822    πŸ’¬ 157    πŸ“Œ 443

Public Access > Open Access

20.05.2025 01:41 β€” πŸ‘ 50    πŸ” 11    πŸ’¬ 0    πŸ“Œ 0

@laurejt is following 20 prominent accounts