@siddheshp - Bluesky Profile

Latest posts by siddheshp.bsky.social on Bluesky

Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge The measurement tasks involved in evaluating generative AI (GenAI) systems lack sufficient scientific rigor, leading to what has been described as "a tangle of sloppy tests [and] apples-to-oranges com...

Check out the camera-ready version of our ICML position paper ("Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge") to learn more!!! arxiv.org/abs/2502.00561

(6/6)

15.06.2025 00:20 — 👍 41 🔁 10 💬 3 📌 0

i mean, people have different goals, and if you cared about some niche aspect of query focused multi doc sum before, it is legit to continue. or you can switch focus and start thinking of HCI. the second became much more possible now, the first maybe hasnt.

17.12.2024 17:16 — 👍 3 🔁 1 💬 1 📌 0

I wonder if people have suggestions about what parts of writing could be complemented using AI with compromising thinking or could help better organization of thoughts: Making arguments stronger, reviewing, generating ideas about structure?

12.12.2024 17:00 — 👍 1 🔁 0 💬 0 📌 0

🌶️(?) take: Agents are somehow hot right because people realized that LLM output can be interpreted as a DSL which directs side effects in the world (e.g. tool calls) rather than just returning text in a chat/autocomplete sense. What are the open challenges? A 🧵... [1/11]

19.11.2024 09:32 — 👍 168 🔁 30 💬 9 📌 7

#EMNLP has a nice set of tokenization/subword modeling papers this year.

It's a good mix of tokenization algorithms, tokenization evaluation, tokenization-free methods, and subword embedding probing. Lmk if I missed some!

Here is a list with links + presentation time (in chronological order).

11.11.2024 22:38 — 👍 48 🔁 16 💬 5 📌 2

Tagging my co-authors as I find them:
@iaugenstein.bsky.social @rnv.bsky.social

11.11.2024 09:58 — 👍 0 🔁 0 💬 0 📌 0

Survey of Cultural Awareness in Language Models: Text and Beyond Large-scale deployment of large language models (LLMs) in various applications, such as chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure inclusivity. Cul...

We are excited to share our comprehensive survey on cultural awareness in #LLMs! 🗺️ [Was posted on X a few days before]
We reviewed 300+ papers across diverse modalities (language, vision-language, etc.)
arxiv.org/abs/2411.00860

11.11.2024 09:57 — 👍 2 🔁 0 💬 1 📌 1

@siddheshp is following 20 prominent accounts

Dean Eckles
@eckles

networks, contagion, causality faculty at MIT

Heather Froehlich
@heatherfro

supporting researchers counting words in various ways with computers at university of arizona libraries; increasingly displaced new englander

McSweeney's
@mcsweeneys.net

The official Bluesky feed of McSweeney's Quarterly Concern, McSweeney's Internet Tendency, & McSweeney's Books. .

Dan Jurafsky
@jurafsky

Stanford professor

Francesca Padovani
@frap98

2nd year PhD Student at @gronlp.bsky.social 🐮 - University of Groningen Language Acquisition - NLP

Srishti
@srishtiy

Grace
@gracekind.net

A latent space odyssey gracekind.net

Stanford NLP Group
@stanfordnlp

Computational Linguists—Natural Language—Machine Learning

Dorsa Amir
@dorsaamir

Assistant Professor of Psychology at Duke University studying kids & culture. Director of the Mind & Culture Lab. Mom x3. Some people just want to watch the world learn. dorsaamir.com | mindandculturelab.com

Jessi Grieser
@jessgrieser.com

Sociolinguist, novelist, photographer, quilter, saxophonist and Boglehead. Associate Professor of Linguistics at UMich.

Sara Hooker
@sarahooker

I lead Cohere For AI. Formerly Research Google Brain. ML Efficiency, LLMs, @trustworthy_ml.

SkynetAndChill.com
@skynetandchill.com

Daily artificial intelligence news, lovingly curated by man and machine. https://www.skynetandchill.com Neo-Luddite AI maven. On a long enough timeline, p(doom) for everything goes to 1.

Ian Maurer
@imaurer

Fighting cancer with code. Interests: Bioinformatics, Genomics, LLMs, Python, Rust, NLP. https://imaurer.com/ https://github.com/imaurer/

Jack Parker-Holder
@jparkerholder

RS at Google DeepMind and Honorary Lecturer at UCL. Building general world models to solve AGI :)

Akari Asai
@akariasai

Ph.D. student at University of Washington CSE. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃‍♀️🧗‍♀️🍳

Teddy Roland
@teddyroland

Postdoc @ School of Information Sciences, University of Illinois Urbana-Champaign. American Literature, Media Theory, Data Science. I publish under "Edwin Roland" but don't tell anyone.

Becca Cohen
@beccacohen

IS PhD Student at UIUC studying digital humanities, language, cultural analytics and ethical AI

PD (Petey) Edgar MFA, MA
@pdedgar30

Texts & Technology PhD student @ U. Central Florida Poetry culture online + in print + w AI Founding editor @remediatelitmag.bsky.social Forthcoming in OROBORO, MQR pdedgar.xyz

Anna Schewelew
@otocolobusmanul

PhD Student CompLit @UCSB Intellectual History of Machine Translation Soviet Cosmopolis/Central Asian Film Frankfurter Schule x Gießener Schule Macrodata Refinement

Julia Neugarten
@julianeugarten

Researcher of fandom studies, fanfiction, computational literary studies and literary reception. Currently a PhD candidate at Radboud University. Views are my own.