Ryan Wesslen's Avatar

Ryan Wesslen

@ryanwesslen.bsky.social

ML Engineer. Data Scientist. HCI/Vis Researcher. Bayesian. Computational social scientist. Tar Heel. White Sox fan. Views are my own.

131 Followers  |  170 Following  |  1 Posts  |  Joined: 19.10.2023  |  1.6092

Latest posts by ryanwesslen.bsky.social on Bluesky


Video thumbnail

Economics might seem -- from the outside -- like it's about competition. But really it's about creating the miracle of cooperation, where folks from all around the world enrich your day in a million tiny ways. It's that beauty that I'm worried about losing.

11.04.2025 22:16 β€” πŸ‘ 339    πŸ” 77    πŸ’¬ 19    πŸ“Œ 8

behold the CONNECTED SPLATTERPIE

kneel before my works, ye mighty, and despair

30.03.2025 21:00 β€” πŸ‘ 394    πŸ” 68    πŸ’¬ 34    πŸ“Œ 23

β€œResponsible AI” is a bad word at NIST now.

14.03.2025 23:37 β€” πŸ‘ 24    πŸ” 5    πŸ’¬ 2    πŸ“Œ 0
Preview
Editorial: A president just disrespected America in the Oval Office. It wasn’t Zelensky It’s time to say it plainly. America’s leadership has switched sides in the war. The American people have not, and they should speak up. In the past several weeks, the U.S. leadership has demonstrate...

NEW: A searing editorial in the The Kyiv Independent.

β€œIt’s time to say it plainly. America’s leadership has switched sides in the war. The American people have not, and they should speak up.

β€œA president just disrespected America in the Oval Office. It wasn’t Zelensky.”

@kyivindependent.com

28.02.2025 22:45 β€” πŸ‘ 67706    πŸ” 19229    πŸ’¬ 1257    πŸ“Œ 1039
Post image

New paper <3
Interested in inference-time scaling? In-context Learning? Mech Interp?
LMs can solve novel in-context tasks, with sufficient examples (longer contexts). Why? Bc they dynamically form *in-context representations*!
1/N

05.01.2025 15:49 β€” πŸ‘ 53    πŸ” 16    πŸ’¬ 2    πŸ“Œ 1
Synthetically generated text for supervised text analysis | Political Analysis | Cambridge Core Synthetically generated text for supervised text analysis

New paper in Political Analysis on synthetic text data for training classifiers. Main idea: generate training examples with LLMs, then fit classifiers on synthetic (+real) text. Paper has validations and guidance.
Blog: andrewhalterman.com/post/synthet...
Paper: www.cambridge.org/core/journal...

31.01.2025 17:07 β€” πŸ‘ 14    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1

What are some of the things you've learned about how LLMs (and LLM-powered systems like ChatGPT) work that were non-obvious but most helped you build a more effective mental model of how to use them?

04.01.2025 20:40 β€” πŸ‘ 350    πŸ” 31    πŸ’¬ 109    πŸ“Œ 14
Preview
Zero-Cost Custom Feeds on Bluesky A simple stack for generating custom feeds for Bluesky programmatically without a backend server

Wrote down the process to build your own custom feeds for Bluesky programmatically in Python and run it 100% free

Uses @skyfeed.app + @github.com actions to do periodic filtering and re-ranking and @cloudflare.social static pages to provide data to @bsky.app

01.12.2024 14:42 β€” πŸ‘ 136    πŸ” 25    πŸ’¬ 10    πŸ“Œ 2

There’s something fundamentally wacky going on here. That a) there’s an arrow of time in language is cool but b) that the magnitude varies a lot between languages? Wacky.

12.02.2024 22:02 β€” πŸ‘ 22    πŸ” 7    πŸ’¬ 6    πŸ“Œ 1
Preview
What's new with ML in production What's different about LLMs versus traditional ML

New post: Have been meaning to write something around what has fundamentally changed around the process of putting ML into prod now that we have LLMs. TL;DR: It's still just compression, we just don't control as much anymore.

vickiboykis.com/2024/01/15/w...

18.01.2024 20:45 β€” πŸ‘ 22    πŸ” 7    πŸ’¬ 1    πŸ“Œ 1
Preview
Prodigy in 2023: LLMs, task routers, QA and plugins Β· Explosion We have made a ton of new updates in Prodigy this year with v1.12, v1.13, and v1.14 releases. So we decided to write a post about them.

In 2023, we rolled out Prodigy v1.12-1.14 packed with new features like spacy-llm integration, prompt engineering, QA support like IAA metrics, task routing, and new plugins such as PDF and Hugging Face πŸ€—.

We highlight the many updates in our new blog post πŸŽ‰

explosion.ai/blog/prodigy...

29.11.2023 15:34 β€” πŸ‘ 5    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Post image

The PyData NYC video from "Half hour of labeling power: Can we beat GPT?" by @ryanwesslen.bsky.social & me is now live!

We show how to use LLMs to speed up annotation, collect 1.2k examples & beat our baseline.

πŸ“Ί Video: www.youtube.com/watch?v=Ta45...
πŸ“ Slides: speakerdeck.com/inesmontani/...

27.11.2023 12:09 β€” πŸ‘ 5    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

Deck the halls with NLP! @explosion-ai.bsky.social'sπŸ’€ t-shirts, tot bags, or mugs are sure to bring a touch of skeleton swagger this holiday season

20.11.2023 13:47 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Hello NYC ✨ Looking forward to seeing everyone at PyData NYC next week!

We'll have an @explosion-ai.bsky.social booth again with brand new swag, and @ryanwesslen.bsky.social & I will start the conf with our tutorial "Half hour of labeling power: Can we beat GPT?": nyc2023.pydata.org/cfp/talk/WQY...

29.10.2023 14:50 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

Announcing ✨Prodigy-HF ✨

It's a new plugin that allows you to train @huggingface.bsky.social NER models directly on annotated data in Prodigy. It also provides a recipe to upload annotations to Hugging Face HUB!

25.10.2023 13:52 β€” πŸ‘ 5    πŸ” 4    πŸ’¬ 2    πŸ“Œ 1
Post image

Interesting attempt to categorize LLM hallucinations (from an international team including Stanford and Amazon) arxiv.org/abs/2310.04988

24.10.2023 12:20 β€” πŸ‘ 0    πŸ” 2    πŸ’¬ 0    πŸ“Œ 1
Prodigy-PDF for PDF annotation and OCR - Prodigy Shorts
We've recently introduced Prodigy Plugins which extend the features of Prodigy by adding direct support for 3rd party integrations. One of these plugins is P... Prodigy-PDF for PDF annotation and OCR - Prodigy Shorts

I added support for PDFs for Prodi.gy in the past few weeks. So I figured I'd record a small demo.

www.youtube.com/watch?v=rwyz...

If folks are interested in working on detection models for PDFs -> let me know if there's recipes missing!

24.10.2023 15:29 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

I wonder if people can guess what plugin I'll add next to prodi.gy.

23.10.2023 11:32 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

We recently released ✨ Prodigy-ANN ✨ that allows you to use contextual search to find relevant subsets of data to annotate first.

To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature πŸ‘€

youtu.be/jyu2nbjwfXw

20.10.2023 13:56 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1
Prodigy interface with PDF-Prodigy OCR extension example

Prodigy interface with PDF-Prodigy OCR extension example

The new OCR feature uses Pytesseract under the hood to attach parsed text to segments that you've annotated with the `pdf.image.manual` recipe.

If you want to learn more, the docs have plenty of extra info: πŸ‘€

prodi.gy/docs/plugins...

19.10.2023 12:47 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
An example output of Prodigy's inter-annotator agreement command with annotation and agreement statistics.

An example output of Prodigy's inter-annotator agreement command with annotation and agreement statistics.

Curious if your annotators are on the same page? Prodigy has just released v1.14.3 with built-in inter-annotator agreement (IAA) metrics to track and measure their agreement. In this 🧡, we'll review Prodigy's document-level IAA metrics. prodi.gy/docs/metrics

18.10.2023 14:34 β€” πŸ‘ 5    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

@ryanwesslen is following 20 prominent accounts