
Dave Willner

@dwillner.bsky.social

Co-Founder at Zentropi. Formerly Head of Trust & Safety at OpenAI, of Community Policy at Airbnb, and of Content Policy at Facebook. Strictly cold takes.

9,415 Followers  |  1,786 Following  |  189 Posts  |  Joined: 06.05.2023

Latest posts by dwillner.bsky.social on Bluesky

[Link card: zentropi-ai/cope-a-9b · Hugging Face]

You could also just run that policy using CoPE as the labeler in production - the interpreting model is only 9B parameters and is open sourced, so we can run it for you or you can run it on your own infra! huggingface.co/zentropi-ai/...

01.08.2025 14:50 — 👍 1    🔁 0    💬 0    📌 0
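A minimal sketch of what running the open-sourced labeler on your own infra might look like, assuming a plain causal-LM interface through Hugging Face transformers. The prompt template and the label() helper here are illustrative assumptions, not the format documented on the model card:

```python
# Hypothetical sketch: run CoPE-A-9B locally as a binary policy labeler.
# The prompt shape below is an assumption -- check the model card at
# huggingface.co/zentropi-ai/cope-a-9b for the real expected template.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "zentropi-ai/cope-a-9b"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

def label(policy: str, content: str) -> int:
    """Return 1 if the model judges `content` to match `policy`, else 0."""
    prompt = f"POLICY:\n{policy}\n\nCONTENT:\n{content}\n\nLABEL:"  # assumed format
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=1, do_sample=False)
    answer = tokenizer.decode(out[0, inputs["input_ids"].shape[1]:],
                              skip_special_tokens=True)
    return 1 if answer.strip() == "1" else 0
```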

Thanks friend!

01.08.2025 14:48 — 👍 0    🔁 0    💬 0    📌 0

This is actually and truly huge. That workshop was ridiculous to hear about, and I think I saw like a thousand lightbulbs turn on in people's heads at the same time

31.07.2025 22:44 — 👍 9    🔁 2    💬 1    📌 0

This looks absolutely amazing and a quick perusal shows it might actually make running a labeler smooth enough that I might be able to do it once we figure out why my brain is melting

01.08.2025 12:49 — 👍 108    🔁 8    💬 5    📌 1

It means a lot to me that you like it 😀

01.08.2025 14:47 — 👍 5    🔁 0    💬 0    📌 0

The system offers you candidate policy revisions (and flags data labels you applied that it thinks might not follow from your policy); you then read and assess them, accepting or rejecting each one depending on whether it's closer to what you want.

01.08.2025 14:44 — 👍 2    🔁 0    💬 1    📌 0

Here again, you can end up with a policy you *don't want*, but it can't really be hallucinated in the traditional sense, since the policy is a set of definitions you're asserting for the purposes of this labeling exercise. There's no ground-truth "true" policy; it's a construct.

01.08.2025 14:44 — 👍 2    🔁 0    💬 1    📌 0

The automatic model improvement system is...very complicated to explain. But it basically uses a larger stock LLM - guided by a combination of a starting policy, a starting set of data labels, and CoPE's understanding of the two - to propose and test revisions to the policy text.

01.08.2025 14:44 — 👍 1    🔁 0    💬 1    📌 0
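A rough sketch of what a loop of that shape could look like: a larger LLM proposes rewrites of the policy text, and the small labeler's agreement with the existing labels scores each candidate. Every helper name here (propose_revision, label) is a hypothetical stand-in, not Zentropi's actual pipeline, and per the post above a human still reviews the candidates in the real system:

```python
# Hypothetical greedy policy-revision loop: keep a candidate rewrite only
# if the labeler reproduces more of the human labels under it.
from typing import Callable

Dataset = list[tuple[str, int]]  # (content, human_label) pairs

def agreement(policy: str, data: Dataset,
              label: Callable[[str, str], int]) -> float:
    """Fraction of human labels the labeler reproduces under this policy text."""
    return sum(label(policy, content) == y for content, y in data) / len(data)

def optimize_policy(policy: str, data: Dataset,
                    label: Callable[[str, str], int],
                    propose_revision: Callable[[str, Dataset], str],
                    rounds: int = 5) -> str:
    best, best_score = policy, agreement(policy, data, label)
    for _ in range(rounds):
        # Show the larger LLM where the current policy and the labels disagree.
        misses = [(c, y) for c, y in data if label(best, c) != y]
        candidate = propose_revision(best, misses)
        score = agreement(candidate, data, label)
        if score > best_score:
            best, best_score = candidate, score
    return best
```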

CoPE itself simply returns 0 or 1 in response to a content policy + an example, which is how it indicates whether it assesses that the example matches the policy. So it can be wrong (and definitely is sometimes), but it can't really confabulate per se in that way.

01.08.2025 14:44 — 👍 1    🔁 0    💬 1    📌 0
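Given that contract (policy + example in, 0 or 1 out), surfacing labels that "might not follow from your policy" reduces to a disagreement check between the labeler and the human labels. A minimal sketch, reusing the hypothetical label() helper from the block further up:

```python
# Sketch: flag human labels that the binary labeler disagrees with.
from typing import Callable, Iterable

def find_disagreements(
    policy: str,
    examples: Iterable[tuple[str, int]],  # (content, human_label) pairs
    label: Callable[[str, str], int],
) -> list[tuple[str, int, int]]:
    """Return (content, human_label, model_label) triples where the two differ."""
    flagged = []
    for content, human_label in examples:
        model_label = label(policy, content)
        if model_label != human_label:
            flagged.append((content, human_label, model_label))
    return flagged
```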

We wrote those policies ourselves along with contributors including @emilycapstick.bsky.social, @klyman.bsky.social, and others credited in the model card who aren't on Bluesky. That group also labeled the data using a combined LLM/manual process (too complex for here, covered in the paper).

01.08.2025 14:44 — 👍 1    🔁 0    💬 1    📌 0

We trained CoPE ourselves using a small number (~7k) of already-open-source examples of abuse data that we labeled from ~70 different policy perspectives. We've got a draft of a paper explaining in detail how we did that, which we're going to finish editing and put up on arXiv in the next two weeks.

01.08.2025 14:44 — 👍 1    🔁 0    💬 2    📌 0
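A sketch of the kind of training rows that setup implies: each example gets paired with several policy texts, yielding one binary label per (policy, example) pair. Field names are illustrative assumptions; the forthcoming paper has the real details:

```python
# Hypothetical training-row shape for multi-perspective policy labeling.
from dataclasses import dataclass

@dataclass
class TrainingRow:
    policy: str   # full policy text, one of the ~70 perspectives
    content: str  # the abuse-data example being judged
    label: int    # 1 = matches this policy, 0 = does not

def expand(example: str, judgments: dict[str, int]) -> list[TrainingRow]:
    """One training row per policy perspective that judged this example."""
    return [TrainingRow(policy, example, y) for policy, y in judgments.items()]
```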
[Link card: zentropi-ai/cope-a-9b · Hugging Face]

Great questions. The answer to this is pretty dense, but I will do my best! There are multiple models used here. The one that gets used a lot is CoPE-A-9B, which interprets and applies the policies to specific examples. We've open-sourced that model; you can find it at huggingface.co/zentropi-ai/...

01.08.2025 14:44 — 👍 2    🔁 0    💬 1    📌 0

It's definitely not a perfect system, but honestly the imperfections here are more around the potential for us to get it to work better than they are around either energy use or privacy, where it is neutral-to-good compared to the status quo ante.

01.08.2025 14:26 — 👍 4    🔁 0    💬 1    📌 0

-> We do have the data you give us for policy writing on the site (the auto-optimization is a pretty complex multistep process behind the scenes, we need the data to show you results on the site, etc.), but those sets don't need to be all that large, or even real user data.

01.08.2025 14:26 — 👍 0    🔁 0    💬 1    📌 0
[Link card: zentropi-ai/cope-a-9b · Hugging Face]

-> The API we provide for people to apply policies to specific pieces of content is zero-data-retention - we don't keep or train on any of it. We also open-sourced the model, so you can run it yourself if you need/want: huggingface.co/zentropi-ai/...

01.08.2025 14:26 — 👍 2    🔁 0    💬 1    📌 0

-> That's small enough that it's not *that much* bigger than traditional black box ML models that are already used very widely by big social networks (but which require humans to look at thousands of horrors repeatedly to train).

01.08.2025 14:26 — 👍 1    🔁 0    💬 1    📌 0

Can't speak to other attempts, but we thought about both issues pretty carefully:

-> The interpreting model (it reads and applies the policy, and is the part you'd run at volumes high enough to be an environmental issue) is only 9B parameters (less than 1% the size of GPT-4), so it is energy-light to train and to run.

01.08.2025 14:26 — 👍 4    🔁 0    💬 1    📌 0

If there's a way to build tools that can be useful here using LLMs, I think that's clearly good no matter what you think of "AI". Ultimately I'm trying to figure out ways to make a community of people I've worked with my entire adult life able to do their jobs better with less trauma.

01.08.2025 07:59 — 👍 2    🔁 0    💬 0    📌 0

For context, I was Facebook's 12th content moderator. Looking at very messed up content was my job for years. Done at the industrial scales of big platforms (not small-forum scale), it is bad for the folks who do it *and* leads to mediocre results. So we don't have to be "super" here to be useful.

01.08.2025 07:59 — 👍 1    🔁 0    💬 1    📌 0

That's not my claim here - we've fine-tuned a very small model to be pretty good (definitely not perfect) at following clearly written policies, and used that to build a system that helps policy writers figure out how to say what they mean much more quickly than puzzling through it alone.

01.08.2025 07:59 — 👍 1    🔁 0    💬 1    📌 0

OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY
OPEN SOURCE SAFETY

31.07.2025 22:46 — 👍 6    🔁 1    💬 0    📌 0

It's not going to magically fix everything. But I am cautiously optimistic that this, and things like it, will let us make real progress for the first time in a while.

31.07.2025 22:14 — 👍 3    🔁 0    💬 0    📌 0

Getting better at multi-turn is, like multilingual performance, one of those things that keeps coming up, so we will likely work on it at some point. The main constraint with our approach is that you need to express policies as explicit criteria about the content itself.

31.07.2025 21:30 — 👍 0    🔁 0    💬 0    📌 0
[Link card: Zentropi CoPE Demo - a Hugging Face Space by zentropi-ai. Enter content and a policy; the app returns "1" if the content meets any criteria and "0" if it does not.]

Aaah, got it. We've open-sourced our first (and current) version of the underlying classification model here - huggingface.co/spaces/zentr...

It's a fine-tune of Gemma 2, so it only has 8,000 tokens of context. It can work with multi-turn conversations, but wasn't specifically trained for them.

31.07.2025 21:28 — 👍 3    🔁 0    💬 1    📌 0
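A small sketch of one way to work within that 8,000-token window when labeling multi-turn conversations: keep the most recent turns that fit a token budget. The helper and the 6,000-token budget are assumptions for illustration, not part of the released model:

```python
# Sketch: fit a multi-turn transcript into the ~8k-token context of the
# Gemma 2 fine-tune before labeling. The budget leaves illustrative headroom
# for the policy text and prompt scaffolding.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("zentropi-ai/cope-a-9b")

def fit_conversation(turns: list[str], budget: int = 6000) -> str:
    """Drop the oldest turns until the transcript fits within `budget` tokens."""
    kept: list[str] = []
    used = 0
    for turn in reversed(turns):  # walk newest-first
        n = len(tokenizer(turn)["input_ids"])
        if used + n > budget:
            break
        kept.append(turn)
        used += n
    return "\n".join(reversed(kept))  # restore chronological order
```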

@samidh.bsky.social and @jbellack.bsky.social have had some similar thoughts. We'd be glad to discuss if you're interested!

31.07.2025 21:16 — 👍 8    🔁 0    💬 1    📌 0
[Link card: zentropi-ai/cope-a-9b · Hugging Face]

Re: multilingual, we've open-sourced the labeling model at huggingface.co/zentropi-ai/.... We've also got a paper forthcoming shortly (I need to do my edits) on the training methodology. The hope here is that folks will be able to help improve the broader project and/or make tailored versions.

31.07.2025 20:59 — 👍 12    🔁 0    💬 1    📌 1

Samidh did some simple experiments with a less performant version back in December and it did seem to be probably-workable!

31.07.2025 20:39 — 👍 3    🔁 0    💬 1    📌 0

The workshop was successful enough that we're considering running another one, if you're interested! Re: language - CoPE-A is decent outside of English, but definitely worse than in English, and improving that is on our roadmap. There's no underlying methodological reason it can't be done; we just have to do the work!

31.07.2025 20:32 — 👍 10    🔁 0    💬 1    📌 0

Yay!

31.07.2025 20:01 — 👍 1    🔁 0    💬 0    📌 0

Just tested this on a few that I know Reddit's existing Hatred & Harassment automation has blind spots for; it built a focused, accurate labeler in under 10 minutes / a dozen examples, & the human-readable criteria it built could be dropped into a training manual / erratum / used to build a regex

31.07.2025 19:48 — 👍 12    🔁 5    💬 2    📌 0
