That's not how the End Times work Mike!
22.11.2025 01:41 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0@dwillner.bsky.social
Co-Founder at Zentropi. Formerly Head of Trust & Safety at OpenAI, of Community Policy at Airbnb, and of Content Policy Facebook. Strictly cold takes.
That's not how the End Times work Mike!
22.11.2025 01:41 โ ๐ 7 ๐ 0 ๐ฌ 0 ๐ 0Bellingcatโs contact email has always been a magnet for people with fairly unusual views; paranoid delusions, sprawling conspiracies, the works. But recently, the pattern has shifted, weโre seeing more and more emails clearly written with ChatGPT.
19.11.2025 14:18 โ ๐ 3089 ๐ 813 ๐ฌ 54 ๐ 268this administration, and its congressional allies, are free speech phonies. not warriors. phonies. censors. propagandists.
20.11.2025 01:55 โ ๐ 13 ๐ 5 ๐ฌ 0 ๐ 1Just wanna re-up in simple terms that when Biden talked to platforms, Jim Jordan launched years of investigations into everybody involved, said it was tyranny, censorship, etc.
And now they just straight up acknowledge that they talk to platforms too.
www.washingtonexaminer.com/news/crime/3...
๐ธ
18.11.2025 03:44 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0Totally feel free to point folks to the blog, it wonโt poison anything!
13.11.2025 23:14 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0We just wrote an in-depth post about Toxic Content labeling. It presents a new way of defining toxic speech online-- and illustrates the importance of observable features for accurate language model interpretability. Would love to hear how YOU define toxicity, too! blog.zentropi.ai/observations...
13.11.2025 22:47 โ ๐ 5 ๐ 2 ๐ฌ 0 ๐ 0Iโve had a very โtext-orientedโ view of content labeling for a long time, and used the opportunity of our recent launch to lay out some of those ideas in the context of the idea of โtoxicityโ
Interested to know what others think!
blog.zentropi.ai/observations...
Find these at zentropi.ai and on my profile at zentropi.ai/u/dave. As more folks write and publish policies, we'll be adding the best ones to the featured section to give people more options. ๐งต 5/5
10.11.2025 20:10 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0Our goal here isn't to provide perfect policies or tell anyone what rules they should have. It's to provide examples of what actually works when writing content policies for LLM interpretation that anyone can then adapt to fit their own needs. ๐งต 4/5
10.11.2025 20:10 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0To start things off, I've built 7 policies - harassment, hate, violence, self-harm, sexual content, drugs, and toxicity. All created using Zentropi itself, with a touch of editing on my end. Examples of what's possible, starting points to adapt. ๐งต 3/5
10.11.2025 20:10 โ ๐ 6 ๐ 0 ๐ฌ 1 ๐ 1We're hoping to change this by encouraging more public sharing of people's work. So, today, we're launching policy discovery and sharing on Zentropi. Browse featured policies, find more from authors you like, fork what fits, and adapt for your context. ๐งต 2/5
10.11.2025 20:10 โ ๐ 4 ๐ 1 ๐ฌ 1 ๐ 1Content policies are usually private, one-off efforts. You build yours, I build mine, we don't share much about what works or why. This makes sense given products can (and should) set different policies based on their communities, but it leaves us reinventing the wheel. ๐งต 1/5
10.11.2025 20:10 โ ๐ 18 ๐ 7 ๐ฌ 2 ๐ 3You should resign both your leadership position and your seat so that someone who is up to this can take over.
10.11.2025 03:54 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0*whispers* you can continue to read me, the pundit who insisted the other pundits were wrong about these conclusions
05.11.2025 15:06 โ ๐ 5907 ๐ 557 ๐ฌ 67 ๐ 15Go U Bears!
05.11.2025 02:54 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Picture of the East Wing demolition of the White House taken on my flight out of DCA.
23.10.2025 17:16 โ ๐ 15066 ๐ 5849 ๐ฌ 1205 ๐ 1081Itโs recorded iirc, should be up on YouTube!
22.10.2025 22:43 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0I am forgetful about it self-promotion, so dropping a last minute link to note that Iโm giving a talk Berkman Klein today. Come check it out if youโre free, or catch the recording later:
cyber.harvard.edu/events/autom...
I feel like some of the difference in reactions here also rests on on frequently you have to do a somewhat complex, but very repetitive, task. Taking the time to get these sort of workflows really dialed in is most useful for stuff you do over and over.
17.10.2025 22:00 โ ๐ 7 ๐ 1 ๐ฌ 0 ๐ 0Iโve never understood why, because itโs not like this was a subtle theme!
16.10.2025 03:02 โ ๐ 11 ๐ 0 ๐ฌ 0 ๐ 0He would definitely have also hated that ๐
16.10.2025 03:01 โ ๐ 10 ๐ 0 ๐ฌ 0 ๐ 0The thing that always gets me is that Tolkien obviously would have hated Silicon Valley generally, and these particular guys specifically.
16.10.2025 00:52 โ ๐ 333 ๐ 35 ๐ฌ 2 ๐ 8Tyranny is brittle.
10.10.2025 17:16 โ ๐ 53 ๐ 11 ๐ฌ 1 ๐ 0Right? I donโt even know what the current fight is about, but letโs not be silly now.
03.10.2025 15:11 โ ๐ 4 ๐ 0 ๐ฌ 1 ๐ 0So, the first part of this is plainly false, both historically and currently. I donโt think itโs a good thing in most casesโฆbut itโs plainly the case that pressuring the people in charge of moderation to either ban (or not ban) people works *All The Time*. It is why people do it!
03.10.2025 15:03 โ ๐ 50 ๐ 9 ๐ฌ 3 ๐ 1New Ctrl-Alt-Speech: Moderating is Such Sweet Sorrow with guest host @dwillner.bsky.social who is entirely responsible for bringing up Shakespeare as part of this discussion. (@benwhitelaw.bsky.social will be back next week!)
podcast.ctrlaltspeech.com/2315966/epis...
While terrible, this is entirely unsurprising. If you hold serious safety efforts in contempt, this sort of thing is inevitable.
23.09.2025 04:59 โ ๐ 78 ๐ 22 ๐ฌ 1 ๐ 0Disney/ABC have a responsibility to refuse to participate in corruption.
Kimmel must be reinstated. If Disney/ABC agree to this extortion then perhaps creatives + workers should consider collective action to push back. Same w/buying park + cruise tickets if they bow.
People have power. Ask Target
No one who agrees to this is a journalist.
20.09.2025 00:27 โ ๐ 11 ๐ 4 ๐ฌ 0 ๐ 0