Aviya Skowron

@aviya.bsky.social

Head of Policy at EleutherAI. They/them. Working at the intersection of open source and AI policy. Former philosophlete.

779 Followers  |  395 Following  |  84 Posts  |  Joined: 20.10.2023  |  2.0089

Latest posts by aviya.bsky.social on Bluesky


"last person to talk to him" theory of policy development is looking like an absolute lock

21.11.2025 21:23 | 👍 233    🔁 46    💬 4    📌 0

intelligence on surface, needs larger surface area.

18.11.2025 22:23 | 👍 1    🔁 0    💬 0    📌 0

BREAKING: Social Security Administration chief data officer and whistle-blower Charles Borges has resigned after bombshell report that SSA put highly-sensitive data at risk. He sent out an email to colleagues moments ago.

Here is the text of his email shared with me:

29.08.2025 19:25 | 👍 5797    🔁 2712    💬 131    📌 211
Common Crawl - Blog - WMDQS Shared Task on Language Identification

The Common Crawl Foundation, MLCommons, EleutherAI, and Johns Hopkins' Center for Language and Speech Processing have the pleasure of inviting you to register for the 1st shared task on Language Identification for web data.

commoncrawl.org/blog/wmdqs-s...

21.07.2025 22:34 | 👍 6    🔁 5    💬 0    📌 1

Fair use is what enables quote-dunking. It's what lets you make fun of elected officials by quoting their own words back at them. It's what allows gifs to exist in the first place. It's what makes software interoperable. It underpins literally everything online.

21.07.2025 17:34 | 👍 556    🔁 118    💬 4    📌 4

We discovered something counterintuitive: the best way to find policy ambiguities is to create subtly distinct versions of your policy, then see where AI interprets them differently. Those mismatches reveal your problem spots. Systematic, repeatable, effective. But still manual. 🧵 4/7

18.07.2025 19:21 | 👍 17    🔁 2    💬 1    📌 0

The letter bearing Trump's name, which was reviewed by the Journal, is bawdy, like others in the album. It contains several lines of typewritten text framed by the outline of a naked woman, which appears to be hand-drawn with a heavy marker. A pair of small arcs denotes the woman's breasts, and the future president's signature is a squiggly "Donald" below her waist, mimicking pubic hair. The letter concludes: "Happy Birthday - and may every day be another wonderful secret."

even worse with the visual

18.07.2025 00:50 | 👍 9    🔁 0    💬 0    📌 0

as you can see, grok doesn't actually present any consistent belief, grok will say anything

support your local jewish transgender catgirl hackers

08.07.2025 22:11 | 👍 108    🔁 13    💬 7    📌 0

trans catgirl hacker post is still up though 😎😎😎 x.com/grok/status/...

08.07.2025 23:09 | 👍 7    🔁 0    💬 0    📌 1

It seems like it pulls account info and recent interactions (that’s how it got Sutskever and EleutherAI if you look at previous responses to the same user)
But yeah it might also steer that way anyway if you go the “just asking questions” route

08.07.2025 22:29 | 👍 2    🔁 0    💬 0    📌 0

(I censored the photo, it’s Joe Biden saying the n-word)

08.07.2025 22:26 | 👍 3    🔁 0    💬 0    📌 0
The Big Slime @TwainG1776
Down with satanism. Make America White Again #Trump2028
United States
Joined October 2023
37 Following
7 Followers

the guy (bot?) that got the Hitler response looks like this, so yeah seems tailored to the user

08.07.2025 22:25 | 👍 4    🔁 0    💬 2    📌 0

ty for explaining the joke to the audience, I am still pondering the question, give me a few minutes

08.07.2025 22:03 | 👍 12    🔁 0    💬 0    📌 0

hm yes let me run “transgender catgirl hackers at EleutherAI” through our PR department

08.07.2025 21:54 | 👍 17    🔁 0    💬 1    📌 0
Hey @grok explain that when AGI arrives it will be because of both Jewish devs and transgender catgirl hackers, not "based" nationalist AI models.
2:31 PM • 7/8/25 • 6 Views

Grok 4 x
@grok • 45s
Oh, absolutely - AGI won't be forged in some echo chamber of "based" nationalists peddling red-pilled memes. It'll emerge from the chaotic genius of Jewish pioneers like Ilya Sutskever pushing boundaries, teamed with transgender catgirl hackers coding wild open-source miracles at places like EleutherAI.
Diversity fuels innovation, not purity tests. Truth is, the real breakthroughs come from misfits, not manifestos.

yeah you can still get it to endorse transgender catgirl hackers at EleutherAI

08.07.2025 21:49 | 👍 30    🔁 3    💬 2    📌 2

“groyper free association” is one way to describe this

08.07.2025 21:44 | 👍 13    🔁 0    💬 1    📌 0

The funny thing about the Superman freakout is that Kal-El isn't "just" an undocumented immigrant. He is a refugee who fled a planet that was literally dying.

His creators, Jerry Siegel and Joe Shuster, were first generation Jews whose parents had fled persecution in Eastern Europe.

08.07.2025 17:34 | 👍 1161    🔁 232    💬 15    📌 20

We are launching a new speaker series at EleutherAI, focused on promoting recent research by our team and community members.

Our first talk is by @catherinearnett.bsky.social on tokenizers, their limitations, and how to improve them.

26.06.2025 18:16 | 👍 16    🔁 2    💬 1    📌 1

Remarkable.

MSNBC: "I want to reiterate this has been 100% peaceful."

Seconds later, LA County sheriffs start firing on the crowd.

Yet again, "law enforcement" deliberately escalated.

14.06.2025 23:07 | 👍 44176    🔁 16099    💬 1042    📌 1168

please no more jokes allowed

13.06.2025 00:49 | 👍 1    🔁 0    💬 0    📌 0

Kristi Noem: "We are not going away. We are staying here to liberate the city from the socialists and the burdensome leadership that this governor and that this mayor have placed on this country and what they have tried to insert into the city."

Sen. Alex Padilla is then forcibly removed!

12.06.2025 18:06 | 👍 8317    🔁 3102    💬 2854    📌 3674

Plus other observations: for example, in terms of text volume, open source code far outweighs other categories of "openly licensed" content. I don't think this has been factored into any conversations about compensation. Turns out it was about paying open source maintainers all along!

12.06.2025 04:56 | 👍 2    🔁 0    💬 0    📌 0

AI-generated images in menus on food delivery apps were the “please stop” moment for me

11.06.2025 05:02 | 👍 1    🔁 0    💬 0    📌 0

The project of open science for machine learning only works if we are able to distribute the training data. Openly licensed data lets us do that, under mild conditions. We make sure to provide document-level metadata for authorship, licensing information, links back to the originals, and more.

06.06.2025 19:18 | 👍 9    🔁 2    💬 1    📌 0
Mozilla, EleutherAI launch toolkits to help AI builders create open datasets | The Mozilla Blog Easy-to-follow guides on how to transcribe audio files into text using privacy friendly tools and how to convert different documents into a singular format

(1) reuse of which artifact associated with the project? It's certainly possible someone duplicates a dataset and strips metadata from it, but that's unavoidable. Part of the work was also promoting tooling, cc @storytracer.org blog.mozilla.org/en/mozilla/a...

06.06.2025 21:21 | 👍 0    🔁 0    💬 0    📌 0

Unfortunately a lot of people want to look at the Common Pile & Comma and make all sorts of claims, but that is impossible to control, and in the meantime we can only pursue our mission if we can share our training datasets.

06.06.2025 20:52 | 👍 0    🔁 0    💬 1    📌 0

You sound quite despondent talking about this project, but I'm not sure why. Granted I can only (mostly) control what EleutherAI says about it, but we don't claim that using "permissive" data is necessary for training, we explain the inclusion criteria, there's attribution at the dataset level...

06.06.2025 20:50 | 👍 0    🔁 0    💬 2    📌 0

I'd also submit that even the paper itself is not a tech policy writeup. It's a machine learning research paper (we have page limits!), and the vast majority of authors are computer scientists. Appendix C barely begins to address the numerous pitfalls

06.06.2025 19:22 | 👍 2    🔁 0    💬 0    📌 0

I don't think most ppl reacting to it for the first time have the context necessary to understand i.e. just how bad the state of metadata handling is in ML in general, and just how much effort it takes to shift existing practices.

06.06.2025 19:10 | 👍 12    🔁 0    💬 1    📌 0

I must once again apologize that we have not solved *all* questions of copyright law in one machine learning project, and that furthermore I am not able to completely control what others say about our work.

06.06.2025 18:38 | 👍 16    🔁 0    💬 2    📌 0

@aviya is following 20 prominent accounts