Stella Biderman

@stellaathena.bsky.social

I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.

5,563 Followers  |  358 Following  |  357 Posts  |  Joined: 04.05.2023

Posts by Stella Biderman (@stellaathena.bsky.social)

I'm not sure if I'm more called out by this skeet or the fact that I've had two kidney stones already tbh...

05.03.2026 06:42 | 👍 1    🔁 0    💬 0    📌 0
Examples of mislabeled web text by existing LangID systems. A full text version is available on the blog post below.

Language identification still proves to be a challenging task, especially for web data. In collaboration with @mlcommons.org, @eleutherai.bsky.social, @jhu.edu, and 97 community members, we created CommonLID, a new benchmark for LangID for 100+ languages!

10.02.2026 20:44 | 👍 11    🔁 5    💬 1    📌 0

Has anyone else had Claude Code become non-functional recently? Even with a test input it spins for minutes without doing anything. Same thing happens in terminal.

07.02.2026 16:40 | 👍 4    🔁 0    💬 4    📌 0

It's going to get worse because people hate AI

07.02.2026 16:37 | 👍 11    🔁 0    💬 3    📌 0

The only reasons I use social media platforms are to get eyeballs on research and to yell at people who are wrong online.

X > Bluesky at both for me

27.01.2026 23:28 | 👍 0    🔁 0    💬 0    📌 0

And the next talk (exact details TBA) by @pjox.bsky.social and @very-laurie.bsky.social from Common Crawl on work we've been collaborating on to build better benchmarking of LangID systems and understand the issues with the long tail of human language that comes up at Common Crawl scales.

09.01.2026 00:56 | 👍 5    🔁 1    💬 0    📌 0

We're bringing back a Community Spotlight talk series, highlighting cool work being done by members of our community. We're kicking it off with a talk on running diffusion-based world-models in real time on consumer hardware.

Jan 9th at 2 pm US Eastern Time

09.01.2026 00:56 | 👍 11    🔁 4    💬 1    📌 0
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards
Filtering pretraining data prevents dangerous capabilities, doesn't sacrifice general performance, and results in models that are resistant to tampering.

What are people's favorite paper / project websites? I'm looking to build a library to base future ones I make off of.

The one EleutherAI has done that I'm proudest of is deepignorance.ai

06.01.2026 20:15 | 👍 5    🔁 0    💬 0    📌 0

Imagine if they had done their job and put the pedophile insurrectionists in jail for the rest of their lives.

24.12.2025 17:13 | 👍 1    🔁 0    💬 0    📌 0

I'm late to seeing this post, but this is a super cool paper! Thanks for sharing. Do you have more work in this vein in the works?

24.12.2025 04:54 | 👍 0    🔁 0    💬 1    📌 0
60 Minutes Inside CECOT : Free Download, Borrow, and Streaming : Internet Archive
Full video of the 60 Minutes Inside CECOT episode that CBS pulled.

If you're looking for the 60 Minutes piece on CECOT that Bari Weiss canned to protect the President, you can find it here: archive.org/details/60-m...

23.12.2025 04:02 | 👍 13    🔁 4    💬 0    📌 0

Ukraine is not the only country Sebastian has created non-profits to support. The @datarescueproject.org has spent the past year coordinating and managing volunteer communities in the US working to back up scientific, cultural, and historical data that the USG wants to suppress.

19.12.2025 19:51 | 👍 2    🔁 0    💬 0    📌 0

They were honored alongside the Ukrainian Library Association whose on-the-ground work in Ukraine to promote and preserve Ukrainian language and culture has been all the more vital as Ukraine fights for its existence.

19.12.2025 19:51 | 👍 2    🔁 0    💬 1    📌 0

Incredibly proud of my friend and colleague @storytracer.com. Two weeks ago he and his cofounders @sucho-org.bsky.social were honored for organizing a global network of volunteers to exfiltrate and back up endangered Ukrainian cultural heritage in the wake of the invasion by Russia.

19.12.2025 19:51 | 👍 8    🔁 0    💬 1    📌 0

Really great to see NVIDIA staking out a pro-open data position. This used to be common, if not the norm in AI, and the backing away from this level of transparency has done a lot of harm to the research community.

16.12.2025 19:32 | 👍 48    🔁 4    💬 1    📌 1

Ah, that would explain it.

14.12.2025 23:38 | 👍 1    🔁 0    💬 0    📌 0
"I wrote: "Your call for a Palestinian state pours fuel on the antisemitic fire. It rewards Hamas terrorists. It emboldens those who menace Australian Jews and encourages the Jew hatred now stalking your streets.""

"The attack at a Hanukkah celebration in Sydney today was a vile act of antisemitic terror. I mourn those who were murdered and will be keeping their families, the Jewish community, and the Chabad movement in my prayers. May the memories of all those killed be a blessing.

While we are still waiting for all the facts to emerge, what we already know is devastating. At least 11 dead, including Rabbi Eli Schlanger, who held deep ties to Crown Heights. At least 29 injured. Another Jewish community plunged into mourning and loss, a holiday of light so painfully reduced to a day of darkness. This attack is merely the latest, most horrifying iteration in a growing pattern of violence targeted at Jewish people across the world. Too many no longer feel safe to be themselves, to express their faith publicly, to worship in their synagogues without armed security stationed outside. What happened at Bondi is what many Jewish people fear will happen in their communities too.

On Bondi Beach today, as men with long guns targeted innocents, another man ran towards the gunfire and disarmed a shooter. Tonight, as Jewish New Yorkers light menorahs and usher in a first night of Hanukkah clouded by grief, let us look to his example and confront hatred with the urgency and action it demands. When I am Mayor, I will work every day to keep Jewish New Yorkers safe - on our streets, our subways, at shul, in every moment of every day. Let this be a purpose shared by every New Yorker, and let us banish this horrific violence to the past."

Compare the statement about the antisemitic terror attack in Australia by the Israeli Prime Minister with the one by Zohran Mamdani, and ask yourself who more truly cares about condemning antisemitism, as opposed to using it to promote unrelated politics.

14.12.2025 16:45 | 👍 1663    🔁 495    💬 19    📌 18

I didn't... is it because I'm at a non-profit research institute but not a university? 🤔

14.12.2025 20:56 | 👍 1    🔁 0    💬 1    📌 0

"The incentives made me do it" is an excuse, not a justification. You can be better than that, and if you're not it's because you choose to not be.

14.12.2025 20:54 | 👍 7    🔁 0    💬 0    📌 0

I saw a pamphlet recently from WotC to game store employees about how to talk to parents who thought Magic: the Gathering was Satanic and whose kids were into it. Someone had found it in a box and framed it.

13.12.2025 17:51 | 👍 4    🔁 0    💬 0    📌 0

It's a wrap on EvalEval in San Diego! A jam-packed day of learning, making new friends, critically examining the field of evals, and walking away with renewed energy and new collaborations!

We have a lot of announcements coming, but first: EvalEval will be back for #ACL2026!

10.12.2025 22:55 | 👍 5    🔁 1    💬 1    📌 0

In 2023-ish it was trendy to write papers trying to explain why scaling laws had power law structures. The papers I remember were pretty unconvincing. Did anything meaningful come of this work? What does the best work in this vein look like?

29.11.2025 16:36 | 👍 7    🔁 1    💬 1    📌 0

I highly recommend "do artifacts have politics?"

faculty.cc.gatech.edu/~beki/cs4001...

28.11.2025 17:26 | 👍 5    🔁 0    💬 0    📌 0

Put this person in jail.

Put the person who drafted it in jail.

Put the higher ups who covered it up in jail.

This is a crime. If Kilmar had done the same they wouldn't hesitate to punish him. ICE is a criminal organization, not a law enforcement organization, and justice requires accountability.

20.11.2025 18:44 | 👍 10    🔁 1    💬 0    📌 0

Hyped to write "The models in this paper cost us 476,246.57 USD to train. I'm sorry you are sad we didn't redo all of our experiments on multiple independent training datasets. If you'd like to give us a million dollars we'd be happy to run the experiments you wish" in my response to a reviewer.

14.11.2025 23:07 | 👍 41    🔁 5    💬 1    📌 1

🚨 AI keeps scaling, but social impact evaluations aren't–and the data proves it 🚨

Our new paper, 📎"Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations," analyzes hundreds of evaluation reports and reveals major blind spots ‼️🧵 (1/7)

13.11.2025 13:59 | 👍 11    🔁 3    💬 1    📌 0

Our #NeurIPS2025 paper shows that even comparable monolingual tokenizers have different compression rates across languages. But by getting rid of whitespace tokenization and using a custom vocab size for each language, we can reduce token premiums. Preprint out now!

28.10.2025 15:11 | 👍 34    🔁 5    💬 1    📌 2
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mi...

Small caveat: I misunderstood arXiv's ToS when I wrote this paper. While a large portion of arXiv has an open license, the majority (last time I checked) does not. That shouldn't have a check under "author."

PG-19 lacks one because of how radically technology has changed.

arxiv.org/abs/2101.00027

20.10.2025 06:55 | 👍 2    🔁 0    💬 0    📌 0

In the original Pile paper we talked about various conceptions of consent (though I don't stand by everything I wrote about this topic 5 years ago). None of this data has EIC, though I think that the ones marked "author" in the table are ones where authorial objection would be unreasonable.

20.10.2025 06:55 | 👍 3    🔁 0    💬 1    📌 0

Adding to what @mmitchell.bsky.social said, EIC cannot be use-agnostic by definition. It must be explicit to the use in question. If you put a notice that says "everyone can use this for every purpose" that's *not* EIC.

20.10.2025 06:55 | 👍 4    🔁 0    💬 1    📌 0