Stella Biderman (@stellaathena) — Bluesky Profile

1 week ago

I'm not sure if I'm more called out by this skeet or the fact that I've had two kidney stones already tbh...

1 month ago

Examples of mislabeled web text by existing LangID systems. A full text version is available on the blog post below.

Language identification still proves to be a challenging task, especially for web data. In collaboration with @mlcommons.org @eleutherai.bsky.social @jhu.edu and 97 community members, we created CommonLID, a new benchmark for LangID for 100+ languages!

1 month ago

Has anyone else had Claude Code become non-functional recently? Even with a test input it spins for minutes without doing anything. Same thing happens in terminal.

1 month ago

It's going to get worse because people hate AI

1 month ago

The only reasons I use social media platforms are to get eyeballs on research and to yell at people who are wrong online.

X > Bluesky at both for me

2 months ago

And the next talk (exact details TBA) is by @pjox.bsky.social and @very-laurie.bsky.social from Common Crawl, on work we've been collaborating on to build better benchmarking of LangID systems and understand the issues with the long tail of human language that come up at Common Crawl scales.

2 months ago

We’re bringing back a Community Spotlight talk series, highlighting cool work being done by members of our community. We’re kicking it off with a talk on running diffusion-based world-models in real time on consumer hardware.

Jan 9th at 2 pm US Eastern Time

2 months ago

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards

Filtering pretraining data prevents dangerous capabilities, doesn’t sacrifice general performance, and results in models that are resistant to tampering.

What are people's favorite paper / project websites? I'm looking to build a library to base future ones I make off of.

The one EleutherAI has done that I'm proudest of is deepignorance.ai

2 months ago

Imagine if they had done their job and put the pedophile insurrectionists in jail for the rest of their lives.

2 months ago

I'm late to seeing this post, but this is a super cool paper! Thanks for sharing. Do you have more work in this vein in the works?

2 months ago

60 Minutes Inside CECOT : Free Download, Borrow, and Streaming : Internet Archive

Full video of the 60 Minutes Inside CECOT episode that CBS pulled.

If you're looking for the 60 Minutes piece on CECOT that Bari Weiss canned to protect the President, you can find it here: archive.org/details/60-m...

2 months ago

Ukraine is not the only country Sebastian has created non-profits to support. The @datarescueproject.org has spent the past year coordinating and managing volunteer communities in the US working to back up scientific, cultural, and historical data that the USG wants to suppress.

2 months ago

They were honored alongside the Ukrainian Library Association whose on-the-ground work in Ukraine to promote and preserve Ukrainian language and culture has been all the more vital as Ukraine fights for its existence.

2 months ago

Incredibly proud of my friend and colleague @storytracer.com. Two weeks ago he and his cofounders @sucho-org.bsky.social were honored for organizing a global network of volunteers to exfiltrate and back up endangered Ukrainian cultural heritage in the wake of the invasion by Russia.

2 months ago

Really great to see NVIDIA staking out a pro-open data position. This used to be common, if not the norm, in AI, and the backing away from this level of transparency has done a lot of harm to the research community.

2 months ago

Ah, that would explain it.

2 months ago

"I wrote: 'Your call for a Palestinian state pours fuel on the antisemitic fire. It rewards Hamas terrorists. It emboldens those who menace Australian Jews and encourages the Jew hatred now stalking your streets.'"

"The attack at a Hanukkah celebration in Sydney today was a vile act of antisemitic terror. I mourn those who were murdered and will be keeping their families, the Jewish community, and the Chabad movement in my prayers. May the memories of all those killed be a blessing.

While we are still waiting for all the facts to emerge, what we already know is devastating. At least 11 dead, including Rabbi Eli Schlanger, who held deep ties to Crown Heights. At least 29 injured. Another Jewish community plunged into mourning and loss, a holiday of light so painfully reduced to a day of darkness. This attack is merely the latest, most horrifying iteration in a growing pattern of violence targeted at Jewish people across the world. Too many no longer feel safe to be themselves, to express their faith publicly, to worship in their synagogues without armed security stationed outside. What happened at Bondi is what many Jewish people fear will happen in their communities too.

On Bondi Beach today, as men with long guns targeted innocents, another man ran towards the gunfire and disarmed a shooter. Tonight, as Jewish New Yorkers light menorahs and usher in a first night of Hanukkah clouded by grief, let us look to his example and confront hatred with the urgency and action it demands. When I am Mayor, I will work every day to keep Jewish New Yorkers safe—on our streets, our subways, at shul, in every moment of every day. Let this be a purpose shared by every New Yorker, and let us banish this horrific violence to the past."

Compare the statement about the antisemitic terror attack in Australia by the Israeli Prime Minister with the one by Zohran Mamdani, and ask yourself who more truly cares about condemning antisemitism, as opposed to using it to promote unrelated politics.

2 months ago

I didn't... is it because I'm at a non-profit research institute but not a university? 🤔

2 months ago

"The incentives made me do it" is an excuse, not a justification. You can be better than that, and if you're not, it's because you choose not to be.

3 months ago

I saw a pamphlet recently from WotC to game store employees about how to talk to parents who thought Magic: the Gathering was Satanic and whose kids were into it. Someone had found it in a box and framed it.

3 months ago

It's a wrap on EvalEval in San Diego! A jam-packed day of learning, making new friends, critically examining the field of evals, and walking away with renewed energy and new collaborations!

We have a lot of announcements coming, but first: EvalEval will be back for #ACL2026!

3 months ago

In 2023-ish it was trendy to write papers trying to explain why scaling laws had power law structures. The papers I remember were pretty unconvincing. Did anything meaningful come of this work? What does the best work in this vein look like?

3 months ago

I highly recommend "Do Artifacts Have Politics?"

faculty.cc.gatech.edu/~beki/cs4001...

3 months ago

Put this person in jail.

Put the person who drafted it in jail.

Put the higher ups who covered it up in jail.

This is a crime. If Kilmar had done the same they wouldn't hesitate to punish him. ICE is a criminal organization, not a law enforcement organization, and justice requires accountability.

3 months ago

Hyped to write "The models in this paper cost us 476,246.57 USD to train. I'm sorry you are sad we didn't redo all of our experiments on multiple independent training datasets. If you'd like to give us a million dollars we'd be happy to run the experiments you wish" in my response to a reviewer.

4 months ago

🚨 AI keeps scaling, but social impact evaluations aren’t–and the data proves it 🚨

Our new paper, 📎“Who Evaluates AI’s Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations,” analyzes hundreds of evaluation reports and reveals major blind spots ‼️🧵 (1/7)

4 months ago

Our #NeurIPS2025 paper shows that even comparable monolingual tokenizers have different compression rates across languages. But by getting rid of whitespace tokenization and using a custom vocab size for each language, we can reduce token premiums. Preprint out now!

4 months ago

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mi...

Small caveat: I misunderstood arXiv's ToS when I wrote this paper. While a large portion of arXiv has an open license, the majority (last time I checked) does not. That shouldn't have a check under "author."

PG-19 lacks one because of how radically technology has changed.

arxiv.org/abs/2101.00027

4 months ago

In the original Pile paper we talked about various conceptions of consent (though I don't stand by everything I wrote about this topic 5 years ago). None of this data has EIC, though I think that the ones marked "author" in the table are ones where authorial objection would be unreasonable.

4 months ago

Adding to what @mmitchell.bsky.social said, EIC cannot be use-agnostic by definition. It must be explicit to the use in question. If you put a notice that says "everyone can use this for every purpose" that's *not* EIC.
