🚨 AI keeps scaling, but social impact evaluations aren't - and the data proves it 🚨
Our new paper, "Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations," analyzes hundreds of evaluation reports and reveals major blind spots ‼️🧵 (1/7)
13.11.2025 13:59
Our #NeurIPS2025 paper shows that even comparable monolingual tokenizers have different compression rates across languages. But by getting rid of whitespace tokenization and using a custom vocab size for each language, we can reduce token premiums. Preprint out now!
28.10.2025 15:11
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mi...
Small caveat: I misunderstood arXiv's ToS when I wrote this paper. While a large portion of arXiv has an open license, the majority (last time I checked) does not. That shouldn't have a check under "author."
PG-19 lacks one because of how radically technology has changed.
arxiv.org/abs/2101.00027
20.10.2025 06:55
In the original Pile paper we talked about various conceptions of consent (though I don't stand by everything I wrote about this topic 5 years ago). None of this data has EIC, though I think that the ones marked "author" in the table are ones where authorial objection would be unreasonable.
20.10.2025 06:55
Adding to what @mmitchell.bsky.social said, EIC cannot be use-agnostic by definition. It must be explicit to the use in question. If you put a notice that says "everyone can use this for every purpose" that's *not* EIC.
20.10.2025 06:55
That was the best I was able to find, I swear I've read others though.
04.10.2025 06:25
It doesn't directly address your original question though... maybe I should write a blog post about it.
04.10.2025 06:24
You can't train an LLM like that without multiple revolutionary breakthroughs. It's a common talking point from people who are grifters or clueless, but the technology simply doesn't work like that.
04.10.2025 06:09
I feel like there are several blog posts or papers that put forth a research agenda of "making AI research a scientific field" or "advancing the science of AI" or something like that. I'm having trouble finding them. Does this ring a bell to anyone / does anyone have links to notable examples?
03.10.2025 16:00
Obviously she assaulted him /s
26.09.2025 13:20
No, it's not The Incentives - it's you
There's a narrative I find kind of troubling, but that unfortunately seems to be growing more common in science. The core idea is that the mere existence of perverse incentives is a valid and sufficient reason to knowingly behave in an antisocial way, just as long as one first acknowledges the existence of those perverse incentives. The way this dynamic usually unfolds is that someone points out some fairly serious problem with the way many scientists behave - say, our collective propensity to p-hack as if it's going out of style, or the fact that we insist on submitting our manuscripts to publishers that are actively trying to undermine our interests - and then someone else will say, "I know, right - but what are you going to do, those are the incentives."
I want to print it out giant and put it everywhere
18.09.2025 00:05
This is always dire, but especially so when the US is being run by an authoritarian who delights in using state power to go after people he dislikes. The second OpenAI starts asking for your ID, the government will be asking OpenAI for your chats.
18.09.2025 00:40
We Need to Think Beyond Police in Mental Health Crises
In March of 2020, Joe Prude called 911 for assistance. His brother, Daniel Prude, was behaving erratically and had just bolted out the back door of Joe…
OpenAI says adult chats deserve confidentiality, then singles out teens for surveillance and says that they'll call the cops on people with mental health crises.
This will kill people and not help them get the care they need. It happens all the time.
www.vera.org/news/we-need...
18.09.2025 00:40
Building towards age prediction
Learn how OpenAI is building age prediction and parental controls in ChatGPT to create safer, age-appropriate experiences for teens while supporting families with new tools.
The new "teen safety" program from OpenAI repeats the same lies that companies and governments have been saying since the internet began. This won't achieve better online safety for kids, but it will suppress individual liberty and promote censorship.
openai.com/index/buildi...
18.09.2025 00:40
the *cato* institute says less than 10% of politically motivated terrorism is caused by leftists. the *cato* institute.
more than two-thirds is from the far-right.
17.09.2025 17:07
There are some papers demonstrating that this improves performance, especially in translation contexts IIRC.
05.09.2025 21:01
It's also a pretty notable comment about my friend group that when I wrote this comment I was considering "partner" to be the opposite-gender counterpart of "girlfriend"
02.09.2025 02:49
How can an imitative model like an LLM outperform the experts it is trained on? Our new COLM paper outlines three types of transcendence and shows that each one relies on a different aspect of data diversity. arxiv.org/abs/2508.17669
29.08.2025 21:45
How did you learn to present code? Are there resources that you recommend using to help teach people?
27.08.2025 18:02
Good luck! Maybe you'll succeed where people have failed for decades.
26.08.2025 17:21
Same
(Since I don't know most of the people in this thread, the joke is that I run one of the servers Naomi mentioned. Except it's not a joke.)
26.08.2025 17:08
I have so few straight friends that "partner" to me is mostly coded as "bi but in a relationship with someone of the opposite gender"
26.08.2025 17:05
You're right that this is an active area of research but I'm unaware of any meaningful successes coming out of it.
26.08.2025 17:01
Can you name an example of an idea that is well-grounded in biology that has proven successful for neural networks? I don't mean "oh DL was inspired by how non-neuroscientists think the brain works," I mean an actual case of making a model work better by making it more brain-like.
26.08.2025 17:00
Digging into unpopular positions with no evidence while the rest of the world passes them by is basically my expectation for sufficiently senior folk.
26.08.2025 16:58
Here are a couple of slides that I presented yesterday at #aitechgov about open-weight model risk management.
17.08.2025 10:39
Age verification laws are sweeping the US, changing the future of online speech
Age verification laws have been passed in at least 24 states. Some say it's an effort to protect kids, while others say it restricts protected speech.
"Your driver's license contains a ton of somewhat immutable information about you" like your name, address, DOB, and face, EFF's Lisa Femia told the @thetennesean.bsky.social. It's not like a credit card number that can be replaced if it's leaked.
14.08.2025 21:16
We are a researcher community developing scientifically grounded research outputs and robust deployment infrastructure for broader impact evaluations.
https://evalevalai.com/
Director, Center for Tech Responsibility@Brown. FAccT OG. AI Bill of Rights coauthor. Former tech advisor to President Biden @WHOSTP. He/him/his. Posts my own.
Data Science PhD from Drexel CCI
🇺🇸🏳️‍🌈🏳️‍⚧️👩🏽‍💻 in 🇦🇺🇨🇦
AI technical gov & risk management research. PhD student @MIT_CSAIL, fmr. UK AISI. I'm on the CS faculty job market! https://stephencasper.com/
Director of data & digital scholarship at an ivy league univ | organizing a bunch of #data rescuers @datarescueproject.org | support #qualitative research & software | PhD in American history | thoughts are mine not my employer's
Fortune reporter covering AI
Sharon.goldman@fortune.com
studying the minds on our computers | https://kyobrien.io
Public Access to Public Data is a Public Good. We want to ensure our data are not gone forever. Read more about our efforts: https://www.datarescueproject.org/press/
Journalist, looking for next thing. Bylines all over the place.
Book: NOT A SCIENTIST: How politicians mistake, misrepresent, and utterly mangle science (WW Norton).
Signal: @davelevitan.26
www.davelevitan.com
Sign up: www.gravityisgone.com
Knowing things is a solved problem. Getting along is not. Working on AI, media, and inter-group conflict @CHAI_Berkeley. Got here from computational journalism.
Professor at NYU; Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
http://yann.lecun.com
Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU
https://emerge-lab.github.io
https://www.admonymous.co/eugenevinitsky
NLP Researcher at EleutherAI, PhD UC San Diego Linguistics.
Previously PleIAs, Edinburgh University.
Interested in multilingual NLP, tokenizers, open science.
Boston. She/her.
https://catherinearnett.github.io/
VP and Distinguished Scientist at Microsoft Research NYC. AI evaluation and measurement, responsible AI, computational social science, machine learning. She/her.
One photo a day since January 2018: https://www.instagram.com/logisticaggression/
AI researcher at XBOW. Security, RE, ML. PGP http://keybase.io/moyix/
busy building stuff. likes: offensive security, LLMs, and dumb memes. prev: research scientist @ OpenAI / CS PhD @ Harvard / cofounded DEF CON AI Village
LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer.
All posts public domain under CC0 1.0.