Kabir Kumar, aiplans.org's Avatar

Kabir Kumar, aiplans.org

@kabirkumar.bsky.social

I run AI Plans, an AI Safety lab focused on solving AI Alignment before 2029. For several weeks I used a stone for a pillow. I once spent a quarter of my paycheck on cheese. Ping me! DMs not working atm due to totalitarian UK law :( SurpassAI

1,018 Followers  |  1,420 Following  |  4,389 Posts  |  Joined: 25.12.2023  |  2.1626

Latest posts by kabirkumar.bsky.social on Bluesky

is there any candidate who's running on a platform to regulate recommendation algorithms?

12.10.2025 04:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

they will take everything before they kill us and then take more still

12.10.2025 03:37 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
The Company Man To get to the campus, I have to walk past the fentanyl zombies.

this is one of the most specifc and funny things I've read in a while: tomasbjartur.substack.com/p/the-compan...

12.10.2025 03:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

ahhh. that makes sense, i think. i think im still confused as to why the bill change did that, when it just allowed mining when it wasnt allowed before

12.10.2025 02:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'm confused. I thought you were saying that some americans tried removing the law so they could mine the metals needed, but dod stopped them because that would somehow give away ip, right?

11.10.2025 22:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

>
1) cuz they were doing a huge legal oopsie by handing over military tech IP to the Chinese because there was no other way of making advanced sensors etc

at this point though, surely china is ahead??

11.10.2025 22:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

though more religions

11.10.2025 22:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

btw, this is what I feel like with alignment and incentive structures

11.10.2025 22:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

What the fuck. Why did the DoD do that?? And why has no other country tried to jump on it if they've seen how much of an advantage China gained from it??

11.10.2025 22:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

guess how this gets funnier, without looking it up

11.10.2025 22:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Dwarkesh feels like it's gotten watered down imo

11.10.2025 03:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Laudatory definition. Expressing praise and commendation

Laudatory definition. Expressing praise and commendation

For those of less erudite persuasion, such as myself:

10.10.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Shared it with a bright young teammate!

10.10.2025 13:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

At best, this is addiction, not abuse. But really, I think the ai might just be understanding that the most likely thing to do when given the chance to gamble a lot, is to gamble a lot and when given the chance to gamble more, gamble more.

10.10.2025 13:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Evals Guide Some definitions or descriptions will be imprecise. Part of the problem to solve is finding words which more accurately describe what we're looking for. Feedback, comments and questions are very welco...

docs.google.com/document/d/1...
which has eval red teaming components

10.10.2025 01:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

actually hosting an alignment evals hackathons, where teams are red teaming alignment evals, for this.
also, very cool to say, that someone has made a paper inspired by our previous red teaming approach in hackathons: anloehr.github.io/ai-projects/...

and making a course (very early stage):

10.10.2025 01:53 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Us:
www.lesswrong.com/posts/35vPhn...

10.10.2025 01:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

it's obv a smaller model??

10.10.2025 01:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

all of these evals themselves need to actually be thoroughly tested

10.10.2025 01:48 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

gm

10.10.2025 01:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Congratulations Dr. Edstedt!!

10.10.2025 01:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I want them to sound more like robots tbh

10.10.2025 01:44 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

what's the sexual connotation???

10.10.2025 01:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

nooki.me/c/AISafety

10.10.2025 01:42 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

affiliating, sure. to actually do stuff and find people to work with though, who are really strongly motivated to actually change the world and willing to say when they're wrong, there's not many better atm.

09.10.2025 23:35 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

what about potato waffles tho??

09.10.2025 11:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

gonna be using 'professor-ass web app' in my lingo from now on

09.10.2025 11:52 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

I need to use bsky more because I'm still amazed to see this not erupt into name calling and talking past each other

09.10.2025 11:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

this is great, thank you!!

09.10.2025 11:45 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

really? i don't think it has, tbh

09.10.2025 11:42 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@kabirkumar is following 20 prominent accounts