congrats!!!
09.09.2025 22:48 β π 1 π 0 π¬ 0 π 0congrats!!!
09.09.2025 22:48 β π 1 π 0 π¬ 0 π 0so close! that's standard error β€οΈ
06.08.2025 17:12 β π 3 π 0 π¬ 0 π 0i'm still pissed about this like the difference is literally too small to have been distinguishable with swe bench (500 samples) lmaoooo
06.08.2025 03:54 β π 8 π 0 π¬ 0 π 0hey wasn't this the same company that made a beautiful shiny "research" post about how AI evals should include error bars or something like that. or did they decide the CLT didn't apply here
06.08.2025 03:20 β π 39 π 3 π¬ 5 π 1I will be at ICML in a few weeks & would love to chat about how to make this real - I am a critic at heart and also hate self-promo so thatβs how you know I really believe in this π₯²
01.07.2025 23:39 β π 1 π 0 π¬ 0 π 0
various ways to read more π
blog post- argmin.net/p/individual...
position paper- arxiv.org/abs/2506.18133
fairness-oriented instantiation- arxiv.org/abs/2502.08166
& many thanks to brilliant collaborators
@rajiinio.bsky.social @irenetrampoline.bsky.social @beenwrekt.bsky.social & paula gradu !!
lots of other stuff I wonβt get into rn (e.g., I think this is a prereq to any serious attempt at βdemocraticβ AI!), and thereβs also a ton of open research questions (stats, econ/ml, empirical methods, hci, β¦)
01.07.2025 23:38 β π 1 π 0 π¬ 1 π 0the core concept is individual reporting as a means to build collective knowledge. if one person has a bad experience, that doesnβt necessarily mean that thereβs something wrong with the system β but if lots of people start reporting similar things, maybe we should pay attention.
01.07.2025 23:38 β π 2 π 0 π¬ 1 π 0weβve already seen this informally with the chatgpt sycophancy debacle β a few days of twitter virality resulted in action and statements from openai β but what other, subtler, patterns are happening? what could we discover if we had better ways to listen to the public?
01.07.2025 23:38 β π 2 π 1 π¬ 1 π 0
individual reporting for post-deployment evals β a little manifesto (& new preprints!)
tldr: end users have unique insights about how deployed systems are failing; we should figure out how to translate their experiences into formal evaluations of those systems.
@jessica.bsky.social on individual reporting as a means to build collective knowledge.
24.06.2025 14:46 β π 8 π 2 π¬ 1 π 0
right but one would hope that the date of doom _does_ get further away as safety research improves
bsky.app/profile/jess...
help ..
02.05.2025 19:55 β π 3 π 0 π¬ 0 π 0where are the bullshit "x% of experts believe" polls when you need them lol
24.04.2025 17:31 β π 6 π 0 π¬ 0 π 0well probably, but i wanna know how folks who do believe in that happening think about the field
19.04.2025 02:09 β π 0 π 0 π¬ 1 π 0or is it a secret third thing idk. scared to ask this on Real Twitter but genuinely curious how people think about the role of this field
18.04.2025 05:45 β π 0 π 0 π¬ 0 π 0like is it that the field has been ineffective (studied the wrong problems, advocated for the wrong positions, etc) or is it that every step of safety progress has been matched by 2 steps of capabilities progress (in which case, what are the best examples of safety work concretely reducing harm?)
18.04.2025 05:44 β π 3 π 0 π¬ 1 π 1perhaps this is a stupid question but given that ai safety has been a pretty vibrant (+ well funded) field for the last 5-10 years... how should we be thinking about the concern that (ai) catastrophe still is, allegedly, imminent
18.04.2025 05:42 β π 6 π 0 π¬ 2 π 0
in middle school we were asked to write a short story in the style of edgar allan poe. as you might expect, all of our little pieces (even, especially, the ones the students thought were "good") were hilariously bad. anyway, i had forgotten about that homework until now
x.com/sama/status/...
back on bluesky to be mean about ai discourse
10.02.2025 18:04 β π 4 π 0 π¬ 0 π 0im ngl i think this kinda just means u are stupid
10.02.2025 18:03 β π 3 π 0 π¬ 1 π 0
happy new year
letterstomyfriends.substack.com/p/how-to-hav...
i don't work well under deadline pressure but i also don't work well without it. therefore,
18.12.2024 01:01 β π 4 π 0 π¬ 0 π 0... didn't we just talk about this ...
16.12.2024 23:46 β π 1 π 0 π¬ 0 π 0ill read it
14.12.2024 18:23 β π 1 π 0 π¬ 0 π 0The plan: Post your dissertation abstract online to rekindle a decades-long controversy about the utility of the humanities, turning your paper into the most-read publication in the history of your field
03.12.2024 23:28 β π 10 π 1 π¬ 1 π 0were you born yesterday
02.12.2024 00:50 β π 3 π 0 π¬ 0 π 0wait is that your house lmaooo
01.12.2024 07:25 β π 0 π 0 π¬ 1 π 0where
01.12.2024 07:25 β π 0 π 0 π¬ 1 π 0i just know these people would have been the biggest fans of japanese internment
01.12.2024 02:23 β π 9 π 0 π¬ 3 π 0