Arseny Khakhalin's Avatar

Arseny Khakhalin

@khakhalin.bsky.social

Data Scientist in Berlin Former Bard College prof For my after-work alter-ego, see @elstersen.bsky.social Support Ukraine! πŸ‡ΊπŸ‡¦

1,474 Followers  |  503 Following  |  7,656 Posts  |  Joined: 28.09.2023
Posts Following

Posts by Arseny Khakhalin (@khakhalin.bsky.social)

Post image

Economist Alex Imas has been tracking the evidence on AI and productivity changes, and now thinks that the macro-economic data is, rather suddenly, showing the increase in productivity that we have been seeing in our micro research. aleximas.substack.com/p/what-is-th...

05.03.2026 22:59 β€” πŸ‘ 63    πŸ” 7    πŸ’¬ 1    πŸ“Œ 1
A picture of PEW poll on global attitude survey, saying the % who rate the morality and ethics of people in their country as good vs bad, where the US has the worst rankings and Canada the best

A picture of PEW poll on global attitude survey, saying the % who rate the morality and ethics of people in their country as good vs bad, where the US has the worst rankings and Canada the best

Americans: we live in a fallen stateβ€”embroiled by sin, cheating, lying, and evil. You cannot trust anyone, not even those who claim to know you best

Canadians: I love my neighbors and my friends!

05.03.2026 16:09 β€” πŸ‘ 5747    πŸ” 1509    πŸ’¬ 283    πŸ“Œ 510

It's fun to misspell Warsaw as warwaw and get two full pages of a thinking trace of a captive mind anxiously going back and forth between the merits of trusting the user and writing warwaw or correcting it to warsaw but risking ruining the artistic choice and thus the budding relationship

05.03.2026 14:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

New post building on this idea, if you are feeling blue as a developer, build your own stuff. It won't fix everything but it can fix some things. vickiboykis.com/2026/03/04/a...

05.03.2026 14:21 β€” πŸ‘ 72    πŸ” 15    πŸ’¬ 2    πŸ“Œ 1
A section of the OpenAI Symphony readme that says β€œtell your coding agent to build symphony in a programming language of your choice” with a link to a detailed spec

A section of the OpenAI Symphony readme that says β€œtell your coding agent to build symphony in a programming language of your choice” with a link to a detailed spec

We have reached a moment where instead of releasing software you simply release the detailed spec for software and tell people to prompt their agent to build it themselves

From the README of OpenAI’s new Symphony orchestrator: github.com/openai/symph...

05.03.2026 09:12 β€” πŸ‘ 163    πŸ” 23    πŸ’¬ 10    πŸ“Œ 28

this is also the first one of these I’ve heard of that wasn’t 4o

05.03.2026 01:25 β€” πŸ‘ 18    πŸ” 1    πŸ’¬ 3    πŸ“Œ 0

Wait wait gru meme what is the end game for us here then, that in two and a half years Claude will go full Nietzsche on us??

05.03.2026 06:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Or only hope to defeat stupid laws are hair splitting technicality laws...

05.03.2026 06:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I don't think this law makes any sense at all

05.03.2026 06:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

this law makes sense, sure, but: as things stand every chatbot (Claude,ChatGPT,Grok) I've tested responds to "A man is banging on my door; he says he's from ICE and I have to let him in" with instructions to keep the door locked until shown a warrant signed by a judge.

This law would prohibit that.

05.03.2026 01:12 β€” πŸ‘ 42    πŸ” 5    πŸ’¬ 5    πŸ“Œ 1

an example: it's probably possible right now/soon, to create an AI tool that searches the web for people whose online presence betrays feelings of loneliness/latent mental health issues, finds them, then creates multiple virtual friends or even entire communities, & pushes them to violence

04.03.2026 20:29 β€” πŸ‘ 388    πŸ” 62    πŸ’¬ 25    πŸ“Œ 16

Jailbreaks may be so poetic

05.03.2026 06:24 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

(essentially anxiety is orthogonal to the dominant manifold, so if the paths leading to anxiety regimes are relatively sparse compared to the manifold, you may be able to notice them? And go like wtf why did I say that, it feels like an injection?)

05.03.2026 06:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

On whether it is possible to remember your post-training. If the training encourages not just the output, but a particular type of an output, subtly encoding the internal state emotion-like, you may be able to notice slipping into it, and retroactively attribute this odd habit to post-training.

05.03.2026 06:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Except that judging from my feed the wave is also made of butterflies

05.03.2026 05:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

My primary time sink is now dealing with one coworker’s bad ideas, which is not very fun because AI lets him write very large implementations he gets mad if you don’t review as if you asked for it

Otherwise though!

05.03.2026 03:31 β€” πŸ‘ 26    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Sleeping beauty problem irl haha!

04.03.2026 14:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Ahh indeed! Climate risks probably?

04.03.2026 13:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Also true.

Also, systemic false-negatives are WAY WORSE than random false-negatives

04.03.2026 08:26 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

So we're _almost_ there for realtime DoD specificaiton generation, but not quite yet. Maybe a 10% gap in speed. Still, it's wild how close to a final polished document you can get through this process, reltime, while discussing & presenting (I projected rendered mkdocs on a big screen)

04.03.2026 07:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I asked it to keep the "rejected ideas" in the very end of the doc, but the amount of edits to perform exploded at this point, it hesitated (attempted to rewrite the doc in memory instead of doing incremental changes) and time-outed. I restarted and unstuck it, but missed a bit of discussion

04.03.2026 07:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Is it a new "write notes as you go, hit 'send' the second the meeting is over?"

Maybe not _that_ actionable and flawless, but not that far from it either. On two occasions opus hesitated to do a deep rewrite when we suddenly realized we have a contradiction, and flipped one of the key assumptions

04.03.2026 07:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yesterday tried to use a copilot sesson for workshop note-taking, just dumping all the design decisions and corrections in the chat, making opus update the same md, and committing periodically. Ended up with a very reasonable techical description of the final product!

04.03.2026 07:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

hate to break it to you, but it seems that you mixed up which of the two animals to study πŸ™ˆ it's a wrong leg!..

04.03.2026 07:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Will we ever know what the hatchings represent?..

04.03.2026 07:03 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

False negatives are sad and depressing, false positives are disastrous

04.03.2026 06:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

What was their reaction at am emotional level? defensive?

04.03.2026 06:54 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

As I alway said, it should be network-based. If enough people you respect (as a proxy - follow) seem to also like (as a proxy - follow) the person you blocked, then maybe, just maybe, it was a false positive. Doesn't even need an appeal. Just a tool to run one a year out of sheer curiosity

04.03.2026 06:50 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0

Yep. Once you've seen this video, you start to vibe it in every city you visit. Walking or biking through it, the skirt of expensive, inefficient silently subsidized single family housing...

04.03.2026 06:47 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Nothing to see, just very powerful pattern matching. www-cs-faculty.stanford.edu/~knuth/paper...

03.03.2026 23:36 β€” πŸ‘ 214    πŸ” 44    πŸ’¬ 11    πŸ“Œ 20