I believe that's true in practice, but an interesting exercise to prove it might be to temporarily delete one source file, have a coding agent regenerate it so it compiles and passes the tests, and then do a diff. How much knowledge is lost? Could that be captured by the tests, too?
10.02.2026 17:02 β π 0 π 0 π¬ 0 π 0
Just as a data point, I never heard of him before and donβt know these things. I browsed his Bluesky account and read his Wikipedia article and Iβm not entirely sure what he stands for.
09.02.2026 14:08 β π 0 π 0 π¬ 0 π 0
Yeah but that means the search doesn't help me at all. The point of searching within the page is to scroll down to the post I want to look at.
09.02.2026 03:17 β π 0 π 0 π¬ 0 π 0
Bluesky "For You" feed playground
The thing I was trying to figure out is why "For You" keeps recommending very ordinary posts by Kelsey Hightower to me, and I think it might just be because he has a lot of followers? Maybe posts by people with lots of followers need to be downranked more to make up for wide exposure? Example:
09.02.2026 03:14 β π 0 π 0 π¬ 1 π 0
Also, everywhere there's a DID, put the username instead (or as well). Example: 'top curators who liked it'.
It's good to be able to get a DID, but it shouldn't be the default.
09.02.2026 03:05 β π 0 π 0 π¬ 1 π 0
Also, searching within the page doesn't really work for some reason.
09.02.2026 02:58 β π 0 π 0 π¬ 2 π 0
A simple change would be to make it clearer that you can just paste in your account name and don't need to know your 'did' (which is the first example).
09.02.2026 02:48 β π 0 π 0 π¬ 1 π 0
My project started out that way, with the bot one-shotting a simple, generic link-sharing website. But since then, we've made over 900 commits as we built it out. These are all pretty routine changes that add up to something larger - a more elaborate link-sharing website.
09.02.2026 01:39 β π 1 π 0 π¬ 1 π 0
It makes sense looking at the code it changed. Also, changing the code fixed it. It's pretty much the same thing as reviewing a bugfix made by a human.
I have standing instructions to write a failing test before changing the code. If it didn't write a new test then I tell it to write one.
09.02.2026 01:24 β π 0 π 0 π¬ 0 π 0
My coding agent can investigate a bug and fix it, and explain what it did so *I* understand what the problem was.
I donβt see why I should care whether it βreallyβ understood it or not. If itβs true that it didnβt βreallyβ understand, then apparently it didnβt need to.
08.02.2026 19:08 β π 1 π 0 π¬ 2 π 0
Thatβs certainly a thing that happened, particularly in the early days. Generating AI images is like that. But I havenβt seen it since I started using a coding agent in December. There are bugs but I haveβt had to undo anything. Itβs great at investigating and fixing bugs.
08.02.2026 18:59 β π 0 π 0 π¬ 0 π 0
That study was done almost a year ago and the first model I tried that I thought was good didnβt come out until December. Meanwhile the tools have improved. You canβt expect studies like that to settle questions for all time in a rapidly moving field.
08.02.2026 17:05 β π 0 π 0 π¬ 1 π 0
I've spent a fair amount of time working with a coding agent recently, and this doesn't seem to be a problem in practice. Sometimes it makes mistakes, but then it gets feedback and recovers well.
Being a bit out of date is not that bad when it can read up-to-date documentation and do experiments.
08.02.2026 06:28 β π 0 π 0 π¬ 1 π 0
I guess it depends how you do it. Substack is apparently free, but maybe they wouldn't scale high enough for some newspapers?
08.02.2026 02:20 β π 0 π 0 π¬ 0 π 0
I think the historical analogy works by coincidence, because mailing costs are no longer an issue. Distribution is nearly free for a website. There are different reasons why journalism needs subsidies. (I'd guess mostly labor?)
08.02.2026 02:08 β π 0 π 0 π¬ 1 π 0
We have numbered design docs with a status field at top: "draft", "in progress", "completed". Completed docs go to a completed subdirectory. We don't look at completed docs much.
07.02.2026 01:48 β π 0 π 0 π¬ 0 π 0
We can easily count conversations or Bluesky accounts with a database query. When we count agents, what are we counting?
07.02.2026 01:21 β π 0 π 0 π¬ 0 π 0
LLMs are kind of like sails in that left free flowing they're completely useless but tightly bound and directed they can dramatically accelerate your progress
05.02.2026 20:52 β π 44 π 4 π¬ 3 π 0
Maybe someone will take this as a challenge?
06.02.2026 00:26 β π 0 π 0 π¬ 0 π 0
Depends what you mean by βlimited.β Itβs will pick strings it wasnβt trained on. Randomly if temperature>0. The set of possible (though unlikely) strings is much larger than its training set.
The same is true of a random string generator. Very low probabilities arenβt practically possible, butβ¦
04.02.2026 17:44 β π 1 π 0 π¬ 1 π 0
The actual Turing test is a party game like Werewolf. As with all games it depends on the skills of the players; testing a chess bot against random undergrads wouldn't be the same as testing it against grandmasters. Who is testing LLM's using the actual Turing test against skilled players?
04.02.2026 06:31 β π 0 π 0 π¬ 1 π 0
Wrote blog.exe.dev/expensively-... to dig into how cache reads costs dominate LLM agent conversations. Several visualizations and one terrible pun included!
03.02.2026 17:28 β π 31 π 4 π¬ 0 π 1
There's a sense in which every possible string already exists in Borge's Library of Babel. When you write, you're picking a string that already exists.
I can't think of any practical implications, though. It seems like the normal way we use "exists," for things we actually wrote, is better?
04.02.2026 05:58 β π 0 π 0 π¬ 1 π 0
I see it more like a dog personality. Claude Opus 4.5 makes sure you know that itβs into whatever coding task youβre doing today. Iβve tried other models that give the impression that theyβre silently judging me.
I donβt really expect a smart ghost dog to bite its owner if theyβre being bad.
03.02.2026 16:17 β π 0 π 0 π¬ 0 π 0
I prefer "ghosts" because they're immaterial beings that resemble people. Maybe they'll become golems if they can get the robots to work well?
03.02.2026 00:43 β π 1 π 0 π¬ 0 π 1
exe.dev - Persistent VMs via SSH
Start VMs with persistent disks in seconds. The disk persists. You have sudo.
Alternatively check out Shelley from exe.dev, which has a nice web UI for chatting with a coding agent running in a VM. No routing through Telegram needed; you can go to the website directly. Works great for me in Chrome on Android, iPad, and laptop.
03.02.2026 00:22 β π 0 π 0 π¬ 0 π 0
"Refuse small speed improvements especially if they make the code much slower" - oops, that doesn't make logical sense.
I find that asking a coding agent to read over my AGENTS file and offer suggestions will sometimes find stuff like this.
02.02.2026 17:17 β π 0 π 0 π¬ 1 π 0
Archaeologists find a supersized medieval shipwreck in Denmark
The sunken ship reveals that the medieval European economy was growing fast.
In Medieval Europe ships called cogs revolutionised shipping simply by their size. A cog named SvΓ¦lget 2 was recently found off the coast of Copenhagen: itβs 28 metres from bow to stern and, preserved under sand, its rigging is still intact. buff.ly/tqiJIJQ
#ShareGoodNewsToo
02.02.2026 13:50 β π 71 π 23 π¬ 1 π 2
Person: say, i am alive.
Computer: I am alive.
Person: oh my god.
01.02.2026 23:48 β π 22549 π 4965 π¬ 82 π 126
Co-founder at Asana and Good Ventures (a funding partner of Coefficient Giving). Meta delenda est. Strange looper.
A modern runtime for JavaScript and TypeScript
https://deno.com
π¦ new englander
π¦ bsky team
βοΈ seats.aero
head of product @bsky.app
Software developer at Deno. Also working on ts-morph, dprint, and more (https://github.com/dsherret)
π delightful docs & social publishing π
try: leaflet.pub/home
explore: leaflet.pub/discover
lab notes: lab.leaflet.pub/
code: tangled.org/leaflet.pub/leaflet
made by @schlage.town + @cozylittle.house + @awarm.space
Interests in ML and social aspects of tech.
Building For You feed: https://bsky.app/profile/spacecowboy17.bsky.social/feed/for-you
Hobby project: linklonk.com
Software engineer at Google in New York, developing analysis and refactoring tools for Go (#golang). Co-author of "The Go Programming Language" (gopl.io).
ceo exe.dev, tailscale co-founder, programmer
software imagineeringβ’ riichi mahjong π
kidney donor
seattle β https://tarng.com
dreaming of protocol level sovereignty
bsky team π
Solar analyst at BloombergNEF, goose keeper. Author of a book, "Solar Power Finance Without the Jargon". Opinions all my own.
words in places and ways they probably shouldn't be
Ophthalmologist. Comedian. Speaker. Jonathan.
Philosophy professor. My new book is THE SCORE, about true play, the limits of data - and why scoring systems can lead to beautiful games and soul-killing metrics.