It seems like the trouble is that the results of some other specialistโs problems might not generalize to *your* problems, and the only way to be sure is to do your own evals. Or maybe people will start publishing increasingly specialized industry benchmarks?
07.08.2025 22:48 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Or accordions.
06.08.2025 17:51 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
The post Jim was replying to is gone, but this seems like good general advice. I read a lot of good stuff on Substack too.
Social networks seem quite useful for finding (and posting) *links* to more substantial articles and discussions, though?
04.08.2025 21:06 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Yeah, but they say you can feed it whatever you want, so I wonder what it does if you feed it a character's dialog? Is there a Sherlock Holmes persona vector?
03.08.2025 13:53 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Persona vectors: Monitoring and controlling character traits in language models
A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior
What happens when you can give it any personality you want? Will some personalities seem more real? Or does being able to manipulate them make them seem less real?
If the AI seems upset then you could cheer it up by giving it an optimistic, bubbly personality.
www.anthropic.com/research/per...
02.08.2025 16:11 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Could also compare with the other anti-immigrant campaigns you wrote about too.
But maybe none of the above? Historical analogies suggest possibilities but they aren't a crystal ball.
01.08.2025 20:48 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
This is great work!
I'm wondering about the persistence API. Suppose you asked Spark to write code that checks the current user against a list of authorized users before writing to the KV store. Is there another way to write to the KV store that bypasses that?
24.07.2025 18:19 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
What does "communal" mean to you? The relationship between a musician and their audience can certainly be commercial. There are transactions. But it seems like it's a community too?
19.07.2025 02:52 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
For me it was employers paying me too much money for some reason, not that I minded. And collectively, this sort of thing drove up the cost of housing for the entire region.
16.07.2025 17:34 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
The people I know who are well off didn't do it that way, though?
14.07.2025 20:40 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
They carelessly used AI because the people recruited to do the work believed in it themselves and didn't mind being sloppy. "Move fast and break things." Being willing to break things is part of Trump's brand, as is being unaccountable.
It undermines trust in government by making it untrustworthy.
12.07.2025 17:42 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Sure, it's a minor perk that's not worth it for a lot of people.
12.07.2025 01:35 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
TSA precheck costs $78 for 5 years, about $16 a year. For someone who travels at least once a year, this is insignificant compared to the cost of the flight and related expenses. You will pay more for checked luggage, or for a sandwich.
Hardly an โelite statusโ thing. More like โplanned ahead.โ
11.07.2025 16:07 โ ๐ 0 ๐ 0 ๐ฌ 2 ๐ 0
I'm wondering there's a gap in public communication about flash flood risks? Do we need something like a "red flag warning" for fires, but for flooding?
If it were an airplane crash investigation, there would be discussion about what to fix.
06.07.2025 16:43 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
I'm curious: which study is that?
01.07.2025 18:11 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
I tried it. It doesnโt support constraints and the AI stuff didnโt work very well. I went back to OnShape.
01.07.2025 13:06 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I wonder if people would like it better if they sold tickets for a random drawing where the prize is a burrito? (Losers get a bag of chips instead.)
29.06.2025 13:25 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
It seems like this calculation assumes people don't cut back on other spending and buy food instead, since food is more important. (That is, in those cases where they have something else to cut back on.) There would be some reduction but not the full amount.
27.06.2025 21:27 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
*Well, that was ingenious #printbooks #devoured
simonwillison.net/2025/Jun/24/...
24.06.2025 22:25 โ ๐ 9 ๐ 3 ๐ฌ 2 ๐ 1
> It would take Iran just a few days to convert the 60% material into weapons-grade material.
www.theguardian.com/world/2024/n...
24.06.2025 15:35 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I think the biggest roadblock is widespread skepticism about the federal government, particularly now. I'm more optimistic about California.
24.06.2025 15:25 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
I don't have a need for dictation myself, but the Recorder app on Pixel phones appears to have some new AI features?
android-developers.googleblog.com/2024/08/reco...
24.06.2025 15:20 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Maybe exporting the audio from Recorder would help? When looking at an individual recording, press the share icon, then select File, Audio, and โFiles by Goโฆ Downloadโ as a destination, which will copy the audio file to your downloads folder.
21.06.2025 14:10 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Ok, I wrote a reply disagreeing with a bad take. That's helpful right?
20.06.2025 02:36 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Hmm. Maybe putting "be brief" in my custom instructions isn't doing me any favors here.
18.06.2025 02:12 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
I tried asking about Iran and it was mediocre, but perhaps I'm holding it wrong. How specifically do you prompt for this?
18.06.2025 02:02 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
They might if you also have the source code in the right place. Do npms commonly bundle both sourcemaps and source? (Haven't used npm in a while.)
17.06.2025 18:46 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
This seems something like what LLM benchmarks attempt to do, by defining some "correct" responses in certain contexts. But there are many possible benchmarks.
17.06.2025 00:35 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Maybe for now, but the approach Deno took with jsr.io seems pretty promising? I dislike ending up in type-stripped, minimized JS in the debugger.
16.06.2025 17:34 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Ophthalmologist. Comedian. Speaker. Jonathan.
Philosophy professor, writing about games, trust, echo chambers, community, bureaucracy and technology. My first book is GAMES: AGENCY AS ART
Winner of the 2025 Thurber Prize for American Humor, Cartoonist and Author https://www.paulnoth.com https://substack.com/@paulnoth?r=7j6we&utm_medium=ios
I do computers. Married to @radkat.fitzpat.com. Three kids. Xoogler.
Go (#golang) team 2010~2020. Made LiveJournal, OpenID, memcached. Currently at @Tailscale.com making WireGuard easy.
Seattle, WA // Bainbridge
CUE, Go, fiddler, climber, gardener, curry maker.
Cybersecurity Reporter, Ars Technica: https://arstechnica.com/author/dan-goodin/ Hungry for tips. Text me on Signal: DanArs.82. "The world isnโt run by weapons anymore, or energy, or money. Itโs run by little 1s and 0s, little bits of data."
Senior Fellow, Carnegie Endowment. Defense analysis with a focus on the Russian and Ukrainian militaries.
Substack: http://lcamtuf.substack.com/archive
Homepage: http://lcamtuf.coredump.cx
Novelist (SF, fantasy), historian (UChicago Renaissance Europe, intellectual history, Italy, classical reception), composer (filk, Norse myth), disability (chronic pain), manga/anime (Tezuka), food, cool history pics (#SomethingBeautiful) Blog exurbe.com
Official Bluesky account for NOAA's National Weather Service.
security & privacy geek by day, decorative artist by night.
Not interested in cryptocurrency or gofundme scams, sorry.
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Searching for the numinous
Australian Canadian, currently living in the US
https://michaelnotebook.com
๐ Creator of Zod, tRPC (v0)
๐ฎ OSS Fellow @ Clerk
๐ฆ Friendly neighborhood TypeScript nerd
๐ง๐ผโ๐ป Prev @ Bun, EdgeDB, YC, MIT
RC F'13, F2'17
Cryptogopher / Go cryptography maintainer
Professional open source maintainer
https://filippo.io / https://github.com/FiloSottile
https://mkcert.dev / https://age-encryption.org
https://sunlight.dev / https://filippo.io/newsletter
Reproducible bugs are candies ๐ญ๐ฌ
All things space and nerd;
Astrophotographer in The Big Sky State of Montana!
Buy prints and help me buy more gear!
https://fyfeastro.pixieset.com/fyfeastro-gallery
Astrobin: https://www.astrobin.com/users/Fyferoni/