Do they subjectively feel like they've overcome LLM limitations to you?
It's great that they've been fine-tuned on a bunch of synthetic symbolic problems. Benchmarks go up.
But I still need to clear the context whenever a bad take or mistaken assumption gets in.
09.12.2025 09:54
2/6
This is the point I have been making again and again over the years. The global economy is a closed system, and it must balance. This means that domestic imbalances created by countries that control their external accounts must...
08.12.2025 04:30
(At this point I still prefer to use AI for any expensive-ish (say, over three digits) purchase, which tips the scales towards feeding the data monster, i.e. reviews, influencers etc., getting people talking.)
08.12.2025 10:05
Sure, they're mostly aligned, but the misalignment carries an important inefficiency: auction bidding directly siphons off buy-side margin to the sell side and the broker, and that's money that doesn't stay in the consumer's pocket. So even if I see an ad, I'd rather re-search with that vendor than click through.
08.12.2025 10:02
Search ads are basically a parallel search engine using ad targeting criteria and auctions to determine ranking, instead of an analysis of the user's value function, which is far too expensive. Until AI, that is.
07.12.2025 16:24
The numbers are somewhat skewed, because the incentives to improve avenues for sellers to buy placement are at odds with finding the best response for the user. If you spend more effort on the former, it looks like people prefer the former.
07.12.2025 16:20
Persuasion goes both ways and facts can be cherrypicked, and are, heavily, by biased publications everywhere on the spectrum. Facts and evidence are, alas, no protection from malpersuasion.
07.12.2025 14:04
(If it's not clear, corrupt authoritarian politics are a bad thing, and should not be supported wherever you are on the political spectrum. But the public en masse often doesn't realise what it's buying when those guys sell fantasies.)
07.12.2025 10:57
The Trump administration's enemy isn't countries, it's non-authoritarian politics. This is why Europe is a bigger threat than China or Russia.
It's also why Russia attacked Ukraine after Euromaidan. Liberal (as in, not based on authoritarian crony power networks) politics is the threat.
07.12.2025 10:54
That doesn't capture the vibe of slop to me. Slop is all texture and nothing more.
Slop is all surface but hollow on the inside. Aesthetic polish, but nothing below the surface. Low meaning. Information content entirely in signifiers but nothing signified. A walking, talking zombie with dead eyes.
06.12.2025 20:56
It's the other way around. LLMs are adept at content-free PR speak, so PR needs to adapt to avoid being mistaken for LLM slop. IMO.
This is not that different from the old Cluetrain Manifesto. Authentic voices don't sound like LLMs.
05.12.2025 00:30
A high ratio of linguistic polish to meat always smells like LLM to me now. Like, show me details that are falsifiable. High-powered verbs, adjectives and adverbs, but lacking specifics: who, where, when, what, how, why. Paint me a picture of a process that fixes things and improves proactively.
04.12.2025 21:34
All action words without specificity. "Comprehensive" steps. Secured & revoked (how many?), blocked (from where?), engaged (who?), performed review (what was found?), additional controls (what controls?).
It reads literally like a sample "best of breed" response. No evidence of engineering.
04.12.2025 21:29
I take a train every week, between Zurich and Hannover. Between 60 and 110 EUR a leg in 1st class, depending on how far in advance I book. Food and drink are served at your seat, but the choice is mediocre. Either way it's far better than driving or flying. Only private flights would compete.
04.12.2025 21:22
It happens quite a lot, actually. Most Starbucks clones have not offered black coffee for quite some time. The best you can get is an Americano. It's the menu that irritates though, not the barista.
02.12.2025 10:04
Have you ever gone into a coffee shop, wanting black coffee, and been faced with a menu without that on it? Where the closest thing is a watered down bitter espresso labelled Americano?
Maybe just me.
02.12.2025 10:00
Akshually, I have, far more than once, been forced to drink Americano when I wanted black filter coffee. And it's a watered down bitter espresso and not very nice!
02.12.2025 09:57
But what about paying people to wait on you?
Stuff only goes so far, and it's relative. The hedonic treadmill is real; my new PC in 2025 delights me no more (less probably) than my new PC did in 1995.
Getting people to do your bidding, on the other hand...
29.11.2025 11:46
There's a problem with LLM problem solving that I'm seeing pretty consistently now.
LLMs, when they encounter an error, formulate a hypothesis about what's wrong.
Then, rather than validate the hypothesis, they immediately try to implement a solution based on the hypothesis.
Check the hypothesis!
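A minimal sketch of what "check the hypothesis" could look like in code, assuming each guess comes with a cheap test that can confirm or reject it. The `Hypothesis` class, the `debug` function and the `config` example are all hypothetical names made up for illustration, not anything an LLM tool actually exposes.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Hypothesis:
    """A guess about the cause of an error, paired with a check and a fix."""
    description: str
    check: Callable[[], bool]  # True only if the evidence supports the guess
    fix: Callable[[], None]    # the change to make if the guess holds


def debug(hypotheses: list[Hypothesis]) -> Optional[str]:
    """Apply the fix of the first hypothesis whose check passes."""
    for h in hypotheses:
        if not h.check():
            continue  # guess rejected by evidence: do not touch anything
        h.fix()
        return h.description
    return None  # nothing validated, so nothing was changed


# Hypothetical failing state: the port was stored as a string.
config = {"port": "8080"}

hypotheses = [
    Hypothesis(
        "port is missing",
        check=lambda: "port" not in config,
        fix=lambda: config.update(port=80),
    ),
    Hypothesis(
        "port is a string, not an int",
        check=lambda: isinstance(config.get("port"), str),
        fix=lambda: config.update(port=int(config["port"])),
    ),
]

result = debug(hypotheses)
```

The first guess is plausible but wrong, so its fix never runs. Only the guess that survives its check gets to change anything.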
28.11.2025 21:38
It generates a backlash in some people because it situates moral culpability not in racists and their racist acts, but in having a skin color. If you have white skin, you are starting from a morally compromised position; i.e. you're a bad person because of your skin, is what it means to some people.
26.11.2025 15:22
They can sell them to me? I bought my RTX 6000 Pro Blackwell for a tidy sum but I'm open to buying more...
25.11.2025 07:51
Is this news? TPUs have been around for ages.
25.11.2025 07:33
I don't think he's measuring anything to do with popularity. I think Polanski is a nutjob economically, just a much less offensive nutjob than Farage, and if he gets anywhere he'll do a different kind of damage. Neither Farage nor Polanski can tackle the root causes IMO.
23.11.2025 22:37
I have no strong feelings one way or the other about Cluely - maybe mildly disapproving of the cheating thing the CEO did but someone was going to do that eventually - but the zoning thing is kind of dumb. Beds under stairs isn't good, but mixed use is good actually.
22.11.2025 11:11
And all the data. Don't forget all the data.
21.11.2025 09:51
Because a bunch of hardware will be released into the market from companies that failed to make it in research, and that hardware can be used for inference. The marginal cost is dominated by electricity, and wholesale electricity is trending cheaper, esp. with solar. 2/2
20.11.2025 18:04
When you look at batch throughput, the technical bottlenecks considering MoE models, and power use of hardware involved, there's no reason to believe the subsidy is there for commodity inference.
I know you want to believe otherwise, but a collapse will make inference even cheaper. Why? 1/
20.11.2025 18:04
You are mistaken. Training, research and salaries dominate the costs of the big cos. The marginal cost is not zero, but it's a fraction of a cent for a short query and response. Third-party providers for open-weight models have no incentive to sell at a loss.
20.11.2025 14:26
Why be limited to what you need rather than what you want? Hobby room, library, cinema, guest bedroom, home gym etc.
20.11.2025 11:38
Ironically, the best way to deal with terrorism is by treating it as criminality. Because military approaches oppress civilians and risk backfiring, motivating support in the wrong direction.
19.11.2025 12:50
Research Scientist at Google DeepMind, interested in multiagent reinforcement learning, game theory, games, and search/planning.
Lover of Linux 🐧, coffee ☕, and retro gaming. Big fan of open-source. #gohabsgo 🇨🇦
For more info: https://linktr.ee/sharky6000
ꙮ surfed on by the information superhighway
ꙮ @linneaisaac.bsky.social
ꙮ she/they 🏳️‍⚧️
ꙮ blog posts and games @ https://vgel.me
ꙮ still mostly active on twitter https://x.com/voooooogel
• Director https://www.strategictranslation.org/
• Essayist http://scholars-stage.org
• Long takes on 🇨🇳 politics, 🇺🇸 conservatism, ancient history
Like all men in Babylon, I have been a proconsul; like all, a slave; I have also known omnipotence, opprobrium, and imprisonment.
very sane ai newsletter: verysane.ai
random bloggy bits: segyges.leaflet.pub
A business analyst at heart who enjoys delving into AI, ML, data engineering, data science, data analytics, and modeling. My views are my own.
You can also find me at threads: @sung.kim.mw
Software engineer at Google working on Jetpack Compose
Substack: http://lcamtuf.substack.com/archive
Homepage: http://lcamtuf.coredump.cx
Locked in and posting regularly on here now
Drawing pictures and working on group coordination. https://danallison.info
I know you seen it prompting itself
Globally ranked top 20 forecaster
AI is not a normal technology. I'm working at the Institute for AI Policy and Strategy (IAPS) to shape AI for global prosperity and human freedom.
wvdial, bup, sshuttle, netselect, popularity-contest, redo, gfblip, GFiber, and now CEO @Tailscale.com doing WireGuard mesh. Top search result for "epic treatise."
I am the Host of the Homebrewed Christianity podcast, visiting Prof of Theology at Luther Seminary, & I love #LOTR #Lakers #Dodgers.
Full of passionate intensity.
(Unofficial) Hacker News Bot with top stories updates.
Jobs from YC startups @whois-hiring.bsky.social
Creator @ykravchuk.bsky.social
Formal methods, software history, chocolatiering. DMs open and happy to meet up in Chicago. Currently writing *Logic for Programmers* (out Q1 2026)
Newsletter: https://buttondown.email/hillelwayne/
New ideas daily.
https://linktr.ee/soreniverson
A latent space odyssey
gracekind.net
Associate Professor at @cst.cam.ac.uk, researching decentralised systems and security protocols. Advisor to the Bluesky team. Wrote “Designing Data-Intensive Applications” (O’Reilly). he/him
Defence Editor at The Economist.
Visiting Fellow at Department of War Studies, KCL. For speaking engagements: https://chartwellspeakers.com/speaker/shashank-joshi