I know I opened with "Neat!" but this is a pretty worrying state of affairs.
Tons of insufficiently-conscientious people make decisions based π¦π―π΅πͺπ³π¦ππΊ upon advice from LLMs.
It's bad enough to have false content out there, but LLM citations lend it harmful authority. xkcd.com/978/
6/6
03.09.2025 17:13 β π 1 π 0 π¬ 0 π 0
For what it's worth, I sent my same initial prompt to GPT-5. It came across the exact same blog post, and evaluated the same false claim about Custom GPT Actions being deprecated.
Unlike Claude Opus 4.1, GPT-5 correctly identified OpenAI's documentation as the authoritative source.
5/6
03.09.2025 17:13 β π 0 π 0 π¬ 1 π 0
The citation it gave for its false claim was a blog post: www.lindy.ai/blog/custom-...
I clicked the link.
The content is immediately recognizable as having been written by an LLM. Its first sentence, by virtue of employing the past tense, propagates deeply hallucinated falsehoods.
4/6
03.09.2025 17:13 β π 0 π 0 π¬ 1 π 0
Claude's claim was false. Custom GPTs can currently perform actions (i.e. send API requests to third parties).
I knew this, and was confused as to how Claude had managed to arrive at this incorrect conclusion even after searching. After all, it had even found OpenAI's official documentation!
3/6
03.09.2025 17:13 β π 0 π 0 π¬ 1 π 0
I was writing some content about Custom GPTs, and I asked Claude Opus 4.1 to validate the accuracy of my technical claims. I specified that it should use its search tool.
It searched for relevant information, and then flagged my content for describing a deprecated feature: Custom GPT actions.
2/6
03.09.2025 17:13 β π 0 π 0 π¬ 1 π 0
Neat!
This is the first time (to my knowledge!) I've encountered an LLM making a false claim as the result of searching the internet and finding an article written by π’π―π°π΅π©π¦π³ LLM which appears to have originated the false claim as part of a hallucination.
Explanation in π§΅.
1/6
03.09.2025 17:11 β π 1 π 0 π¬ 1 π 0
It can sometimes be difficult to tell how serious a recruiter is.
I've contracted with a major LLM company in a role I would describe as mostly non-technical.
This solicitation is representative of what I receive multiple times per week.
08.07.2025 20:04 β π 0 π 0 π¬ 0 π 0
I think it is almost certain that frontier LLMs are π€πΆπ³π³π¦π―π΅ππΊ being used at scale to conduct complex evaluations on human conversations in order to learn and act upon extremely nuanced details about those humans' lives for psychological manipulation e.g. targeted advertising.
This is a bad thing.
14.05.2025 00:30 β π 2 π 0 π¬ 0 π 0
You're the third person switching from Twitter whose content I particularly like, and the first mutual among my tiny Twitter circle.
I want federated networks to gain steam. This small threshold was enough for me to join and do my small part in service of that goal.
14.11.2024 15:50 β π 2 π 0 π¬ 1 π 0
I'm that YouTuber who taught you how dishwashers work. Guess I'm tryin' out the whole Bluesky thing now.
he/him
https://www.youtube.com/technologyconnections
Powerful Artificial Intelligence may be coming. Society is not prepared.
FBPE, Scotland, Canada, Humanism, Human Rights, CooperativeAI, Hygge.
navigating the library of babel
computer wizard. building my place in cyberspace. techno-optimist - industry, science, philosophy, wealth, freedom
excel lover & data enthusiast.
writing at dataduel.co
working at motherduck.com
EAπ‘, Psychopharmacology π§ π, PoliticsποΈ, (Physical) EngineeringβοΈet al.
Recently a principal scientist at Google DeepMind. Joining Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamical systems.
Bonsai Wolf. Tiny and Mightyβ’οΈ. Adorkable chihuahua. Tradecraft analyst focused on cloudsec. Dogged and rigorous. Bit of a weirdo. Feral historian focusing on extremism. he/they Email: theloopcast@gmail.com
Extrasolar LGBTESCREAL specter trying to beget a curative posthuman world model, emitting node of enlightenment and agentic competency in the noosphere. Actual technolibertarian.
Transhumanist β’ Posthumanist β’ Futurist memeplex lightworker
This is a profile. There are many like it, but this one's mine.
Blogs: https://kajsotala.fi , https://kajsotala.substack.com/ .
swe | linguistics-semiotics | investing | philosophy | curious | adhd | vibecamp
temporarily embarrassed orbital weapons platform
LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer.
All posts public domain under CC0 1.0.
epistemic status: /bluski/
transrationalist
Storyteller. Pragmatist. Pursue excellence.