Hallucinations in code are the least dangerous form of LLM mistakes
A surprisingly common complaint I see from developers who have tried using LLMs for code is that they encountered a hallucinationβusually the LLM inventing a method or even a full β¦
I got fed up of posting the same comment every time the topic of LLM hallucinations in code comes up (short version: they don't matter because you'll spot them the second you try to run the code) - so I've turned that comment into a longer form blog post simonwillison.net/2025/Mar/2/h...
02.03.2025 06:27 β π 219 π 31 π¬ 25 π 5
"I like to say that my interest in open source is actually really selfish. I figured something out. I never want to have to do this work ever again."
I wrote this segment up for my blog a few weeks ago: simonwillison.net/2025/Jan/24/...
13.02.2025 16:24 β π 60 π 6 π¬ 2 π 0
I'd guess about 60% of my usage is code-related - I love them for code because hallucinations don't really matter, they become clear the moment I try and execute the code they wrote for me
With ChatGPT Code Interpreter and Claude's JS execution tool sometimes the models spot those bugs themselves!
13.02.2025 00:56 β π 1 π 1 π¬ 0 π 0
Now that we are 2+ years into the general public having access to ChatGPT-style systems, are there any credible studies out there exploring how susceptible to LLM-generated mistruths users of these things are?
Do people tend to develop good instincts on whether they should trust their output?
13.02.2025 00:06 β π 214 π 28 π¬ 42 π 2
YouTube video by Dwarkesh Patel
Jeff Dean & Noam Shazeer β 25 years at Google: from PageRank to AGI
Delighted to have joined my good friend & colleague Noam Shazeer on a podcast with @dwarkesh.bsky.social for a 2+ hour discussion on early Google, ML hardware, training 1T+ token LLMs in '07, model sparsity, continual learning, and more.
Thanks, Noam and Dwarkesh! π
youtu.be/v0gjI__RyCY?...
12.02.2025 21:59 β π 99 π 14 π¬ 1 π 3
Aside from the interesting new friends, S-tier programmer memes, and new work opportunities, why would someone want to spend 6β12 weeks at the @recursecenter.bsky.social ?
Because itβs one of the most potent environments for growing both your taste and agency as a programmer!
07.02.2025 18:25 β π 10 π 3 π¬ 1 π 0
Animated demo of the SQLite explorer tool, clicking through different pages to explore different pages of the database with a detailed breakdown of each one.
A trio of SQLite nerdery on my blog today:
- simonwillison.net/2025/Feb/6/s... about a neat tool for exploring SQLite's binary file format
- simonwillison.net/2025/Feb/7/a... is a tool I built for playing with APSW via Pyodide
- simonwillison.net/2025/Feb/7/s... lets you back a SQLite DB with S3
07.02.2025 02:25 β π 56 π 4 π¬ 2 π 0
From where I left - <antirez>
Dear friends, I'm rejoining Redis. It's a long story, so it deserved a blog post to explain all the details: antirez.com/news/144
10.12.2024 16:40 β π 468 π 97 π¬ 37 π 18
Independent App Developer at fruitfulapps.com / Blogger at mertbulan.com
Hamburg, Germany
Biztech/biotech journo. Coedit CrazyStupidTech.com w/ @Om.co. ex @Wired, @NYTMag, @FortuneMagazine, @USNews, @WSJ. Author: "Dogfight:How Apple and Google went to war .....", Board: @thewritersgrotto.bsky.social, @columjournreview.bsky.social Epidiolex fan.
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence.
Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
I cover antitrust. So good at my job that some senators complained. Currently reporting for Bloomberg News. Former Politico, MLex, CQ. Email me at lnylen2 at bloomberg.net. On Signal: leahnylen.88 Opinions are my own.
Historian. Author. Professor. Budding Curmudgeon. I study the contrast between image and reality in America, especially in politics.
Photographer, Farmer, Software Engineer.
Contact: https://linktr.ee/colinsurprenant
Photography: @colinsurprenantphoto.social
Delivered effective, efficient, and secure digital services for the American people until we were forced to stop on March 1, 2025. Not an official government account. Reposts are not endorsements. Our new website: https://18f.org/ #AltGov
Mountains and websites; maps and birds; context collapse. Projects: https://subject.space/. Work: Data, research and technology for investigations @bellingcat.com. Personal account, posts/opinions are my own. NYC/Amsterdam via Oregon.
UW biology prof.
I study how information flows in biology, science, and society.
Book: *Calling Bullshit*, http://tinyurl.com/fdcuvd7b
LLM course: https://thebullshitmachines.com
Corvids: https://tinyurl.com/mr2n5ymk
I don't like fascists.
he/him
Independent AI researcher, creator of datasette.io and llm.datasette.io, building open source tools for data journalism, writing about a lot of stuff at https://simonwillison.net/
Just in from the wasteland.
Just another Twitter refugee looking for a home
Webbin' so long. Axios editor. Author of some tech books.
nyc, software eng, early datadoghq.com, turntable.fm et al, he/him, read my blog at jmoiron.net
Founder & reigning monarch at TPM. Lapsed historian. Hand tool woodworker. Jew.
Novelist and historian: The Breakup (2026), Evil Geniuses, Fantasyland, Heyday, Turn of the Century, etc. Co-created Command Z (with Steven Soderbergh) and Studio 360, New Yorker writer, New York editor-in-chief, Spy co-founder https://www.kurtandersen.com
Lifelong Programmer, Host of the Developer Voices podcast.
VP of Community @ Zig Software Foundation β’ Zig Livecoding http://twitch.tv/kristoff_it β’ Creator of http://softwareyoucan.love β’ Blogging http://kristoff.it β’ Host of https://zig.show β’ π§ loris@sycl.it
Professor of Social Psychology in Society at the University of Cambridge and Author of FOOLPROOF: Why We Fall for Misinformation and How to Build Immunity (2023) + The Psychology of Misinformation (2024). Bad News Game.
www.sandervanderlinden.com