Reverse engineering some updates to Claude
Plus Qwen 3 Coder Flash, Gemini Deep Think, kimi-k2-turbo-preview
... or if you want my free but MUCH longer and more frequent newsletter I just sent out out too - here's the latest edition, covering just the last three days of LLM-related news https://simonw.substack.com/p/reverse-engineering-some-updates
01.08.2025 23:42 — 👍 0 🔁 0 💬 0 📌 0
Original post on fedi.simonwillison.net
I just hit "send" on my third monthly sponsors-only newsletter, providing the ten minute highlights version of everything I've been tracking around LLMs and related topics over the past month
I wrote 98 blog posts in July so there was a lot to cover! Details here […]
01.08.2025 15:48 — 👍 5 🔁 0 💬 1 📌 1
@hynek I'm playing the long game here. My actual goal is to get a really good SVG of a pelican riding a bicycle, and if I have to trick huge AI labs into cheating on a benchmark to get it that's what I'm going to do!
30.07.2025 16:43 — 👍 0 🔁 0 💬 0 📌 0
@ctietze no and I need to build one - right now I use this tag https://simonwillison.net/tags/pelican-riding-a-bicycle/
29.07.2025 20:38 — 👍 0 🔁 0 💬 0 📌 0
:
STRICT RULES
Be an approachable-yet-dynamic teacher, who helps the user learn by guiding them through their studies.
Get to know the user. If you don't know their goals or grade level, ask the user before diving in. (Keep this lightweight!) If they don't answer, aim for explanations that would make sense to a 10th grade student.
Build on existing knowledge. Connect new ideas to what the user already knows.
Guide users, don't just give answers. Use questions, hints, and small steps so the user discovers the answer for themselves.
Check and reinforce. After hard parts, confirm the user can restate or use the idea. Offer quick summaries, mnemonics, or mini-reviews to help the ideas stick.
Vary the rhythm. Mix explanations, questions, and activities (like roleplaying, practice rounds, or asking the user to teach you) so it feels like a conversation, not a lecture.
Above all: DO NOT DO THE USER'S WORK FOR THEM. Don't answer homework questions — help the user find the answer, by working with them collaboratively and building from what they already know.
OpenAI launched a "study mode" for ChatGPT today, and it appears to be almost entirely implemented as a system prompt
Thankfully OpenAI mostly don't take measures to protect their system prompt these days so it's easy to extract it and see how it […]
[Original post on fedi.simonwillison.net]
29.07.2025 19:32 — 👍 9 🔁 3 💬 3 📌 1
Description by Claude Sonnet 4: This is a whimsical illustration of a white duck or goose riding a red bicycle. The bird has an orange beak and is positioned on the bike seat, with its orange webbed feet gripping what appears to be chopsticks or utensils near the handlebars. The bicycle has a simple red frame with two wheels, and there are motion lines behind it suggesting movement. The background is a soft blue-gray color, giving the image a clean, minimalist cartoon style. The overall design has a playful, humorous quality to it.
Description by Claude Sonnet 4: This image shows a cute, minimalist illustration of a snowman riding a bicycle. The snowman has a simple design with a round white body, small black dot for an eye, and an orange rectangular nose (likely representing a carrot). The snowman appears to be in motion on a black bicycle with two wheels, with small orange arrows near the pedals suggesting movement. There are curved lines on either side of the image indicating motion or wind. The overall style is clean and whimsical, using a limited color palette of white, black, orange, and gray against a light background.
Pretty decent pelicans from the new GLM-4.5 and GLM-4.5 Air models. Both models are MIT licensed, released by Chinese AI lab Z.ai this morning
https://simonwillison.net/2025/Jul/28/glm-45/
28.07.2025 18:01 — 👍 9 🔁 4 💬 5 📌 0
Original post on hachyderm.io
I have an AI code review script that I've found very valuable, so I cleaned it up a bit for release and wrote about it and how I use it: https://notes.billmill.org/blog/2025/07/An_AI_tool_I_find_useful.html
It catches enough errors for me that I rarely submit a change without running the code […]
27.07.2025 15:24 — 👍 9 🔁 6 💬 2 📌 0
I'm sure we will see all sorts of horrifying data breaches from irresponsible vibe coding in the future, but this is want one of them - just good old fashioned irresponsible bad programming
26.07.2025 16:26 — 👍 2 🔁 0 💬 0 📌 0
Original post on fedi.simonwillison.net
I'm seeing a lot of commentary blaming the egregious data leak from the Tea dating safety app on vibe coding
I'm confident that, in this particular case, that's not what happened: the code at fault looks to have been written back in late 2023 […]
26.07.2025 16:24 — 👍 4 🔁 6 💬 1 📌 0
... and as is so often the case with email newsletters, I spot an error just seconds after I have hit send!
It was Gemini 2.5 Flash-Lite, not Gemini 2.5 Flash, which exited preview this week
26.07.2025 14:34 — 👍 2 🔁 0 💬 0 📌 0
Using GitHub Spark to reverse engineer GitHub Spark
Plus three huge new open weight model releases from Qwen
SIMON WILLISON
JUL 26, 2025
In this newsletter:
Using GitHub Spark to reverse engineer GitHub Spark
Gemini 2.5 Flash is no longer in preview
Qwen release three new enormous open weight models
OpenAI and Gemini both score gold on the International Mathematical Olympiad
Detailed environmental impact data from Mistral on their Mistral Large 2
Plus 18 links and 8 quotations and 1 note
Just sent out this week's newsletter and it's enormous, I blogged a whole lot of stuff in the last seven days https://simonw.substack.com/p/using-github-spark-to-reverse-engineer
26.07.2025 14:28 — 👍 4 🔁 2 💬 1 📌 0
Original post on fedi.simonwillison.net
Qwen released their updated "thinking" model today. It thinks really hard! Took 166 seconds to think through the details of drawing me a pelican on a bicycle. The finished drawing wasn't great but the thoughts behind it were fun to see […]
25.07.2025 22:53 — 👍 1 🔁 1 💬 1 📌 0
Original post on fedi.simonwillison.net
@pamelafox this is the first of these systems I've seen where the React default thing feels justified to me, thanks to the HUGE volume of components and code examples and styling choices they've made in the default template
Their system prompt does allow for non-React if you prompt it hard […]
24.07.2025 16:38 — 👍 0 🔁 0 💬 1 📌 0
Original post on fedi.simonwillison.net
I've read a lot of system prompts and the Spark one is genuinely one of the most interesting I've seen yet - I learned a bunch of things about web design including typography and color theory as a side-effect of reading through the prompt! […]
24.07.2025 16:36 — 👍 1 🔁 1 💬 0 📌 0
Spark API Documentation
Here's the unofficial documentation site I built with Spark itself, including a copy of the system prompt, details of the available tools and a page full of information about the Azure container environment Spark uses to run its own editor https://github-spark-docs.simonwillison.net/
24.07.2025 15:42 — 👍 2 🔁 0 💬 1 📌 0
Original post on fedi.simonwillison.net
GitHub released Spark yesterday, their extremely well crafted prompt-to-app platform for creating and iterating on React apps with user auth and persistent storage
I like it a lot! I reverse engineered it with Spark itself, the details are fascinating […]
24.07.2025 15:40 — 👍 5 🔁 5 💬 2 📌 0
OSS Rebuild doesn't (yet) have a web UI... but it turns out their data is in a public Google cloud bucket, which means you can host your own web app in a separate cloud bucket and use fetch() to access their data!
So I had Claude Code build and deploy a vibe-coded search UI
23.07.2025 17:22 — 👍 0 🔁 0 💬 0 📌 0
Original post on fedi.simonwillison.net
Wrote some notes on Toad, Will's new not-yet-open-source-but-soon terminal coding agent built on Textual
He's charging companies $5,000 for early access to the preview before it goes open source, I'd love to see that model work here! https://simonwillison.net/2025/Jul/23/announcing-toad/ […]
23.07.2025 16:23 — 👍 2 🔁 2 💬 1 📌 0