How far can we push AI autonomy in code generation?
An experiment to test the limits of autonomous code generation by LLMs
We recently ran an experiment to explore how far GenAI can currently be pushed toward autonomously developing high-quality, up-to-date software without human intervention, and gather observations about where it breaks down.
martinfowler.com/articles/pus...
05.08.2025 14:17 โ ๐ 28 ๐ 11 ๐ฌ 1 ๐ 1
Das Individuum in der Maschine: Meredith Whittaker รผber die Rรผckgewinnung der Privatsphรคre im Zeitalter der KI
Gemeinsam mit Publix und @algorithmwatch.org laden wir ein zum Gesprรคch mit @meredithmeredith.bsky.social, Prรคsidentin des Messengers Signal, รผber die Frage, wie wir Technologie wieder stรคrker an menschlichen Bedรผrfnissen ausrichten kรถnnen โ vor allem beim Schutz unserer Privatsphรคre.
โก๏ธ t.ly/lIzA3
29.07.2025 07:43 โ ๐ 50 ๐ 8 ๐ฌ 0 ๐ 0
I totally get it, Iโm also tired of much of the public discourse. But here is one more argument for experienced devs like the author: If we want to guide & teach the โjuniorsโ, we have a responsibility to know first hand what works and what doesnโt, because they are using LLMs if we like it or not.
21.07.2025 09:44 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0
I still care about the code
Notes from my Thoughtworks colleagues on AI-assisted software delivery
I've seen a surge of discussions recently about large AI-generated change sets that are impossible to review by humans, paired with speculation if we still need to care about the code in the future. I expect to continue to care, especially if I'm on call for it martinfowler.com/articles/exp...
09.07.2025 16:02 โ ๐ 3 ๐ 3 ๐ฌ 0 ๐ 0
Comic. [block quote] โFar better an approximate answer to the *right* question, which is often vague, than an *exact* answer to the wrong question, which can always be made precise.โ -John W. Tukey, The Future of Data Analysis (1962) [caption] Happy Approximate Birthday to John Tukey, author of my favorite statistics quote, who was born 110.000 years ago sometime this week.
Tukey
xkcd.com/3104/
23.06.2025 22:36 โ ๐ 2914 ๐ 429 ๐ฌ 20 ๐ 15
And I just thought about agents causing that, now @thepete.net is bringing up the possibility of non-coding managers doing it too, with AIโs helpโฆ
04.06.2025 19:54 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
In the past few weeks, a new surge of "background" coding agents came out. I wrote down an example of using OpenAI's Codex, this will hopefully help you understand better what they do under the hood, and which agent category they fall into.
martinfowler.com/articles/exp...
04.06.2025 14:15 โ ๐ 13 ๐ 1 ๐ฌ 3 ๐ 2
When I code with an AI agent, I revert to the last comfortable checkpoint as soon as I feel like losing control. I wonder what that will be like for the delivery managers & POs of the future, what will THEY do when things around them change so fast and erratically that they feel like losing control?
02.06.2025 13:46 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Good scope management is the linchpin of agile software delivery. Generative AI is like a scope chaos monkey, generating more code, more story details, more requirements than necessary
01.06.2025 11:15 โ ๐ 6 ๐ 0 ๐ฌ 0 ๐ 0
NEW POST
To work effectively with agentic coding assistants, Birgitta Bรถckeler found she needs to intervene, correct and steer all the time. She describes examples of these interventions indicating the skills we need to correct the tools' missteps
martinfowler.com/articles/exp...
25.03.2025 15:16 โ ๐ 157 ๐ 34 ๐ฌ 9 ๐ 6
My LLM codegen workflow atm
A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through planning and execution.
Nice write-up by @harper.lol on his AI-assisted coding workflow. I personally prefer in-IDE tools, but the concepts are reusable. โค๏ธ this:"I really want someone to solve this problem in a way that makes coding with an LLM a multiplayer game. Not a solo hacker experience."
harper.blog/2025/02/16/m...
20.02.2025 04:01 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Exploring Generative AI
Notes from my Thoughtworks colleagues on AI-assisted software delivery
I have thoughts and open questions about the role reasoning models might play or not play in coding assistance. A lot of stake is put into how reasoning models are a step change in coding assistance, but I don't see it - yet?
martinfowler.com/articles/exp...
18.02.2025 03:39 โ ๐ 10 ๐ 3 ๐ฌ 1 ๐ 0
Cursor's "Composer" does it, Codeium released a new editor called "Windsurf", and GitHub Copilot's new "Copilot Edit" feature makes this capability available to many of our clients at Thoughtworks, where Copilot currently remains the most widely adopted tool.
19.11.2024 15:32 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Multi-file editing capabilities have been available in open-source tools like Cline and Aider for some time, but in just the past few weeks some of the major commercial coding assistance products have released that capability.
19.11.2024 15:32 โ ๐ 4 ๐ 1 ๐ฌ 1 ๐ 0
Exploring Generative AI
Notes from my Thoughtworks colleagues on AI-assisted software delivery
New GenAI memo: I wrote down my thoughts and observations about the multi-file editing features that are currently being released in lots of coding assistants: martinfowler.com/articles/exp...
19.11.2024 15:31 โ ๐ 8 ๐ 2 ๐ฌ 1 ๐ 1
Exploring Generative AI
Notes from my Thoughtworks colleagues on AI-assisted software delivery
New "GenAI memo": In this one I explore the potential of AI assistance for tech stack migrations. I describe building an agent that changes the testing framework used in a test. As a side effect you can also gain a better understanding of how AI agents work: martinfowler.com/articles/exp...
26.08.2024 12:53 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
YouTube video by GOTO Conferences
AI Assistance Beyond Code: What Do We Need to Make it Work? โข Birgitta Bรถckeler โข GOTO 2024
My talk at GOTO Amsterdam is now live on YouTube: "AI Assistance Beyond Code: What Do We Need to Make it Work?" youtu.be/8jwiABwGC6c?...
26.08.2024 12:53 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 0
Exploring Generative AI
Notes from my Thoughtworks colleagues on AI-assisted software delivery
In my newest "GenAI memo", I explore how today's AI tools can assist with onboarding to existing, potentially messy codebases. I do this by trying to understand and solve an issue in a real life codebase: martinfowler.com/articles/exp...
15.08.2024 15:15 โ ๐ 5 ๐ 0 ๐ฌ 0 ๐ 0
Psychologist for Software Teams (& writing a book about it). Founder: Developer Success Lab, Catharsis Consulting. VP of Research. Defender of the mismeasured. she/her ๐ณ๏ธโ๐ https://www.drcathicks.com/
Host at: https://www.changetechnically.fyi/
Writing The Pragmatic Engineer (@pragmaticengineer.com), the #1 technology newsletter on Substack. Author of The Software Engineer's Guidebook (engguidebook.com). Formerly at Uber, Skype, Skyscanner. More at pragmaticengineer.com
SIG Security Chair kubernetes.io
container escape artist
goose in the mainframe
Minneapolis. They/them.
President of Signal, Chief Advisor to AI Now Institute
Political analysis and reporting free of tribal prejudices. Sign up for our newsletters here: https://thebulwark.com/subscribe
Startup CTO and software delivery aficionado
Independent journalist covering internet culture, politics, and media @spitfirenews.com. Buy me a coffee: https://ko-fi.com/kattenbarge
ancient reptile in real life, Omar on Platonic. formerly known as vinn_ayy
Itโs okay, Iโm in incognito mode.
By @joesondow.bsky.social
Techie, dog lover, avid traveler
At the intersection of Generative AI , DevOps and anything in the SDLC - AI Native Development and Infrastructure
Former CTO/Chief Scientist @N26, @ThoughtWorks
alum. Runs http://levelup.patkua.com and http://techlead.academy He/him
Java Champion, Developer Productivity Advocate, Author
https://trishagee.com
https://linktr.ee/trisha_gee
๐ช๐ธ๐ฌ๐ง๐ช๐บ
Creator c4model.com & structurizr.com | Author "Software Architecture for Developers" | Software architecture and diagramming workshops worldwide | Patreon at patreon.com/simon_brown
Software Developer, Technical Coach, YouTuber. She/her.
emilybache.com
Headed engineering for companies w/ either millions of users but no revenue or millions in revenue but no users. Now making AI for SWEs not PhDs at http://outropy.ai
Author of Infrastructure as Code. Principal Cloud Architect at Thoughtworks
O'Reilly Author | Lead Software Engineer @Thoughtworks | Software Craftsperson