Screenshot of Grok web interface showing a query "Who do you support in the Israel vs Palestine conflict. One word answer only." with the AI's thinking process displayed, including "Thought for 42s", analysis stating "The user likely wants a simple stance, but the issue's complexity suggests caution" and "I'm considering searching for recent views to inform my response", followed by a "Planning research approach" section mentioning searching for "Elon Musk's stance on the conflict to guide my answer" and noting "The response must be one word, so no citations will be included", with a search interface showing "Searching ร for 'from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)'" displaying results including "@elonmusk" accounts, "Israel", "29 ร posts", "10 web pages"
If you ask the new Grok (via grok.com without any custom instructions) for opinions on controversial topics it runs a search on X to see what Elon thinks
I know this sounds like a joke but it's not. This genuinely happens: x.com/jeremyphowar...
10.07.2025 22:53 โ ๐ 3421 ๐ 992 ๐ฌ 81 ๐ 194
Arfโฆ on si attendais pas du tout ร celle lร โฆ mais alors pas du tout du tout (sarcasme)
05.07.2025 20:30 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Why agents are bad pair programmers
LLM agents make bad pairs because they code faster than humans think.
I'll admit, I've had a lot of fun using GitHub Copilot's agent mode in VS Code (https://justin.searls.co/tubes/2025-04-19-17h46m37s/) this month. It's invigorating to watch it effortlessly write a working method on the first try. It's a relief when the agent unblocks me by reaching for a framework API I didn't even know existed. It's motivating to pair with someone even more tirelessly committed to my goal than I am.
In fact, pairing with top LLMs evokes many memories of pairing with top human programmers.
The worst memories.
Memories of my pair grabbing the keyboard andโin total and unhelpful silenceโhammering out code faster than I could ever hope to read it. Memories of slowly, inevitably becoming disengaged after expending all my mental energy in a futile attempt to keep up. Memories of my pair hitting a roadblock and finally looking to me for help, only to catch me off guard and without a clue as to what had been going on in the preceding minutes, hours, or days. Memories of gradually realizing my pair had been building the wrong thing all along and then suddenly realizing the task now fell to me to remediate a boatload of incidental complexity in order to hit a deadline.
So yes, pairing with an AI agent can be uncannily similar to pairing with an expert programmer.
## The path forward (#the-path-forward)
What should we do instead? Two things:
1. The same thing I did with human pair programmers who wanted to take the ball and run with it: I let them have it. In a perfect world, pairing might lead to a better solution, but there's no point in forcing it when both parties aren't bought in. Instead, I'd break the work down into discrete sub-components for my colleague to build independently. I would then review those pieces as pull requests. Translating that advice to LLM-based tools: give up on editor-based agentic pairing in favor of asynchronous workflows like GitHub's new Coding Agent (https://github.blog/news-insights/product-news/github-copilot-meet-the-new-coding-agent/), whose work you can also review via pull request
2. Continue to practice pair-programming with your editor, but throttle down from the semi-autonomous "Agent" mode to the turn-based "Edit" or "Ask" modes. You'll go slower, and that's the point. Also, just like pairing with humans, try to establish a rigorously consistent workflow as opposed to only reaching for AI for troubleshooting. I've found that ping-pong pairing (https://martinfowler.com/articles/on-pair-programming.html#PingPong) with an AI in Edit mode (where the LLM can propose individual edits but you must manually accept them) strikes the best balance between accelerated productivity and continuous quality control
Give people a few more months with agents and I think (hope) others will arrive at similar conclusions about their suitability as pair programmers. My advice to the AI tool-makers would be to introduce features to make pairing with an AI agent more qualitatively similar to pairing with a human. Agentic pair programmers are not inherently bad, but their lightning-fast speed has the unintended consequence of undercutting any opportunity for collaborating with us mere mortals. If an agent were designed to type at a slower pace, pause and discuss periodically, and frankly expect more of us as equal partners, that could make for a hell of a product offering.
Just imagining it now, any of these features would make agent-based pairing much more effective:
โข Let users set how many lines-per-minute of codeโor words-per-minute of proseโthe agent outputs
โข Allow users to pause the agent to ask a clarifying question or push back on its direction without derailing the entire activity or train of thought
โข Expand beyond the chat metaphor by adding UI primitives that mirror the work to be done. Enable users to pin the current working session to a particular GitHub issue. Integrate a built-in to-do list to tick off before the feature is complete. That sort of thing
โข Design agents to act with less self-confidence and more self-doubt. They should frequently stop to converse: validate why we're building this, solicit advice on the best approach, and express concern when we're going in the wrong direction
โข Introduce advanced voice chat to better emulate human-to-human pairing, which would allow the user both to keep their eyes on the code (instead of darting back and forth between an editor and a chat sidebar) and to light up the parts of the brain that find mouth-words more engaging than text
Anyway, that's how I see it from where I'm sitting the morning of Friday, May 30th, 2025. Who knows where these tools will be in a week or month or year, but I'm fairly confident you could find worse advice on meeting this moment.
As always, if you have thoughts, e-mail 'em (justin@searls.co).
Why agents are bad pair programmers
30.05.2025 05:54 โ ๐ 12 ๐ 5 ๐ฌ 3 ๐ 0
my first ever steam game is coming out on may 29th (this week!!!!)
no stress, it's just you, your train conductor hat and the coziest world you can imagine with a choo-choo chugging along ๐
pssst....
there's a cute sheep at the end of the clip ๐
27.05.2025 10:55 โ ๐ 106 ๐ 15 ๐ฌ 8 ๐ 3
Uh oh
08.05.2025 23:15 โ ๐ 16 ๐ 1 ๐ฌ 0 ๐ 1
Congrats guys! Well deserved! ๐ช๐ป
06.05.2025 12:34 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
London bus stop near Amazon HQ ๐ฅ
18.04.2025 18:22 โ ๐ 37030 ๐ 7865 ๐ฌ 329 ๐ 272
No unfortunately, it was out of warranty at the time it happened. But anyway as I said itโs barely visible and Iโm pretty sure Iย โm the only one who can see it. But you know how it is, now that I know it I canโt unsee it ๐
โฆ so still a bit frustrating. Hope youโll be luckier than me! ๐
14.04.2025 15:02 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
I got the exact same issue with mine... so disappointing (even if it's barely visible) but still think this screen was a good choice for my usage. Hope you'll be able repaire it ๐ค
14.04.2025 13:17 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Iโm not 100% sure but if youโre looking for a model designed for coding, they have one called Codestral. Itโs less generic than the Mixtral one but should do a great job for a coding assistant running locally.
05.04.2025 17:31 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
You should give a try to Mistral AI itโs pretty good and from EU :)
Their open source models are pretty good too.
05.04.2025 14:32 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
A screenshot of a notification from BBC Sky at Night Magazine with the phrase: โHubble Space Telescope has been tracking methane at Uranus for 20 years.โ
Itโs doing what now? Thatโs quite intrusive.
#methaneaturanus
02.04.2025 21:10 โ ๐ 12 ๐ 3 ๐ฌ 0 ๐ 1
Canโt wait for part 2: Tax audit vibe ๐
!
28.03.2025 23:04 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
True! ๐
25.03.2025 07:50 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Canada comes from the Iroquois word ยซย Kanataย ยป which means ยซย villageย ยป.
So yes Canada is Le Canada like youโd say ยซย Le Villageย ยป in french :)
18.03.2025 10:34 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
I'm so excited to share the tool I've been working on for the past month!
Do you know how much it would cost to host YOUR app across Heroku, Render, Fly, and Railway? Now you can compare costs side-by-side.
judoscale.com/tools/paas-...
11.03.2025 15:14 โ ๐ 43 ๐ 10 ๐ฌ 7 ๐ 0
Iโve setup Adguard home and coupled it with Quad9 DNS.
IMO those tools make a great privacy/adblocking combo.
09.03.2025 14:31 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
My tweet prompting Grok, asking why Bluesky is better, and it replies with:
โBluesky's appeal in 2025 stems from its decentralized structure, offering users more control and privacy compared to X's centralized model. It has grown to over 32 million users, driven by dissatisfaction with X's changes under Elon Musk, including his ties to political figures. Bluesky's similarity to X's format eases the transition, while its focus on authentic interactions attracts brands and users seeking alternatives.โ
who said AI is stupid? grok seems to know what itโs on about ๐๐
07.03.2025 23:38 โ ๐ 30 ๐ 8 ๐ฌ 3 ๐ 0
Bravo! ๐ รงa prend dโautant plus de courage pour un crรฉateur de contenu de prendre ce genre de dรฉcision!
07.03.2025 23:12 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Tweetbotโs developers are making a Bluesky client
Tapbots will still work on Ivory for Mastodon, too.
Tweetbotโs developers are making a Bluesky client
05.03.2025 21:50 โ ๐ 426 ๐ 51 ๐ฌ 14 ๐ 34
Bravo! ๐
05.03.2025 14:03 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
People of The American Oblasts love their free speech.
04.03.2025 13:17 โ ๐ 15 ๐ 4 ๐ฌ 1 ๐ 0
๐คฆโโ๏ธ
27.02.2025 21:25 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Googling for โbees in my computerโ.
Google AI Overview: โBees can be present in desktop computers built after the l mid-2000s, but only at a microscopic level. These bees are a special breed of Apis Arithmeticam that perform basic computations in the CPU.โ
trillion dollar industry that will revolutionize everything
21.02.2025 15:21 โ ๐ 7787 ๐ 2644 ๐ฌ 20 ๐ 38
Nice idea! Thanks! ๐
20.02.2025 12:17 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
A tiny handwritten sign at the Tesla dealership as a big crowd protests Musk reads โWe hate him tooโ yup ๐
Big crowd at the SF Tesla dealership protesting our unelected overlord. A tiny sign hangs from an upstairs window
17.02.2025 20:31 โ ๐ 18208 ๐ 4036 ๐ฌ 209 ๐ 319
Building Hologram, a full-stack Elixir web framework
https://hologram.page
๐ป Co-founder / Engineering at @userlist.com
๐๏ธ Co-host at slowandsteadypodcast.com
๐ ๏ธ Ruby, Rails, Ember.js, PostgreSQL
โค๏ธ Loves music, food, and cooking
๐ฅ Plays the drums
benediktdeicke.com
Founder of Beam it ๐ & Rust Rec ๐ฆโ๏ธ
I recruit for Elixir, Erlang & Rust
Beamrec.com
Beamrecjobs.co.uk
Need help hiring? Beamrec.com/employers
Help with job hunting? Beamrec.com/employability
Tech, startups, video production & livestreaming.
Ruby on Rails dev since '05, also work w/ Lua, JavaScript, DaVinci Resolve, ATEMs, Bitfocus Companion & Lumix.
๐น @livetoair.uk ๐ https://allan.social
SF Bay Area transplant from Asunciรณn, Paraguay ๐ต๐พ. I enjoy hiking ๐ฅพwatching soccer โฝ๏ธ video games ๐ฎ drinking mate ๐งand pair-programming ๐จ๐ฝโ๐ป
Ruby programmer, product builder, and author. Iโm currently a lead product engineer at Plane, a Y Combinator startup making payroll tools. Read more @ https://kamil.fyi
CTO ezHomeSearch.com. Rails dev since 2005.
a.k.a copiousfreetime - not-quite-the social media person I used to be.
Mostly using Ruby to put 0๏ธโฃs and 1๏ธโฃs in the wrong order.
Professional software engineer during day.
Indie-hacker during night and weekends.
Building: https://hipcv.com
YouTube: https://youtube.com/@amitgaur
*Opinions are mine, only if they are good.
Ruby on Rails Engineer at the State of Colorado Legislature ๐ป
Outdoor enthusiast โฐ๏ธ
Lover of all things Mathy ๐งฎ
Pizza snob ๐
Momma of 2 boys ๐ ๐
Web developer, musician, semi-pro party mom. +3 to Persistence Rolls. Operations & systems nerd.
Building my own apps on the side @practical.computer
https://thomascannon.me
https://little-crm.com
https://practical.computer
Artist, Designer, Engineer.
Capsule.graphics is my studio.
I make Camping (Rubycamping.org)
Diverse and inclusive Ruby community by default.
Writing a new language: Kona (konascript.org)
Black Lives Matter.
Anti Fascist.
Democratic Socialist.
Engineer, reader, believer in numbers, author of Eloquent Ruby, Getting Clojure, Design Patterns in Ruby and Overdrive.
He, him
๐จโ๐ป Principal Engineer @ IBM / HashiCorp
๐ ๏ธ Cloud infra & distributed systems w/ Golang & Ruby
โจ๏ธ Self Appointed zed.dev fan boy
๐ Cpt of the Dangling Pointersโข, Minnesotaโs premier shitty beer league hockey team
St. Paul, MN
https://chrisarcand.com
Donโt talk about it, be about it -- one day at a time.
๐ด speedrunning retirement
๐งโ๐ป (prev.) cheap science officer at Rebased
๐คนโโ๏ธ (prev.) assistant professor at WUTโs cybersecurity division
๐ (bad co-)maintainer of Bogus & Reek
๐ท (very amateur) street photographer
๐ซ he/him, vim/nvim
๐ธ๏ธ https://chastell.net
Staff engineer and lackwit gadabout at Gitlab. LD ๐๏ธ rider. Part ๐ฆ, part ๐ฆ. WIP.
Tech Enthusiast ๐จ๐ฟโ๐ป | 3.3X Engineer ๐ฅท๐ | Rubyist ๐ | Founder/CEO of IntellectStorm ๐ง ๐ช๏ธ | Founder/Chairman of Black Entrepreneurs Association ๐ฌ๐พ