Corollary to this - the biggest mistake I see people make when creating an MCP server is mirroring their entire REST API shape as MCP tools.
You don't have to do that. 10 REST API endpoints for placing an order can easily be made into one tool - place_order. Hide the wiring behind the wall.
12.02.2026 21:32 - 👍 17 🔁 5 💬 2 📌 0
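A minimal sketch of the idea in the post above, written in plain Python rather than against any real MCP SDK. `FakeOrderApi`, its four methods, and the endpoint paths are hypothetical stand-ins for the REST calls; only the name `place_order` comes from the post. The agent would see one tool while the multi-step wiring stays internal.

```python
from dataclasses import dataclass, field


@dataclass
class FakeOrderApi:
    """Hypothetical stand-in for a REST backend; records each call it receives."""
    calls: list = field(default_factory=list)

    def create_cart(self, customer_id: str) -> str:
        self.calls.append("POST /carts")
        return f"cart-{customer_id}"

    def add_item(self, cart_id: str, sku: str, quantity: int) -> None:
        self.calls.append(f"POST /carts/{cart_id}/items")

    def set_shipping(self, cart_id: str, address: str) -> None:
        self.calls.append(f"PUT /carts/{cart_id}/shipping")

    def checkout(self, cart_id: str) -> str:
        self.calls.append(f"POST /carts/{cart_id}/checkout")
        return f"order-for-{cart_id}"


def place_order(api: FakeOrderApi, customer_id: str,
                items: dict[str, int], address: str) -> dict:
    """The single tool an agent would call; the endpoint sequence is hidden here."""
    if not items:
        raise ValueError("an order needs at least one item")
    cart_id = api.create_cart(customer_id)
    for sku, quantity in items.items():
        api.add_item(cart_id, sku, quantity)
    api.set_shipping(cart_id, address)
    order_id = api.checkout(cart_id)
    return {"order_id": order_id, "item_count": sum(items.values())}


api = FakeOrderApi()
result = place_order(api, "c42", {"sku-1": 2, "sku-2": 1}, "1 Example Street")
print(result)
print(len(api.calls), "REST calls hidden behind one tool")
```

In a real server the body of `place_order` would be registered as the tool handler, and the individual endpoints would not be exposed to the model at all.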
If, however, you don't have a clear vision for what this thing should become, the agents start spinning their wheels in no time. It's great to be able to test your ideas like this. Sometimes it doesn't come together, and now the fault is usually with the vision more than the agent's ability to execute.
06.02.2026 21:26 - 👍 0 🔁 0 💬 0 📌 0
Sure, some of the orchestration of the plan-build-PR pipe could probably be delegated too. But there is something you have that the agent does not and that is *intent*. In this sense the situation with agents is not so different from working with skilled but poorly business-aligned engineers.
06.02.2026 21:19 - 👍 0 🔁 0 💬 1 📌 0
Take a capable agent like CC and it's even worse. You put down your half-baked shower thoughts and ask it to make a plan. It's beautiful. You tell it to build. It does, wonderfully. Another agent looks at the PR and you wave it through. Could you go gas town and replace yourself with another agent?
06.02.2026 21:14 - 👍 0 🔁 0 💬 1 📌 0
Leading engineers is bad enough. Half the time you feel badly out of your depth. You handwave at a technical approach and hint at the business value and good engineers take that and build something amazing. It's easy to feel like an imposter but I've seen what a difference that direction makes ...
06.02.2026 20:38 - 👍 0 🔁 0 💬 1 📌 0
OK, great, so terminal wins for agents, but then the question becomes how you sandbox it effectively, because the current YOLO approach is going to become a security disaster if widely adopted.
06.02.2026 12:31 - 👍 0 🔁 0 💬 0 📌 0
screenshot of a claude conversation that says
too many em-dashes
You're right--let me fix that:
🤨
29.01.2026 03:21 - 👍 20 🔁 1 💬 2 📌 2
One of the problems with being UTC+1 is that my morning bluesky feed is mostly late night, bleary-eyed, two-drinks-in American expressions of rage and grief over the death of the republic. It's quite the way to wake up
28.01.2026 07:12 - 👍 0 🔁 0 💬 0 📌 0
Surprised to see OpenAI disable SSL validation here in the chatgpt sandbox. I'm sure it's a manageable risk in their setup but kind of unnecessary to accept it
26.01.2026 19:42 - 👍 1 🔁 0 💬 0 📌 0
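For readers unsure what disabling SSL validation amounts to, here is a small sketch using Python's standard `ssl` module. It says nothing about how the ChatGPT sandbox is actually configured; it only contrasts a default, verifying client context with one where certificate and hostname checks have been switched off.

```python
import ssl

# Default client context: certificates are verified and hostnames are checked.
verified = ssl.create_default_context()

# The risky variant: both checks turned off. check_hostname has to be
# disabled first, otherwise setting CERT_NONE raises ValueError.
unverified = ssl.create_default_context()
unverified.check_hostname = False
unverified.verify_mode = ssl.CERT_NONE

for name, context in (("verified", verified), ("unverified", unverified)):
    print(name, context.verify_mode.name, "check_hostname =", context.check_hostname)
```

With the second context a client will accept any certificate from any host, which is why the setting is usually only tolerated when the network path is otherwise controlled.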
A very sane AI usage policy for any open source project that still cares about quality.
23.01.2026 18:10 - 👍 0 🔁 1 💬 0 📌 0
One of the hardest things for humans to do consistently is get risk factors in the correct order. Confusing primary effects with third-, fourth-, or fifth-order effects is shockingly human.
This kid is now over 20… I wish some reporter would do a follow-up story
23.01.2026 02:04 - 👍 11 🔁 1 💬 0 📌 0
The Lidl panettone has been open for a month now and it's still completely fresh. A few more Christmases and when it's time I won't need embalming
22.01.2026 18:22 - 👍 0 🔁 0 💬 0 📌 0
Codex Web is similarly handicapped at the moment, presumably it's a security thing (the repo branch is checked out before the agent starts so it has no access to credentials)
22.01.2026 11:41 - 👍 1 🔁 0 💬 0 📌 0
the rate of change in the AI space is so fast that i cannot explain to my work colleagues that the API we have access to (GPT-4o) is so outdated as to be useless because there's no single chart that depicts both models on the same benchmarks.
21.01.2026 22:16 - 👍 52 🔁 3 💬 5 📌 0
Vibing My Blog Back to Life with Codex Web
Spent stolen moments during the holidays vibing a refresh of my old blog with codex web. I came away with a fancy Sveltia CMS and an appreciation for Codex Web, which is limited yet highly addictive www.johancarlin.com/vibing-my-bl...
21.01.2026 19:19 - 👍 1 🔁 0 💬 0 📌 0
docx linters anyone?
20.01.2026 06:03 - 👍 0 🔁 0 💬 0 📌 0
There's a step down in agent performance from software development to standard office apps. In part it's that the tooling is not there yet (office/gdocs gui integrations are crude compared to VS Code).
But in part there's just no feedback loop without automatic tests and that's harder to fix
20.01.2026 06:02 - 👍 2 🔁 0 💬 1 📌 0
Creator of node.js
20.01.2026 03:25 - 👍 71 🔁 6 💬 5 📌 2
Brian Eno: "The great benefit of computer sequencers is that they remove the issue of skill, and replace it with the issue of judgement."
19.01.2026 15:28 - 👍 49 🔁 8 💬 1 📌 0
Spent a bit of time vibing from my phone with codex web. The big showstopper so far is it has no way to handle merge conflicts or indeed respond to any incoming changes from the feature branch. Makes it tricky to run parallel tasks
16.01.2026 18:42 - 👍 2 🔁 0 💬 0 📌 0
Recruiting developers is tricky these days, half the CVs are LLM-reworded versions of your job ad...
15.01.2026 12:28 - 👍 0 🔁 0 💬 0 📌 0
The future is unevenly distributed.
12.01.2026 09:21 - 👍 140 🔁 17 💬 4 📌 2
We really need to come up with better metaphors for AI agents than junior developers and interns. It's far more of a reflection of how disrespectfully we treat those roles than it is a useful metaphor for how agents behave
03.01.2026 09:20 - 👍 55 🔁 14 💬 3 📌 0
Yeah it's exactly backwards. In our codebase we're having to write documentation because the LLM doesn't do word of mouth. Onboarding LLMs means writing everything down in the repo and that's a huge win for humans also
03.01.2026 14:58 - 👍 3 🔁 0 💬 1 📌 0
Stateful agentic LLMs are cool and all but I hope we get past the anthropomorphizing tamagotchi phase soon. I don't think it's a helpful mental model to think of these things as individuals with agency
02.01.2026 20:35 - 👍 0 🔁 0 💬 0 📌 0
Has nobody dealt with their legal, finance, and procurement folks at their companies? Y'all realize that people buy software to have "a throat to choke", right?
The functionality is... not that important. It's why RedHat exists, for example: the software is free, but big companies don't *want* free
30.12.2025 21:08 - 👍 8 🔁 1 💬 2 📌 0
Why Opinionated, Code-First Systems Are the Future of Data
How AI amplifies strong abstractions instead of replacing them
The idea that code-first BI tools will gain traction, so that AI workflows can generate end-to-end analytics pipelines that work with both the opinions of the tools and the opinions of the developers, is really compelling.
open.substack.com/pub/hanson37...
30.12.2025 13:09 - 👍 0 🔁 1 💬 1 📌 0
Tibo @thsottiaux
X.com
Background Terminal is a Codex feature we've been baking for a bit and will soon turn on by default.
More powerful than it sounds, it allows Codex to multiplex work and significantly speeds things up.
Default-on within OpenAI. You can enable with /experimental in the CLI.
10:01PM • 12/27/25 • 87K Views
Tibo @thsottiaux • 19h
When the feature is enabled, you can often catch the model doing something else while it waits for a command to finish for example. Or spin up multiple ones to run multiple test suites concurrently.
background terminal in Codex seems like it would really speed things up
29.12.2025 01:14 - 👍 18 🔁 1 💬 2 📌 0
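To make the speed-up concrete, here is a rough sketch of running several commands concurrently instead of one after another. This is not how Codex implements Background Terminal; the suite names and the sleeping child processes are placeholders standing in for real test commands.

```python
import asyncio
import sys
import time


async def run_command(name: str, seconds: float) -> tuple[str, int]:
    """Run a stand-in 'test suite': a child Python process that just sleeps."""
    process = await asyncio.create_subprocess_exec(
        sys.executable, "-c", f"import time; time.sleep({seconds})"
    )
    return name, await process.wait()


async def main() -> list[tuple[str, int]]:
    # All three child processes are started before any of them is awaited,
    # so the total wall time is close to the slowest one, not the sum.
    return await asyncio.gather(
        run_command("unit", 0.5),
        run_command("integration", 0.5),
        run_command("lint", 0.5),
    )


start = time.perf_counter()
results = asyncio.run(main())
elapsed = time.perf_counter() - start
print(results, f"finished in {elapsed:.2f}s")
```

The same shape applies to an agent: while a long command is running in the background, it is free to start another one or keep working on something else.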
The occasional Google inactive account manager reminder emails are a real memento mori
17.12.2025 05:48 - 👍 1 🔁 0 💬 0 📌 0
Jerome Powell Defense Association
ML Eng. and econometrics. Honorary Commodities Trader**. more left-posting than normal.
**Regrettably degen trading on here, im sorry
Views dont reflect my employer
aka Scott Jennings. "a writer of some cursed repute" - @ed3d.net
MMO game dev for too long. Wrote that blog you snorted at in 1999. Currently working on an arcane hex and counter wargame from the 1970s, only on your PC.
signal: lum.66
A mirror account of @Kimi_Moonshot on Twitter (https://twitter.com/Kimi_Moonshot)
"Built by Moonshot AI to empower everyone to be superhuman. โก๏ธAPI: https://platform.moonshot.ai @KimiProduct where we share cool use cases and prompts."
You may know me from the Austrian parliament or national library, "hacktivist", computers this and that.
https://cards-for-ukraine.at
https://www.wired.com/story/heisse-preise-food-prices/
@badlogic@mastodon.gamedev.place
https://mariozechner.at
Lead product for Google AI Studio, working on the Gemini API, and AGI, my views!
Blogger: https://den.dev
Podcaster: https://theworkitem.com
Core Maintainer: https://modelcontextprotocol.io
Member of Technical Staff: @anthropic.com
Aspiring Digital Artist
Global Chief Economist @ Arch Capital Group | ex JPM AIG HUD | Husband to Jamie; Dad to Landon, Greyson & Logan | No investment advice & views are my own 🦬🇺🇸🇮🇱
https://www.linkedin.com/in/econ-parker/
#rustlang, #jj-vcs, atproto, shitposts, urbanism. I contain multitudes.
Working on #ruelang but just for fun.
Currently in Austin, TX, but from Pittsburgh. Previously in Bushwick, the Mission, LA.
Main Character Syndrome, but fun. Boston-based. My wood shop and my dogs are my personality. If you're still reading this, I'm your dad now. Sorry, I don't make the rules.
I don't speak for my employer.
signal: edr.91
engineering and infosec · he/him · vis tacita
maine bluesky feed: https://bsky.app/profile/did:plc:txfqncx66asrjzitxfur3of6/feed/aaap7ldsnvpkw
THE INTERNET IS SCARY: @internetisscary.org
Uses machine learning to study literary imagination, and vice-versa. Likely to share news about AI & computational social science / Sozialwissenschaft / 社会科学
Information Sciences and English, UIUC. Distant Horizons (Chicago, 2019). tedunderwood.com
Author: Verified: How to Think Straight, Get Duped Less, and Make Better Decisions about What to Believe Online (University of Chicago Press).
Researcher, infolit/misinfo/rhetoric/civic reasoning. Currently researching AI as tool for critical thinking.
the only thing worse than my code are my jokes
governance @opentelemetry.io
director of ai strategy @honeycomb.io
av by @extinctinks.net
(gender)queer transsexual / they/them / eng @oxide.computer / atheist and scientific humanist / 🦀 #rustlang / https://nexte.st / https://sunshowers.io
also on mastodon: @rain@hachyderm.io / https://hachyderm.io/@rain
I make software for people who make software.
https://danilocampos.com
Engineer at Posit (fka RStudio), technical lead at quarto.org
Learning Brazilian Choro, the 7-string guitar, and to deal with the Minneapolis winter after 10 years in Tucson