Johan Carlin's Avatar

Johan Carlin

@johancarlin.com.bsky.social

Python, data, AI. Recovering academic. twitter.com/johancarlin - abandoned for obvious reasons https://fosstodon.org/@johancarlin

86 Followers  |  323 Following  |  132 Posts  |  Joined: 09.11.2024  |  1.9147

Latest posts by johancarlin.com on Bluesky

Corollary to this - biggest mistake I see people do creating a MCP server is reflect their entire REST API shape as MCP tools.

You donโ€™t have to do that. 10 REST API endpoints for placing an order can be easily made into one tool - place_order. Hide the wiring behind the wall.

12.02.2026 21:32 โ€” ๐Ÿ‘ 17    ๐Ÿ” 5    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

If however you don't have a clear vision for what this thing should become, the agents start spinning their wheels in no time. It's great to be able to test your ideas like this. Sometimes it doesn't come together, and now the fault is usually with the vision more than the agent's ability to execute

06.02.2026 21:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Sure, some of the orchestration of the plan-build-PR pipe could probably be delegated too. But there is something you have that the agent does not and that is *intent*. In this sense the situation with agents is not so different from working with skilled but poorly business-aligned engineers.

06.02.2026 21:19 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Take a capable agent like CC and it's even worse. You put down your half-baked shower thoughts and ask it to make a plan. It's beautiful. You tell it to build. It does, wonderfully. Another agent looks at the PR and you wave it through. Could you go gas town and replace yourself with another agent?

06.02.2026 21:14 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Leading engineers is bad enough. Half the time you feel badly out of your depth. You handwave at a technical approach and hint at the business value and good engineers take that and build something amazing. It's easy to feel like an imposter but I've seen what a difference that direction makes ...

06.02.2026 20:38 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Ok great so terminal wins for agents but then the question becomes how do you sandbox it effectively because the current YOLO approach is going to become a security disaster if widely adopted

06.02.2026 12:31 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
screenshot of a claude conversation that says 

too many em-dashes

You're right--let me fix that:

screenshot of a claude conversation that says too many em-dashes You're right--let me fix that:

๐Ÿคจ

29.01.2026 03:21 โ€” ๐Ÿ‘ 20    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2

One of the problems with being UTC+1 is that my morning bluesky feed is mostly late night, bleary-eyed, two-drinks-in American expressions of rage and grief over the death of the republic. It's quite the way to wake up

28.01.2026 07:12 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Surprised to see OpenAI disable SSL validation here in the chatgpt sandbox. I'm sure it's a manageable risk in their setup but kind of unnecessary to accept it

26.01.2026 19:42 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

A very sane AI usage policy for any open source project that still cares about quality.

23.01.2026 18:10 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

One of the hardest thing for humans to do consistently is get risk factors in the correct order. Confusing primary effects with third order fourth order or fifth order effects is shockingly human.

This kid is now over 20โ€ฆ I wish some reporter would do a follow up story

23.01.2026 02:04 โ€” ๐Ÿ‘ 11    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The Lidl panettone has been open for a month now and it's still completely fresh. A few more Christmases and when it's time I won't need embalming

22.01.2026 18:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Codex Web is similarly handicapped at the moment, presumably it's a security thing (the repo branch is checked out before the agent starts so it has no access to credentials)

22.01.2026 11:41 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

the rate of change in the AI space is so fast that i cannot explain to my work colleagues that the API we have access to (GPT-4o) is so outdated as to be useless because there's no single chart that depicts both models on the same benchmarks.

21.01.2026 22:16 โ€” ๐Ÿ‘ 52    ๐Ÿ” 3    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 0
Vibing My Blog Back to Life with Codex Web ''

Spent stolen moments during the holidays vibing a refresh of my old blog with codex web. I came away with a fancy Sveltia CMS and an appreciation for Codex Web, which is limited yet highly addictive www.johancarlin.com/vibing-my-bl...

21.01.2026 19:19 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

docx linters anyone?

20.01.2026 06:03 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

There's a step down in agent performance from software development to standard office apps. In part it's that the tooling is not there yet (office/gdocs gui integrations are crude compared to VS Code).

But in part there's just no feedback loop without automatic tests and that's harder to fix

20.01.2026 06:02 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Creator of node.js

20.01.2026 03:25 โ€” ๐Ÿ‘ 71    ๐Ÿ” 6    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 2

Brian Eno: "The great benefit of computer sequencers is that they remove the issue of skill, and replace it with the issue of judgement."

19.01.2026 15:28 โ€” ๐Ÿ‘ 49    ๐Ÿ” 8    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Spent a bit of time vibing from my phone with codex web. The big showstopper so far is it has no way to handle merge conflicts or indeed respond to any incoming changes from the feature branch. Makes it tricky to run parallel tasks

16.01.2026 18:42 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Recruiting developers is tricky these days, half the CVs are LLM-reworded versions of your job ad...

15.01.2026 12:28 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Gas Town Emergency User Manual Itโ€™s been a busy 12 days since I launched Gas Town. ๏ธIโ€™ve merged over 100 PRs from nearly 50 contributors, adding 44k lines of code that noโ€ฆ

WTAF
steve-yegge.medium.com/gas-town-eme...

14.01.2026 19:39 โ€” ๐Ÿ‘ 149    ๐Ÿ” 9    ๐Ÿ’ฌ 30    ๐Ÿ“Œ 24
Post image Post image

The future is unevenly distributed.

12.01.2026 09:21 โ€” ๐Ÿ‘ 140    ๐Ÿ” 17    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 2

We really need to come up with better metaphors for AI agents than junior developers and interns. Itโ€™s far more of a reflection of how disrespectfully we treat those roles than it is a useful metaphor for how agents behave

03.01.2026 09:20 โ€” ๐Ÿ‘ 55    ๐Ÿ” 14    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0

Yeah it's exactly backwards. In our codebase we're having to write documentation because the LLM doesn't do word of mouth. Onboarding LLMs means writing everything down in the repo and that's a huge win for humans also

03.01.2026 14:58 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Stateful agentic LLMs are cool and all but I hope we get past the anthropomorphizing tamagotchi phase soon. I don't think it's a helpful mental model to think of these things as individuals with agency

02.01.2026 20:35 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Has nobody dealt with their legal, finance, and procurement folks at their companies? Y'all realize that people buy software to have "a throat to choke", right?

The functionality is... not that important. It's why RedHat exists, for example: the software is free, but big companies don't *want* free

30.12.2025 21:08 โ€” ๐Ÿ‘ 8    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Preview
Why Opinionated, Code-First Systems Are the Future of Data How AI amplifies strong abstractions instead of replacing them

The idea that code-first BI tools will gain traction so that AI workflows can generate end to end analytics pipelines that work with both the opinions of the tools and the opinions on the developers is really compelling.

open.substack.com/pub/hanson37...

30.12.2025 13:09 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Tibo @thsottiaux
X.com
Background Terminal is a Codex feature we've been baking for a bit and will soon turn on by default.
More powerful than it sounds, it allows to Codex to multiplex work and significantly speeds things up.
Default-on within OpenAl. You can enable with / experimental in the CLI.
10:01PM โ€ข 12/27/25 โ€ข 87K Views
63
17 59
737
203
Relevant v
View quotes >
Tibo
@thsottiauxโ€ข 19h
...
When the feature is enabled, you can often catch the model doing something else while it waits for a command to finish for example. Or spin up multiple ones to run multiple test suites concurrently.
11
t7 13
โ€ข 204
ฤฑll 8.2K
ไผ

Tibo @thsottiaux X.com Background Terminal is a Codex feature we've been baking for a bit and will soon turn on by default. More powerful than it sounds, it allows to Codex to multiplex work and significantly speeds things up. Default-on within OpenAl. You can enable with / experimental in the CLI. 10:01PM โ€ข 12/27/25 โ€ข 87K Views 63 17 59 737 203 Relevant v View quotes > Tibo @thsottiauxโ€ข 19h ... When the feature is enabled, you can often catch the model doing something else while it waits for a command to finish for example. Or spin up multiple ones to run multiple test suites concurrently. 11 t7 13 โ€ข 204 ฤฑll 8.2K ไผ

background terminal in Codex seems like it would really speed things up

29.12.2025 01:14 โ€” ๐Ÿ‘ 18    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

The occasional Google inactive account manager reminder emails are a real memento mori

17.12.2025 05:48 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@johancarlin.com is following 20 prominent accounts