You need evals to ship AI features
AI features are unpredictable and traditional tests fall short. Evals, automated checks for AI behavior, help you prevent regressions and measure success.
If you're shipping AI without evals, you're running prod on vibes.
At Builder, we wire evals into CI, A/B tests, and prod to catch regressions and prove AI feature upgrades actually help our users.
Here's how to set up your own.
www.builder.io/blog/ai-evals
11.08.2025 16:29 — 👍 0 🔁 0 💬 0 📌 0
A two-panel meme using stills of the main character from the TV show Squid Game. The top panel shows the character smiling gleefully with the caption, "watching ai generate code." The bottom panel shows the same character with a serious, stressed expression, captioned, "reviewing ai generated code."
Some tips for code reviews without the tears: www.builder.io/blog/code-r...
05.08.2025 14:34 — 👍 1 🔁 0 💬 0 📌 0
Test-driven development with AI
Learn how AI transforms test-driven development (TDD) from a time-consuming chore into your secret weapon for building robust and bug-free applications.
TDD feels bad. If I know how to write the function, why start with the spec?
But AI agents switch it up. They need guardrails, and tests let them iterate on even the most complex problems.
I'm convinced AI TDD is the way. Here's what changed my mind:
www.builder.io/blog/test-d...
29.07.2025 16:02 — 👍 1 🔁 0 💬 0 📌 0
So, does macOS 26 solve the bug where the Passwords app always opens behind your currently open app? I think that's all I care about for advancements.
26.07.2025 16:00 — 👍 0 🔁 0 💬 0 📌 0
looking at the sink: "ugh, time to do plate laundry again"
21.07.2025 21:40 — 👍 3 🔁 0 💬 0 📌 0
you gotta scrape OpenAI back sometimes
17.07.2025 13:57 — 👍 0 🔁 0 💬 1 📌 0
Keeping Figma in sync with Storybook (and your entire design system) just got way easier.
14.07.2025 13:55 — 👍 1 🔁 0 💬 1 📌 0
Weird anti-AI scraping technique: digital tar pits.
Websites hide links that lead bots to an endless maze of auto-generated gibberish that all link back to itself, poisoning training data.
Wouldn't recommend, but still an interesting tactic against overly eager scrapers.
10.07.2025 14:31 — 👍 0 🔁 0 💬 0 📌 0
Design to Code with the Figma MCP Server
Turn Figma designs into code using MCP servers. Skip screenshot guesswork and let AI access structured design data directly through Figma's API.
Figma's *official* Dev Mode MCP server is now in open beta, and it's a pretty awesome tool!
I've updated this article with how best to use it, how its AI works under the hood, and where some alternate workflows can augment your design to code process.
www.builder.io/blog/figma-...
08.07.2025 15:33 — 👍 1 🔁 1 💬 1 📌 0
sometimes when i listen to Hozier, i'm just like, "Andrew, stop it, we're in _public_"
07.07.2025 15:50 — 👍 0 🔁 0 💬 0 📌 0
A diagram illustrating the MCP architecture, with the title "Pick the right one" pointing to the client. The 'MCP client' is shown connecting to various AI models (like ChatGPT), while it communicates with an 'MCP server' that connects to different backend services (like Figma, Notion, and GitHub).
Hot take: Design handoffs won't work until designers can ship code.
Current MCP clients force you into dev tools or basic chat. PMs spec in docs, designers mock in Figma, and devs rebuild everything.
What if one MCP client let the whole team code? 👇
builder.io/blog/mcp-cl...
03.07.2025 16:05 — 👍 0 🔁 0 💬 0 📌 0
Debugging dark mode issues in an old, messy SvelteKit codebase is usually nightmare fuel.
This morning, I connected my repo and put Builder Fusion on the case. It agentically iterated with the code and full DOM context, and it figured out the problem with a single prompt.
27.06.2025 14:40 — 👍 1 🔁 0 💬 1 📌 0
Dictionary entry for "bafflegab," a noun that means "incomprehensible or pretentious language, especially bureaucratic jargon."
new favorite word just dropped
27.06.2025 09:33 — 👍 0 🔁 0 💬 0 📌 0
A visual comparison of two development workflows. The top timeline moves from “Idea” through “Wireframe,” “Prototype,” and “Code” to “PR,” ending with a red X. The bottom skips intermediate steps, jumping from “Idea” directly to “PR,” with a dotted line labeled “Time saved” and a green checkmark.
What if you could mock up prototypes that stakeholders can actually click through?
You don't have to just pretend it works.
Use a real GitHub PR workflow and mock a site all in less time than it takes to wireframe. Here’s how.
www.builder.io/blog/mock-u...
10.06.2025 16:09 — 👍 0 🔁 0 💬 0 📌 0
Image illustrating Figma Model Context Protocol (MCP) translating a product card design (left, showing visual layers and dimensions) into a React code component structure (right, with placeholders like ProductImage, ProductName).
Figma's new MCP server gives AI access to structured design data instead of screenshots. But is the resulting code pixel-perfect?
I've put it to the test is my latest post.
www.builder.io/blog/figma-...
02.06.2025 16:47 — 👍 0 🔁 0 💬 0 📌 0
The word “login” comes from throwing a log attached to a rope with knots overboard a ship to see how many knots go by over time (see also, knots as speed). You’d then put that info in the “log book.” You’d “log in” on a regular basis. This wasn’t from 1959, it was likely from 1689! Etymology baby!
17.05.2025 07:46 — 👍 1086 🔁 178 💬 28 📌 14
Visual comparison highlighting a preferred method for AI development. The "Vibe coding" approach (disapproved with a red X) shows a direct, simplistic request to "Create an ecommerce app". In contrast, "AI for grown-ups" (approved with a green checkmark) shows a more structured approach involving design (Figma), code, and data considerations feeding into the AI prompt.
My problem with vibe coding isn't the AI; it's the vibe.
Many AI tools disrespect your team's hard work and talent. What we need is AI for grown-ups.
Here's my thoughts on what that means.
builder.io/blog/ai-for...
15.05.2025 16:03 — 👍 0 🔁 0 💬 0 📌 0
an AI agent that reminds me why I walked into a room
07.05.2025 19:46 — 👍 17 🔁 1 💬 3 📌 0
ugh stop posting such relevant content
07.05.2025 20:37 — 👍 1 🔁 0 💬 1 📌 0
I want less girlboss & more girlfail having a very bad day but being very brave representation
07.05.2025 07:40 — 👍 305 🔁 26 💬 6 📌 0
You can teach any agent to fish, but wouldn't you rather it know who to call to get fish on demand?
This is what Google's new A2A protocol promises: your agent gets a list of contacts for when the questions get too tough.
www.builder.io/blog/a2a-pro...
05.05.2025 18:05 — 👍 1 🔁 0 💬 0 📌 0
AI agents are like Twitter devs: great solo, terrible at collaboration.
MCP gave agents standardized tool access to information across the internet, but the story isn't done.
A generalist agent with tons of tools still isn't as useful as an orchestrated network of specialists.
05.05.2025 18:01 — 👍 3 🔁 1 💬 1 📌 0
Yep. I’d rather a hastily thrown together diagram than some six-fingered action figures.
05.05.2025 09:20 — 👍 0 🔁 0 💬 0 📌 0
This is dedication. 😁
01.05.2025 17:56 — 👍 0 🔁 0 💬 1 📌 0
Fine-tune an LLM: Why, when, and how
Fine-tuning LLMs can save tokens, guarantee output formats, & bake in edge-case fixes. Learn when to fine-tune & how to do it effectively for your AI projects.
Teach the model once, and then let it work.
In other words: fine-tuning.
- 50 great examples > 5k-token prompt.
- Tone of voice and tool calls get baked right into the model weights.
- Adapters make this possible for all. No second mortgage for GPUs.
www.builder.io/blog/fine-t...
28.04.2025 18:41 — 👍 1 🔁 0 💬 0 📌 0
Diagram showing the composition of a full prompt sent to an LLM, consisting of a large "chonky system prompt" ($0.25 per request) and a small "user query" ($0.001 per request), highlighting the cost difference.
If you're in the midst of yet another 5,000-word manifesto-as-AI-prompt, it might be time to take a beat.
There's a nicer route...
28.04.2025 18:41 — 👍 3 🔁 1 💬 1 📌 0
Senior Concept Artist
She/her. *1993.
https://ko-fi.com/undercurrent32
🏳️⚧️🏳️🌈
Art, mechs, hypnosis, 🔞
Married with @butch-ish.bsky.social
Pfp by @straybimboroadk1ll.bsky.social
css enjoyer, opinionated urbanist, gay and trans
building @namesake.fyi 🏳️⚧️
https://eva.town/guestbook
tired, gay, makes internet on computer.
👨🏼💻 care about the web & tools. prev glitch, fastly, various civic techs
🏠 beautiful oakland, ca
https://keith.is
(She/her) Freelance character artist, primarily for ttrpgs and most things fantasy!
Tags: #art , #exalted , #oc
💻 Designer & Product Manager.
🕸️ Now: Web Platform @ Igalia. Prev: Microsoft Edge.
📚 Author of Design for Developers.
🔮 Sassy web witch & actual witch.
🏴 Seattle gal in England.
💍 @jhey.dev
https://seaotta.dev
Full time bog hag part time designer, anti tech bro, accessibility + inclusive design advocate
💙 She/they in Seattle
Modern Landscape Painter
JimMusil.com
Designers should build.
Nordcraft combines a visual design tool with a powerful web framework, providing everything you need to craft extraordinary web apps and websites.
Sign up at http://nordcraft.com 🌲
Top 99% software engineer.
Girl dad and boy boss.
Co-founder of https://nordcraft.com
permanent Philadelphian in NYC, opinions mine
politics @teenvogue.com
member @transjournalists.org
@leximcmenamin elsewhere
linktr.ee/leximcmenamin
❤️ Generative AI + developer tools
💻 AWS Senior Principal Engineer, Amazon Q Developer
🔗 https://clare.dev/
cofounder/CTO @honeycombio, co-author of Observability Engineering and Database Reliability Engineering. I test in production and so do you. 🐝🏳️🌈🦄
Principal engineer @ Cloudflare. I talk about tech sometimes
Location: Squamish, BC 🇨🇦
Website: https://jeremymorrell.dev/
Senior dev @github.com| Speaker & educator | Talking accessibility, refactoring, AI & career growth | Host of Overcommitted | The Balanced Engineer | Building sustainable careers in tech | brittanyellich.com #pdx
📹 I make tech videos
👩💻 Software engineer 12+ years
☁️ Currently @ Fly.io
Previously @ Render and Heroku
Fly.io: https://www.youtube.com/@flydotio
Annie: https://www.youtube.com/@AnnieSexton1
I also make comic books. See my art @anniecomics.bsky.social
Just a spooky girl livin’ in a Barbie world ✨🖤
27 🎂 Book lover 📚 government hater 😒 queer poc 🥰
Squid biologist
Science communicator
Artist
Philadelphian
I run @SkypeAScientist.bsky.social
Creator of the SquidMobile
https://linktr.ee/Sarahmackattack
- OG web - design - hci - reading - writing - saas biz - edu biz - photos - cats - desert - she
29. she/they ✨ malaysian!! ✨ queer
[HOBBY ARTIST]
do not rp/ claim my ocs.
I do not allow my art to be used as display icons!
Ko-fi: https://ko-fi.com/tujiux
NSFW: @zhuquex.bsky.social
Dice: https://earthlydice.etsy.com
pfp: @kurocyou.bsky.social
Bass player and synth maker
Bass with Kelly Romo, co-founder of The Coven makerspace
She/they 🏳️⚧️ ATL
Trans joy is praxis.
Links: okaysure.cool