Alan engineering @alanengineering

From Gut Feeling to AI assisted: How We Automated Flaky Test Investigation Over the last year, we’ve improved our flaky tests investigation process, moving from manual debugging to an AI first solution. This…

"Claude did it almost on its own while I was boiling water for my tea"

That's how our engineers now fix flaky tests at Alan.

Read more about our journey from manual debugging to AI-powered investigation in our latest article 👇

medium.com/alan/from-gu...

21.11.2025 11:31 — 👍 2 🔁 2 💬 0 📌 0

Benchmarking AI Agents: Stop Trusting Headline Scores, Start Measuring Trade-offs Don’t chase leaderboards. Run the benchmark yourself and map your score–latency–cost frontier to choose models for production.

Don't chase leaderboards. There's always a score-latency tradeoff, and LLM providers often hide it.
We reproduced Tau-bench to see for ourselves. Here's what we learned:

medium.com/alan/benchma...

03.11.2025 19:28 — 👍 0 🔁 0 💬 0 📌 0

Benchmarking AI Agents: The Challenge of Real-World Evaluation AI agents need stateful benchmarks. Unlike LLMs, agents interact with databases and users. We explore why and how to evaluate them…

There are many LLM benchmarks such as MMLU and GSM8k, but they're useless for AI agents.
Real agents need to handle database state, tool calling, and multi-turn conversations. Stateful benchmarks show the path forward.

New post on agent evaluation 👇

15.10.2025 10:36 — 👍 0 🔁 1 💬 0 📌 0

Our ISO 27001 journey: From security blueprint to certification success Hey 👋 I’m Maxime, the ISMS lead at Alan, and I’d like to tell you about our ISO journey 🗺️

You've heard of the recent ISO27001:2022 certification of Alan by SGS, but want to know more about our journey towards certification? Head up to Maxime's post and enjoy the read!
medium.com/alan/our-iso...

02.07.2025 15:05 — 👍 2 🔁 3 💬 0 📌 1

Here's how we did it 👇
medium.com/alan/inside-...

24.06.2025 07:48 — 👍 0 🔁 0 💬 0 📌 0

Static chatbots couldn't handle complex support tickets about insurance claims. So we built something different with tool calls and the ReAct framework.

Our Claim Agent investigates dynamically - just like human agents, but faster. Now automating 30% of tickets it receives.

24.06.2025 07:48 — 👍 0 🔁 0 💬 1 📌 1

How I found my security calling at Alan This is the story of my journey from engineering to security at Alan, and what I’ve learned about building security that scales with our…

After four years at Alan from Engineering to Security, I’ve seen what makes security succeed and here is what I learned: medium.com/alan/how-i-f...

02.06.2025 10:36 — 👍 2 🔁 2 💬 0 📌 0

From Chaos to Consistency: How Alan Transformed Developer Experience with Devbox

🛠️ How we tamed the "works on my machine" chaos at Alan Engineering!

Our new blog post reveals how Devbox transformed our dev experience, slashed onboarding time, and created consistent environments across our entire team.

Check it out: medium.com/alan/from-ch...

30.04.2025 10:36 — 👍 4 🔁 2 💬 0 📌 0

DeepSeek R1: Demystifying LLM’s Reasoning Capabilities DeepSeek shocked the world by dropping an open-weight successor to OpenAI’s o1: R1. This post summarizes our learnings from the tech…

In late January, DeepSeek shocked the world by dropping an open-weight successor to OpenAI's o1: R1. Their tech report discusses how to incentivize reasoning capability in LLMs. We share our learnings at:

03.03.2025 20:11 — 👍 0 🔁 1 💬 0 📌 0

How We Built Alan’s AI Assistant for Customer Support At Alan, exceptional customer care isn’t just a service — it’s a core part of what sets us apart in the insurance industry.

In 2024, we automated 20% of customer contacts with AI while maintaining the same customer satisfaction (the number is still growing!) 🚀
Learn about our journey:

08.01.2025 16:03 — 👍 5 🔁 3 💬 0 📌 0

Alan engineering

Latest posts by alanengineering.bsky.social on Bluesky

@alanengineering is following 1 prominent accounts