From Gut Feeling to AI assisted: How We Automated Flaky Test Investigation
Over the last year, weβve improved our flaky tests investigation process, moving from manual debugging to an AI first solution. Thisβ¦
"Claude did it almost on its own while I was boiling water for my tea"
That's how our engineers now fix flaky tests at Alan.
Read more about our journey from manual debugging to AI-powered investigation in our latest article π
medium.com/alan/from-gu...
21.11.2025 11:31 β π 2 π 2 π¬ 0 π 0
Benchmarking AI Agents: The Challenge of Real-World Evaluation
AI agents need stateful benchmarks. Unlike LLMs, agents interact with databases and users. We explore why and how to evaluate themβ¦
There are many LLM benchmarks such as MMLU and GSM8k, but they're useless for AI agents.
Real agents need to handle database state, tool calling, and multi-turn conversations. Stateful benchmarks show the path forward.
New post on agent evaluation π
15.10.2025 10:36 β π 0 π 1 π¬ 0 π 0
Our ISO 27001 journey: From security blueprint to certification success
Hey π Iβm Maxime, the ISMS lead at Alan, and Iβd like to tell you about our ISO journey πΊοΈ
You've heard of the recent ISO27001:2022 certification of Alan by SGS, but want to know more about our journey towards certification? Head up to Maxime's post and enjoy the read!
medium.com/alan/our-iso...
02.07.2025 15:05 β π 2 π 3 π¬ 0 π 1
Here's how we did it π
medium.com/alan/inside-...
24.06.2025 07:48 β π 0 π 0 π¬ 0 π 0
Static chatbots couldn't handle complex support tickets about insurance claims. So we built something different with tool calls and the ReAct framework.
Our Claim Agent investigates dynamically - just like human agents, but faster. Now automating 30% of tickets it receives.
24.06.2025 07:48 β π 0 π 0 π¬ 1 π 1
From Chaos to Consistency: How Alan Transformed Developer Experience with Devbox
π οΈ How we tamed the "works on my machine" chaos at Alan Engineering!
Our new blog post reveals how Devbox transformed our dev experience, slashed onboarding time, and created consistent environments across our entire team.
Check it out: medium.com/alan/from-ch...
30.04.2025 10:36 β π 4 π 2 π¬ 0 π 0
DeepSeek R1: Demystifying LLMβs Reasoning Capabilities
DeepSeek shocked the world by dropping an open-weight successor to OpenAIβs o1: R1. This post summarizes our learnings from the techβ¦
In late January, DeepSeek shocked the world by dropping an open-weight successor to OpenAI's o1: R1. Their tech report discusses how to incentivize reasoning capability in LLMs. We share our learnings at:
03.03.2025 20:11 β π 0 π 1 π¬ 0 π 0
How We Built Alanβs AI Assistant for Customer Support
At Alan, exceptional customer care isnβt just a serviceβββitβs a core part of what sets us apart in the insurance industry.
In 2024, we automated 20% of customer contacts with AI while maintaining the same customer satisfaction (the number is still growing!) π
Learn about our journey:
08.01.2025 16:03 β π 5 π 3 π¬ 0 π 0