Sylvain Kalache @sylvainkalache

Will LLMs and Vibe Coding Fuel a Developer Renaissance? While the actual usefulness of LLMs is still debated, one thing is certain: engineers are being asked to do more with less.

While the actual usefulness of LLMs is still debated, one thing is certain: engineers are being asked to do more with less.

By @sylvainkalache.bsky.social

14.07.2025 11:30 — 👍 1 🔁 1 💬 0 📌 0

Rootly Roundtable: From Weak Signals to Confident Fixes · Zoom · Luma This roundtable will explore best practices for filtering weak signals from alert noise, enriching alerts with automated ownership context, and streamlining…

Join us for a roundtable conversation "From Weak Signals to Confident Fixes"

We’ll tackle everything from cutting alert noise to discussing underrated and overrated alert signals, as well as bug validation workflow before triggering an incident.

Happening tomorrow at 12PM ET lu.ma/aq5mp94m

17.06.2025 19:15 — 👍 0 🔁 0 💬 0 📌 0

Thank you for having me 🙌

17.06.2025 19:14 — 👍 1 🔁 0 💬 0 📌 0

Automate Models Training: An MLOps Pipeline with Tekton and Buildpacks | Towards Data Science A step-by-step guide to containerizing and orchestrating an ML training workflow without the Dockerfile headache, using a lightweight GPT-2 example.

MLOps got you down? Sick of wrestling with Dockerfiles?

@sylvainkalache.bsky.social unpacks a streamlined approach to MLOps, showing you how to automate your training pipeline with a clean, reproducible, and cloud-native workflow.

11.06.2025 00:06 — 👍 2 🔁 1 💬 0 📌 0

Is AI-assisted coding an incident magnet? Now that AI-assisted code is making its way into systems, should we be worried about how it affects SRE?

🧲 Microsoft and Google report that AI writes 30% of their code. Is AI-assisted coding becoming an incident magnet?

Are SREs about to be overwhelmed with the volume and complexity of incidents?

That’s what I explore in my latest for @leaddev.com leaddev.com/software-qua...

19.05.2025 20:32 — 👍 0 🔁 0 💬 0 📌 0

SREs: not all traffic drops are outages. Sometimes it’s Diwali. Or the World Cup.

@rootly.com's digging into that with LLMs- check out what their Head of Devrel @sylvainkalache.bsky.social had to say about it at KubeCon.

08.04.2025 15:23 — 👍 2 🔁 1 💬 1 📌 0

Rootly | Llama 4 underperforms: a benchmark against coding-centric models Rootly AI Labs analyzes the performance of Meta’s Llama 4 models and finds they underperform compared to competitors like Claude 3.5 Sonnet and Qwen2.5

3️⃣ An older version – Llama 3.3 70B-Versatile – performed even better than Llama 4 Maverick.

The benchmark – designed by the
@rootly.com AI Labs – tests models' ability to pick the correct pull request for a given bug description. The full findings 👉 rootly.com/blog/llama-4...

14.04.2025 16:22 — 👍 0 🔁 0 💬 0 📌 0

2️⃣ Second, we wanted to test it against models tailored for coding tasks. Unsurprisingly, it performs way under those. Llama 4 Maverick achieved only a 70% accuracy score. Alibaba’s Qwen2.5-Coder-32B is ranking the best at (90%), closely followed by GPT o3-mini (89%).

14.04.2025 16:22 — 👍 0 🔁 0 💬 1 📌 0

1️⃣ First, we wanted to reproduce Meta's findings that Llama 4 outperformed GPT-4o, Gemini 2.0 Flash, and DeepSeek v3.1—we found the exact opposite.

It came last, 6% less than the next best-performing model (DeepSeek) and 18% behind the overall top-performing model (GPT-4o).

14.04.2025 16:22 — 👍 0 🔁 0 💬 1 📌 0

There's been a lot of controversy with the launch of Llama 4 and its performance. So we decided to do our own benchmark, and here is what we found:

14.04.2025 16:22 — 👍 0 🔁 0 💬 1 📌 0

Just finished building @rootly.com MCP server: go from incident to resolution in under a minute. ⏱️

-Plug it into your IDE
-Import an incident in Cursor’s chat
-Cursors investigate the issue based on the metadata
-Cursors suggest a fix, review, and save

github.com/Rootly-AI-La...

19.03.2025 16:34 — 👍 0 🔁 0 💬 0 📌 0

☕️ Or just meet for a coffee; DM me 😊

06.03.2025 15:24 — 👍 0 🔁 0 💬 0 📌 0

🎤 Interview guests wanted: speak about your favorite AI tool.
scheduler.default.com/7992/member/...

06.03.2025 15:24 — 👍 0 🔁 0 💬 1 📌 0

🚀 Code to Clarity: The Future of Monitoring, Observability, and Reliability · Luma What’s the Vibe? Monitoring and observability are evolving—are your systems keeping up? Join us for an invite-only, off-the-record gathering of engineering…

🔭 Join our Code to Clarity event: The Future of Monitoring, Observability, and Reliability with our friends at Checkly & @coralogix.bsky.social
lu.ma/fhl522f4

06.03.2025 15:24 — 👍 2 🔁 1 💬 1 📌 0

🎮 SRECon Arcade Happy Hour - Presented by Rootly x r/SRE x Sentry x Stanza x Cortex 🍻 · Luma What’s the Vibe? This is THE afterparty for everyone at SRECon. Whether you’re an SRE, platform engineer, or just passionate about reliability, you’re invited…

Are you going to #SREcon Americas? I’ll be there with @rootly.com, let’s meet! (4 ways)

🕹️ Join our SRECon Arcade Happy Hour with our friends
@sentry.io, Stanza, and Cortex
lu.ma/hid3pwq4

06.03.2025 15:24 — 👍 0 🔁 0 💬 1 📌 0

Rootly Roundtable: The State of AI in Incident Management · Zoom · Luma Join us for a Rootly Roundtable on Thursday, March 6th, at 12:00 pm ET. Rootly Roundtables are exclusive, invite-only discussions that bring together the best…

Join me for the next @rootly.com Roundtable to discuss AI in Incident Management.

Note: we won’t share a video recording of the event, like Las Vegas: what happens at the roundtable stays at the roundtable 😉
lu.ma/march_rootly...

04.03.2025 16:31 — 👍 0 🔁 0 💬 0 📌 0

📧 We are hiring across the board and are looking for contractors for the AI Lab – shoot me a DM if you are interested!

19.02.2025 20:07 — 👍 0 🔁 0 💬 0 📌 0

💡 The AI Lab mission is to leverage AI to improve incident management and systems operations. We’ll be building POCs, open-sourcing tools, and benchmarking models.

19.02.2025 20:07 — 👍 0 🔁 0 💬 1 📌 0

👨‍💻Joinly Rootly feels like the perfect next step. My career has always been about SREs—I worked as one, trained them, and helped startups engage with them.

19.02.2025 20:07 — 👍 0 🔁 0 💬 1 📌 0

Rootly | Classifying Error Logs with AI: Can DeepSeek R1 Outperform GPT-4o and Llama 3? Sylvain Kalache | Can a smaller AI model outperform a larger one? A distilled version of DeepSeek R1 (70B) outperformed Llama and nearly matched GPT-4o in classifying error logs. These results sugges...

🔥 I’ve joined @rootly.com, where I will lead developer relations and the AI Lab.

📈 My first project was a hackathon distilling DeepSeek R1 and proving it could outperform GPT-4o and Llama 3 on system log analysis

Read more 👇
rootly.com/blog/classif...

19.02.2025 20:07 — 👍 1 🔁 0 💬 1 📌 0

Obviously, @MistralAI promoted how good Le Chat is at finding food pairings for wine 🇫🇷.

That should be included in all model benchmarks.

07.02.2025 17:18 — 👍 0 🔁 0 💬 0 📌 0

The founders of SlideShare are making sharing docs more social with their new platform, Jaunt | TechCrunch When SlideShare, the presentation-sharing service acquired by Scribd, launched in 2006, generative AI technology was significantly less advanced than it New social platform Jaunt allows users to uploa...

SlideShare's founders are at it again with
@jaunthq.bsky.social, a document-based social site to read, share & post.

Congratulations @jboutelle.bsky.social, @rashmi.bsky.social & @amitranjan.bsky.social on the launch 🚀
techcrunch.com/2024/12/12/t...

12.12.2024 15:04 — 👍 1 🔁 1 💬 0 📌 0

Flux: GitOps Delivery Solution Flux is a GitOps tool that solves continuous delivery at scale for Kubernetes users with a focus on supply chain security. It is a collection of tools that r...

Heard about Flux? The incubating @cncf.bsky.social project simplifies continuous delivery for K8s and strengthens supply chain security.

In this episode of @thelandscape.bsky.social, Flux maintainer @stefanprodan.com shares his favorite feature and more

03.12.2024 22:51 — 👍 2 🔁 1 💬 0 📌 0

Headlamp: Extensible multi-cluster Kubernetes user interface The CNCF project Headlamp is a versatile, user-friendly UI for managing Kubernetes clusters. Bart and Sylvain speak to Joaquim Rocha, and they cover Headlamp...

In this episode with @joaquimrocha.com, we speak about Headlamp.

The sandboxed @cncf.bsky.social project provides a powerful and flexible UI for Kubernetes.🚀

Watch the full episode 👇

03.12.2024 17:18 — 👍 2 🔁 1 💬 0 📌 0

Perses, a sandboxed @cncf.bsky.social porject, provides standards for visualization and dashboards for metrics monitoring.

@schabell.org is sharing everything you need to know about the project

02.12.2024 22:51 — 👍 1 🔁 0 💬 0 📌 0

Looking for where to store your AI assets? Harbor - the @cncf.bsky.social incubating project - might be what you are looking for.

Learn more from Harbor maintainer Vadim Bauer by watching the full episode 👇

02.12.2024 17:18 — 👍 0 🔁 0 💬 0 📌 0

View from Hawaii Diamond Head

14.11.2024 16:56 — 👍 3 🔁 0 💬 0 📌 0

Sylvain Kalache

Latest posts by sylvainkalache.bsky.social on Bluesky

@sylvainkalache is following 14 prominent accounts