Sylvain Kalache's Avatar

Sylvain Kalache

@sylvainkalache.bsky.social

Leading the AI Labs @rootly.com - Former LinkedIn SRE and Founder of Holberton School

40 Followers  |  14 Following  |  24 Posts  |  Joined: 14.11.2024  |  1.7118

Latest posts by sylvainkalache.bsky.social on Bluesky

Preview
Will LLMs and Vibe Coding Fuel a Developer Renaissance? While the actual usefulness of LLMs is still debated, one thing is certain: engineers are being asked to do more with less.

While the actual usefulness of LLMs is still debated, one thing is certain: engineers are being asked to do more with less.

By @sylvainkalache.bsky.social

14.07.2025 11:30 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Rootly Roundtable: From Weak Signals to Confident Fixes ยท Zoom ยท Luma This roundtable will explore best practices for filtering weak signals from alert noise, enriching alerts with automated ownership context, and streamliningโ€ฆ

Join us for a roundtable conversation "From Weak Signals to Confident Fixes"

Weโ€™ll tackle everything from cutting alert noise to discussing underrated and overrated alert signals, as well as bug validation workflow before triggering an incident.

Happening tomorrow at 12PM ET lu.ma/aq5mp94m

17.06.2025 19:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Thank you for having me ๐Ÿ™Œ

17.06.2025 19:14 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Automate Models Training: An MLOps Pipeline with Tekton and Buildpacks | Towards Data Science A step-by-step guide to containerizing and orchestrating an ML training workflow without the Dockerfile headache, using a lightweight GPT-2 example.

MLOps got you down? Sick of wrestling with Dockerfiles?

@sylvainkalache.bsky.social unpacks a streamlined approach to MLOps, showing you how to automate your training pipeline with a clean, reproducible, and cloud-native workflow.

11.06.2025 00:06 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Is AI-assisted coding an incident magnet? Now that AI-assisted code is making its way into systems, should we be worried about how it affects SRE?

๐Ÿงฒ Microsoft and Google report that AI writes 30% of their code. Is AI-assisted coding becoming an incident magnet?

Are SREs about to be overwhelmed with the volume and complexity of incidents?

Thatโ€™s what I explore in my latest for @leaddev.com leaddev.com/software-qua...

19.05.2025 20:32 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

SREs: not all traffic drops are outages. Sometimes itโ€™s Diwali. Or the World Cup.

@rootly.com's digging into that with LLMs- check out what their Head of Devrel @sylvainkalache.bsky.social had to say about it at KubeCon.

08.04.2025 15:23 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Rootly | Llama 4 underperforms: a benchmark against coding-centric models Rootly AI Labs analyzes the performance of Metaโ€™s Llama 4 models and finds they underperform compared to competitors like Claude 3.5 Sonnet and Qwen2.5

3๏ธโƒฃ An older version โ€“ Llama 3.3 70B-Versatile โ€“ performed even better than Llama 4 Maverick.

The benchmark โ€“ designed by the
@rootly.com AI Labs โ€“ tests models' ability to pick the correct pull request for a given bug description. The full findings ๐Ÿ‘‰ rootly.com/blog/llama-4...

14.04.2025 16:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

2๏ธโƒฃ Second, we wanted to test it against models tailored for coding tasks. Unsurprisingly, it performs way under those. Llama 4 Maverick achieved only a 70% accuracy score. Alibabaโ€™s Qwen2.5-Coder-32B is ranking the best at (90%), closely followed by GPT o3-mini (89%).

14.04.2025 16:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

1๏ธโƒฃ First, we wanted to reproduce Meta's findings that Llama 4 outperformed GPT-4o, Gemini 2.0 Flash, and DeepSeek v3.1โ€”we found the exact opposite.

It came last, 6% less than the next best-performing model (DeepSeek) and 18% behind the overall top-performing model (GPT-4o).

14.04.2025 16:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

There's been a lot of controversy with the launch of Llama 4 and its performance. So we decided to do our own benchmark, and here is what we found:

14.04.2025 16:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Just finished building @rootly.com MCP server: go from incident to resolution in under a minute. โฑ๏ธ

-Plug it into your IDE
-Import an incident in Cursorโ€™s chat
-Cursors investigate the issue based on the metadata
-Cursors suggest a fix, review, and save

github.com/Rootly-AI-La...

19.03.2025 16:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

โ˜•๏ธ Or just meet for a coffee; DM me ๐Ÿ˜Š

06.03.2025 15:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐ŸŽค Interview guests wanted: speak about your favorite AI tool.
scheduler.default.com/7992/member/...

06.03.2025 15:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
๐Ÿš€ Code to Clarity: The Future of Monitoring, Observability, and Reliability ยท Luma Whatโ€™s the Vibe? Monitoring and observability are evolvingโ€”are your systems keeping up? Join us for an invite-only, off-the-record gathering of engineeringโ€ฆ

๐Ÿ”ญ Join our Code to Clarity event: The Future of Monitoring, Observability, and Reliability with our friends at Checkly & @coralogix.bsky.social
lu.ma/fhl522f4

06.03.2025 15:24 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
๐ŸŽฎ SRECon Arcade Happy Hour - Presented by Rootly x r/SRE x Sentry x Stanza x Cortex ๐Ÿป ยท Luma Whatโ€™s the Vibe? This is THE afterparty for everyone at SRECon. Whether youโ€™re an SRE, platform engineer, or just passionate about reliability, youโ€™re invitedโ€ฆ

Are you going to #SREcon Americas? Iโ€™ll be there with @rootly.com, letโ€™s meet! (4 ways)

๐Ÿ•น๏ธ Join our SRECon Arcade Happy Hour with our friends
@sentry.io, Stanza, and Cortex
lu.ma/hid3pwq4

06.03.2025 15:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Rootly Roundtable: The State of AI in Incident Management ยท Zoom ยท Luma Join us for a Rootly Roundtable on Thursday, March 6th, at 12:00 pm ET. Rootly Roundtables are exclusive, invite-only discussions that bring together the bestโ€ฆ

Join me for the next @rootly.com Roundtable to discuss AI in Incident Management.

Note: we wonโ€™t share a video recording of the event, like Las Vegas: what happens at the roundtable stays at the roundtable ๐Ÿ˜‰
lu.ma/march_rootly...

04.03.2025 16:31 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ“ง We are hiring across the board and are looking for contractors for the AI Lab โ€“ shoot me a DM if you are interested!

19.02.2025 20:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ’ก The AI Lab mission is to leverage AI to improve incident management and systems operations. Weโ€™ll be building POCs, open-sourcing tools, and benchmarking models.

19.02.2025 20:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ‘จโ€๐Ÿ’ปJoinly Rootly feels like the perfect next step. My career has always been about SREsโ€”I worked as one, trained them, and helped startups engage with them.

19.02.2025 20:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Rootly | Classifying Error Logs with AI: Can DeepSeek R1 Outperform GPT-4o and Llama 3? Sylvain Kalache | Can a smaller AI model outperform a larger one? A distilled version of DeepSeek R1 (70B) outperformed Llama and nearly matched GPT-4o in classifying error logs. These results sugges...

๐Ÿ”ฅ Iโ€™ve joined @rootly.com, where I will lead developer relations and the AI Lab.

๐Ÿ“ˆ My first project was a hackathon distilling DeepSeek R1 and proving it could outperform GPT-4o and Llama 3 on system log analysis

Read more ๐Ÿ‘‡
rootly.com/blog/classif...

19.02.2025 20:07 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Obviously, @MistralAI promoted how good Le Chat is at finding food pairings for wine ๐Ÿ‡ซ๐Ÿ‡ท.

That should be included in all model benchmarks.

07.02.2025 17:18 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
The founders of SlideShare are making sharing docs more social with their new platform, Jaunt | TechCrunch When SlideShare, the presentation-sharing service acquired by Scribd, launched in 2006, generative AI technology was significantly less advanced than it New social platform Jaunt allows users to uploa...

SlideShare's founders are at it again with
@jaunthq.bsky.social, a document-based social site to read, share & post.

Congratulations @jboutelle.bsky.social, @rashmi.bsky.social & @amitranjan.bsky.social on the launch ๐Ÿš€
techcrunch.com/2024/12/12/t...

12.12.2024 15:04 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Flux: GitOps Delivery Solution Flux is a GitOps tool that solves continuous delivery at scale for Kubernetes users with a focus on supply chain security. It is a collection of tools that r...

Heard about Flux? The incubating @cncf.bsky.social project simplifies continuous delivery for K8s and strengthens supply chain security.

In this episode of @thelandscape.bsky.social, Flux maintainer @stefanprodan.com shares his favorite feature and more

03.12.2024 22:51 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Headlamp: Extensible multi-cluster Kubernetes user interface The CNCF project Headlamp is a versatile, user-friendly UI for managing Kubernetes clusters. Bart and Sylvain speak to Joaquim Rocha, and they cover Headlamp...

In this episode with @joaquimrocha.com, we speak about Headlamp.

The sandboxed @cncf.bsky.social project provides a powerful and flexible UI for Kubernetes.๐Ÿš€

Watch the full episode ๐Ÿ‘‡

03.12.2024 17:18 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Perses, a sandboxed @cncf.bsky.social porject, provides standards for visualization and dashboards for metrics monitoring.

@schabell.org is sharing everything you need to know about the project

02.12.2024 22:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Looking for where to store your AI assets? Harbor - the @cncf.bsky.social incubating project - might be what you are looking for.

Learn more from Harbor maintainer Vadim Bauer by watching the full episode ๐Ÿ‘‡

02.12.2024 17:18 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

View from Hawaii Diamond Head

14.11.2024 16:56 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@sylvainkalache is following 14 prominent accounts