Tom Cooper's Avatar

Tom Cooper

@tomcooper.dev.bsky.social

Senior Principal Engineer at IBM, Ex-Red Hat, Ex-Reddit, Ex-Twitter, Distributed Systems PhD, interested in all things stream processing, also cycling

201 Followers  |  129 Following  |  46 Posts  |  Joined: 01.12.2023  |  2.1047

Latest posts by tomcooper.dev on Bluesky

Phased upgrades of Strimzi managed Kafka fleets
YouTube video by Strimzi Phased upgrades of Strimzi managed Kafka fleets

A bit late, but the recording of my #StrimziCon2025 talk, Phased Upgrades of Strimzi Managed Kafka Fleets (hows that for a snappy title) is now up on Youtube: youtu.be/pEuYFFkQCUU?...

07.07.2025 10:33 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

It's my last day at Red Hat. Tomorrow I & the rest of middleware move to IBM. I wanted to work at RH since I heard about Fedora on the Linux Action Show (RIP) decades ago & I've loved developing in the open with RH. That'll continue after the move & after all, no one got fired for buying IBM...

30.06.2025 16:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

🐊 Kroxylicious Proposal: "Routing API"

This looks like a very useful addition to @kroxylicious.io (wire-protocol proxy for Apache Kafka): dynamic routing of requests to different upstream clusters, enabling use cases such as union clusters and client-based routing.

πŸ‘‰ github.com/kroxylicious...

14.06.2025 09:26 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 0    πŸ“Œ 1
Taming the Kafka Chaos: How Strimzi Helped Us Scale with Confidence Strimzi provides a way to run an Apache Kafka cluster on Kubernetes in various deployment configurations.

Going from 1 to 13 PB/day 🀯 thanks to moving from a self-managed to a Strimzi-managed Kafka cluster! Great to hear these stories of how Strimzi not only made life easier for infra teams but improved the service they provide!

strimzi.io/blog/2025/06...

09.06.2025 13:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It was a really great talk, you covered a lot in 15mins! I am keen to work on an OSS library for embeddings wrapping Langchain4j so may ping you for opinions/reviews if you have time?

21.05.2025 20:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Luckily we have 2 people on the Strimzi team who literally wrote the book on Kafka Connect and Strimiz could do something (TBD) to help.

21.05.2025 16:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Mostly about how everyone wants DR, try MM2, bounce off it and then end up with some very unique setups and trade offs. TL;DR MM2 (and connect more generally) is hard to get started with and has some sharp edges.

21.05.2025 16:09 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"Process" not "Polymorphic" πŸ€¦β€β™‚οΈ

21.05.2025 14:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Ending #Current25 with a talk on Polymorphic Table Functions in Flink🀘

21.05.2025 13:57 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Great hallway track at #Current25, lots of ideas around Kafka connect, mirror maker, geo-replication, cluster balancing. Lots to mull over on the train home...

21.05.2025 13:14 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"Vibe-coded database schemas" 😱

21.05.2025 13:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

So nice to get a deep technical deep dive on a subject. They did a great job of explaining the migration process. If you want more detail we have blog posts:

21.05.2025 12:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Woooo πŸŽ‰ great to see Strimzi maintainers Kate Stanley and Paolo Patierno on stage at #Current25 talking about how Strimzi helps you migrate your K8s Kafka clusters from Zookeeper to KRaft

21.05.2025 12:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

After a very interesting lunch time at #Current25 speaking to folks from OpenAI about Mirror Maker and what we could do in Strimzi to make things easier, I'm watching the Flink AMA MC'd by @rmetzger.bsky.social

21.05.2025 11:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

My ⚑ talk choice at #Current25 is @hpgrahsl.bsky.social and vector stores in Flink

21.05.2025 10:01 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Ok so the main thrust is actually about removing Kafka from the mix & what you need to do in Iceberg to emulate topic compaction. A really interesting custom solution due to missing features in the Flink Iceberg connector. The headline, it works, it doesn't need Kafka but it's not as fast #Current25

21.05.2025 09:36 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Onto something a bit more concrete now at #Current25. Improving CDC performance using Kafka, Flink and Iceberg

21.05.2025 09:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Kicking off day 2 of #Current25 by boarding the AI agent trainπŸ€–πŸš‚ with MongoDB 🌿

21.05.2025 08:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Wooop Open Source Datastack FTW! Pinterest moving from an in-house to OSS based architecture! #Current25

20.05.2025 16:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Final talk of the day at #Current25, Async processing with Flink SQL

20.05.2025 16:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Great to hear that OpenAI have identified and fixed several issues with PyFlink and intend to push those fixes upstream πŸ‘πŸ‘πŸ‘

20.05.2025 15:50 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

It's PyFlink for all the things and OpenAI uses the upstream Flink Kubernetes Operator

20.05.2025 15:41 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Learning about how OpenAI built its stream processing system using Kafka and Flink at #Current25

20.05.2025 15:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This was a very interesting talk full of intriguing ideas. However, there was a lot of "you _could_ do <complicated thing> and then do <even more complicated thing> based on that". I would love to see some real life demonstrations of all these agents linked together doing what we expect.

20.05.2025 15:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

This talk has some really great examples of Flink's complex event processing (CEP) functionality in Flink SQL

20.05.2025 14:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Great to see a push to enhance upstream Flink SQL with addional AI orientated functions

20.05.2025 14:43 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Next up at #Current25 Flink 🐿️ but with robots πŸ€– (AI agents)!

20.05.2025 14:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Next at #Current25 a panel discussion on the past, present and future of @kafkastreams.bsky.social. I love the background...the likeness is uncanny...

20.05.2025 13:17 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

Starting #Current25 with an overview of anomaly detection in Flink SQL. This is from Confluent and AFAICS uses their proprietary ML functionality. But there is work in the upstream to support the same that builds on FLIP-437 which added the model syntax to Flink SQL

cwiki.apache.org/confluence/d...

20.05.2025 12:06 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Ready to start another Kafka and Flink filled couple of days in London at #Current25

Looking forward to hearing about stream processing with Flink, integrating with open table formats and this newfangled AI thing...

20.05.2025 08:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@tomcooper.dev is following 20 prominent accounts