Which will be proprietary platform stuff I assume, so not going into the Iceberg spec? Is there any chance of caching layers also being open and standardised?
09.10.2025 05:34
@vanlightly.bsky.social
Researcher, advisor, writer, formal verification eng @ Confluent. Everything data (dist sys, databases, messaging, data eng/analytics). https://jack-vanlightly.com, https://www.hotds.dev Credit: ESO/B. Tafresh
Why don't Iceberg or Delta Lake have secondary indexes?
Because analytics workloads and OLTP workloads optimize for opposite I/O patterns.
See my dive into data layout, pruning, and what "indexing" really means in open table formats: jack-vanlightly.com/blog/2025/10...
New deep dive: Understanding Apache Fluss
I spent August reverse-engineering Fluss, Alibaba's new table storage engine for Flink (partially forked from Kafka). This post covers its architecture, tiering, and how it tackles changelogs & low-latency state.
jack-vanlightly.com/blog/2025/9/...
New blog post: A Conceptual Model for Storage Unification.
The post defines what storage unification means, establishes terminology, and evaluates different building blocks and approaches to achieving it.
jack-vanlightly.com/blog/2025/8/...
In a future of autonomous AI agents, we can't limit ourselves to error prevention and error detection; we must also include remediation.
jack-vanlightly.com/blog/2025/7/...
Ha! The Blondlot example is fascinating. Sometimes you can fail so spectacularly that they have to invent new math controls just to prevent future people from making a mistake as bad as yours. That's a special kind of immortality.
22.07.2025 17:53
Science moves slowly because wrong theories waste decades. Engineering is careful because failures kill people. Software moves fast because mistakes are cheap, the expensive error isn't making the wrong choice, it's taking too long to make any choice. jack-vanlightly.com/blog/2025/7/...
22.07.2025 15:08
But no, I was not offended. Keep on writing it how you see it!
16.07.2025 11:57
He could easily have framed it in a more positive way. But either way, I don't think it matters too much, your readers are intelligent and can benefit from both, and see the criticism from a more positive angle (picking the bits they like from both).
16.07.2025 11:57
Your post was valid from that scale. Winston's response took issue with it because he's been thinking very deeply at the macro-scale of power systems that run our entire planet. And I appreciate his views there. But, I think it was an unfair take out and needlessly combative.
16.07.2025 11:57
I think it's a matter of scale. For me, your post is from the position where you are at day-to-day, your subjective experience, helping run a company, pushing back on some of the BS you see in the industry (which we love!), all inextricably linked to your past.
16.07.2025 11:57
Where does reliability begin, and where does it end? In distributed business architectures, the answer is responsibility boundaries. New post: jack-vanlightly.com/blog/2025/7/...
15.07.2025 14:15
Next time, I'll agree it's Wednesday.
03.07.2025 20:16
ChatGPT thought it was Tuesday, so I made fun of it and it admitted it was Wednesday. So I made fun of it again, and it admitted it was...Wednesday. But sure, AI agents are gonna steal my job.
03.07.2025 16:20
Like how to write an if statement or loop in bash. I swear my brain is incapable of remembering that.
01.07.2025 08:28
It really makes me question how ready it is for autonomous agents. I'm still in the "I'll believe it when I see it" camp for AI agents.
24.06.2025 18:30
ChatGPT has hallucinated so many times for me today. It's invented scientific terms that don't exist, and has been quite liberal with plausible answers based on what sounds reasonable, but without any real-world justification. When challenged, it admits its mistake.
24.06.2025 18:30
At this point, I can't tell if coffee makes me feel better in the morning because I am in withdrawal, or it actually picks me up.
20.06.2025 08:48
My musical evolution continues, discovered deep hypnotic drone music today. No drugs required. The Hypnus Records label is great.
13.06.2025 14:33
The abstraction seems to make sense to people. And it isn't complicated either which is nice.
12.06.2025 06:21
How to reliably distribute work across microservices, stream processors, durable execution, event-driven, orchestration and now AI agents?
Coordinated Progress is a 4 part series that explores the common structure behind reliable distributed systems.
jack-vanlightly.com/blog/2025/6/...
I took a break from social media and my blog for a couple of months. ND burnout. But I'm tentatively back, probably just to post my writing here for now. HOTDS is on pause. Getting back to writing is therapeutic though. I'll post something this week that I've been working on.
09.06.2025 11:23
Agree, it's not the Hz but any kind of calming sound like pouring rain, or these slow atmospheric noises is what does it for me. It quietens my brain. In fact, since I discovered this, I haven't listened to music once. I only listen to these calming tracks now.
09.06.2025 11:20
My pleasure
04.04.2025 18:29
Another Humans of the Data Sphere is out, with issue 10! In this issue people are talking fsyncs, tips for running ClickHouse at scale, the problems with MCP and more. Plus I dig up a classic paper from 1962. www.hotds.dev/p/humans-of-...
04.04.2025 16:14
Just an oversight I guess.
03.04.2025 17:54
Proud to have contributed formal verification (TLA+) for three key improvements in Kafka 4.0:
KIP-966: Strengthens the replication protocol.
KIP-996: Introduces PreVote for more stable KRaft leadership.
KIP-848: Delivers more efficient, predictable rebalancing.
I just selected the Spotify Gamma Waves 40Hz playlist. Listen to it on good headphones for the immersion.
25.03.2025 14:49
Wow, I just discovered gamma wave music. Wrote non-stop for three hours.
25.03.2025 13:00