So cool, thanks. I didn't know there were so many apps using the AT Protocol.
22.12.2025 14:34
Hopefully, we'll soon have Iceberg Materialized Views: github.com/apache/icebe...
They would be the perfect destination for the push queries.
That's awesome. I didn't know that.
28.11.2025 09:17
I always think of the Iceberg metadata as a two-level, tree-based multidimensional spatial index. How you construct the tree is up to you.
It's just that Iceberg was designed for really huge tables and they decided to split the metadata over multiple files.
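To make the "two-level tree" picture concrete, here is a rough Python sketch. It is a simplification, not the actual Iceberg file format: the class names, the `overlaps` and `plan_scan` helpers, and the sample files are all made up for illustration. The top level stands in for the manifest list, the second level for the manifests, and each node carries per-column lower and upper bounds that act like a bounding box.

```python
from dataclasses import dataclass

# column name -> (lower bound, upper bound); a hypothetical, simplified shape
Bounds = dict[str, tuple[float, float]]


@dataclass
class DataFile:
    path: str
    bounds: Bounds  # per-column bounds of the rows in this file


@dataclass
class Manifest:
    path: str
    bounds: Bounds  # summary bounds covering every file listed below
    files: list[DataFile]


def overlaps(bounds: Bounds, query: Bounds) -> bool:
    """Return False only if some queried column range cannot intersect."""
    for column, (q_lo, q_hi) in query.items():
        if column in bounds:
            lo, hi = bounds[column]
            if hi < q_lo or lo > q_hi:
                return False
    return True


def plan_scan(manifest_list: list[Manifest], query: Bounds) -> list[str]:
    """Walk both levels and keep only the files that might match."""
    selected = []
    for manifest in manifest_list:
        if not overlaps(manifest.bounds, query):
            continue  # level 1: skip the whole manifest without opening it
        for data_file in manifest.files:
            if overlaps(data_file.bounds, query):
                selected.append(data_file.path)  # level 2: per-file pruning
    return selected


# Made-up sample table with two manifests and three data files.
manifest_list = [
    Manifest(
        "m1.avro",
        {"day": (1, 10), "price": (0, 100)},
        [
            DataFile("a.parquet", {"day": (1, 5), "price": (0, 50)}),
            DataFile("b.parquet", {"day": (6, 10), "price": (40, 100)}),
        ],
    ),
    Manifest(
        "m2.avro",
        {"day": (11, 20), "price": (0, 100)},
        [DataFile("c.parquet", {"day": (11, 20), "price": (0, 100)})],
    ),
]

query = {"day": (4, 7), "price": (60, 80)}
print(plan_scan(manifest_list, query))
```

With this sample data the second manifest is skipped on the `day` range alone, and `a.parquet` is dropped on `price`, so only one file is left to read. How the files are grouped into manifests decides how much the first level can prune, which is the "up to you" part.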
I find The Data Warehouse Toolkit and Designing Data-Intensive Applications both incredibly good.
There will be a second edition of DDIA soon!
Too bad, doesn't look like there will be an online recording.
I'd love to hear your take on columnar formats for AI and whether we can evolve Parquet accordingly.
At long last, @chris.blue and I have submitted the final manuscript of Designing Data-Intensive Applications, second edition, to the publisher. There is always more that could be improved but at some point we just have to call it done. Now it goes into production; probably shipping in ~4 months.
20.10.2025 19:54
I can't wait!
21.10.2025 05:31
Looks like I'll eventually have to give Omarchy a try too.
20.10.2025 08:12
Regarding the battery life, do you have some kind of CPU scaling installed?
One bad thing about Arch Linux is that it comes with almost no defaults installed.
I installed power-profiles-daemon and it helped a lot with the battery.
wiki.archlinux.org/title/CPU_fr...
Yeah classic, I should also do a service sometime. But probably something also has to happen first ;)
14.10.2025 08:05
It's a really difficult question, because in the end you need to pay developers to build the software.
I just feel that many of those companies get really greedy and want to raise a lot of money. And once you've raised a lot, you're forced to bring in bigger returns that aren't possible with OSS.
I think at some point you will need columnar storage to store the list of all table formats.
14.10.2025 05:59
Uff, that's rough. Did you have the stuff to fix it? Or did you have to do the walk of shame?
14.10.2025 05:54
As always, great article!
09.10.2025 05:54
Great work! Really cool stuff.
08.10.2025 05:23
Sounds really cool! I'll try to make the journey to Nürnberg next week.
07.10.2025 08:27
Yes, you can remove both. I tested it on Android.
22.03.2025 18:08
Same here. The design is so awesome.
Also love the "Unix" style interaction which makes it compose so well with other tools.
I tried Aider and Claude Code. Their approaches are very similar, but Claude Code feels much more powerful. It's really great at gathering additional context during the process, while Aider only gets it beforehand.
The only thing that's missing from Claude Code is AI comments: aider.chat/docs/usage/w...
Awesome post, as always!
One thing I realized lately is that the authentication should be standardized with the Iceberg REST catalog (like an OIDC endpoint).
Otherwise every vendor has their own authentication and only their client will know how to authenticate.
Claude is actually getting pretty good at coding.
14.02.2025 13:08
Did you find a good AI assistant for Vim?
13.02.2025 15:54
@thorstenball.com is contemplating the same thing: [register spill](registerspill.thorstenball.com/p/how-might-...)
It might be CONTEXT.md
Looking forward!
22.01.2025 17:06
Sadly, this book is overlooked by too many people. It's a must-read if you're in data.
I wish there was an ebook version.
You're right, I wasn't entirely clear
18.12.2024 22:24
Well, Iceberg makes this metadata available at a higher level, in the manifest list and manifest files, which means that you don't have to read all the Parquet files.
18.12.2024 14:58
Every commercial data warehouse stores additional metadata like upper & lower bounds, statistics, and distinct counts on top of the actual data files to assist the query optimizer.
Iceberg is an open standard for this kind of metadata and provides speedups over plain Parquet.
Well, ideally it would be complete read and write support.
17.12.2024 12:35