Jan Kaul

@jankaul.bsky.social

70 Followers  |  89 Following  |  87 Posts  |  Joined: 19.11.2023

Posts by Jan Kaul (@jankaul.bsky.social)

So cool, thanks. I didn't know there were so many apps using the AT Protocol.

22.12.2025 14:34 | 👍 2    🔁 0    💬 0    📌 0
Materialized View Spec by JanKaul · Pull Request #11041 · apache/iceberg This PR implements the Iceberg Materialized View Proposal #10043 by adding a section for Materialized Views to the View spec. It follows the design of the proposal document. The idea is to resolve ...

Hopefully, we'll soon have Iceberg Materialized Views: github.com/apache/icebe...

They would be the perfect destination for the push queries.

07.12.2025 18:00 | 👍 3    🔁 0    💬 0    📌 0

That's awesome. I didn't know that.

28.11.2025 09:17 | 👍 1    🔁 0    💬 0    📌 0

I always think of the Iceberg metadata as a two-level, tree-based multidimensional spatial index. How you construct the tree is up to you.

It's just that Iceberg was designed for really huge tables and they decided to split the metadata over multiple files.

19.11.2025 18:51 | 👍 0    🔁 0    💬 0    📌 0

I find The Data Warehouse Toolkit and Designing Data-Intensive Applications both incredibly good.

There will be a second edition of DDIA soon!

02.11.2025 12:11 | 👍 1    🔁 0    💬 1    📌 0

Too bad, doesn't look like there will be an online recording.

I'd love to hear your take on columnar formats for AI and whether we can evolve Parquet accordingly.

23.10.2025 05:09 | 👍 0    🔁 0    💬 0    📌 0

At long last, @chris.blue and I have submitted the final manuscript of Designing Data-Intensive Applications, second edition, to the publisher. There is always more that could be improved but at some point we just have to call it done. Now it goes into production; probably shipping in ~4 months.

20.10.2025 19:54 | 👍 309    🔁 23    💬 12    📌 7

I can't wait!

21.10.2025 05:31 | 👍 2    🔁 0    💬 0    📌 0

Looks like I'll eventually have to give Omarchy a try too.

20.10.2025 08:12 | 👍 1    🔁 0    💬 1    📌 0
CPU frequency scaling - ArchWiki

Regarding the battery life, do you have some kind of CPU scaling installed?

One bad thing about Arch Linux is that it comes with almost no defaults installed.

I installed power-profiles-daemon and it helped a lot with the battery.

wiki.archlinux.org/title/CPU_fr...

20.10.2025 05:32 | 👍 0    🔁 0    💬 1    📌 0

Yeah, classic. I should also do a service sometime, but something probably has to happen first ;)

14.10.2025 08:05 | 👍 1    🔁 0    💬 1    📌 0

It's a really difficult question, because in the end you need to pay developers to build the software.

I just feel that many of those companies get really greedy and want to raise a lot of money. And once you've raised a lot, you're forced to deliver bigger returns that aren't possible with OSS.

14.10.2025 06:05 | 👍 2    🔁 0    💬 1    📌 0

I think at some point you will need columnar storage to store the list of all table formats.

14.10.2025 05:59 | 👍 1    🔁 0    💬 0    📌 0

Uff, that's rough. Did you have the stuff to fix it? Or did you have to do the walk of shame?

14.10.2025 05:54 | 👍 0    🔁 0    💬 1    📌 0

As always, great article!

09.10.2025 05:54 | 👍 1    🔁 0    💬 0    📌 0

Great work! Really cool stuff.

08.10.2025 05:23 | 👍 0    🔁 0    💬 1    📌 0

Sounds really cool! I'll try to make the journey to Nürnberg next week.

07.10.2025 08:27 | 👍 1    🔁 0    💬 0    📌 0

Yes, you can remove both. I tested it on Android.

22.03.2025 18:08 | 👍 0    🔁 0    💬 0    📌 0

Same here. The design is so awesome.

I also love the "Unix"-style interaction, which makes it compose so well with other tools.

26.02.2025 12:43 | 👍 4    🔁 0    💬 0    📌 0
Aider in your IDE Aider can watch your files and respond to AI comments you add in your favorite IDE or text editor.

I tried Aider and Claude Code. Their approaches are very similar, but Claude Code feels much more powerful: it's really good at pulling in additional context as it works, while Aider only gathers context up front.

The only thing that's missing from Claude Code is AI comments: aider.chat/docs/usage/w...

26.02.2025 12:37 | 👍 3    🔁 0    💬 0    📌 0

Awesome post, as always!

One thing I realized lately is that authentication should be standardized as part of the Iceberg REST catalog (for example, via an OIDC endpoint).

Otherwise every vendor has its own authentication and only its own client will know how to authenticate.

21.02.2025 07:03 | 👍 1    🔁 0    💬 1    📌 0

Claude is actually getting pretty good at coding.

14.02.2025 13:08 | 👍 1    🔁 0    💬 1    📌 0

Did you find a good AI assistant for vim?

13.02.2025 15:54 | 👍 0    🔁 0    💬 1    📌 0
How might AI change programming? It will change it. But how?

@thorstenball.com is contemplating the same thing: [register spill](registerspill.thorstenball.com/p/how-might-...)

It might be CONTEXT.md

31.01.2025 12:18 | 👍 2    🔁 0    💬 0    📌 0

Looking forward to it!

22.01.2025 17:06 | 👍 0    🔁 0    💬 0    📌 0

Sadly, this book is overlooked by too many people. It's a must-read if you're in data.
I wish there was an ebook version.

19.01.2025 09:33 | 👍 0    🔁 0    💬 0    📌 0

You're right, I wasn't entirely clear.

18.12.2024 22:24 | 👍 1    🔁 0    💬 0    📌 0

Well, Iceberg makes this metadata available at a higher level, in the manifest list and manifest files, which means you don't have to read all the Parquet files.

18.12.2024 14:58 | 👍 0    🔁 0    💬 1    📌 0
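
A rough sketch of the idea in the post above, in Python: bounds kept in the manifest list and the manifests let a planner skip whole groups of files without opening any Parquet file. Everything here is an illustrative assumption, not the Iceberg API: the `DataFile` and `Manifest` classes, the `plan_scan` function, the file names, and the use of a single integer column are all hypothetical, and the manifest-level bounds are simply derived from the files rather than read from real metadata.

```python
from dataclasses import dataclass


@dataclass
class DataFile:
    path: str
    lower: int  # smallest value of the filter column in this file
    upper: int  # largest value of the filter column in this file


@dataclass
class Manifest:
    path: str
    files: list[DataFile]

    @property
    def lower(self) -> int:
        # Summary bound for the whole manifest (first level of the tree).
        return min(f.lower for f in self.files)

    @property
    def upper(self) -> int:
        return max(f.upper for f in self.files)


def overlaps(lower: int, upper: int, lo: int, hi: int) -> bool:
    # True if the closed ranges [lower, upper] and [lo, hi] intersect.
    return lower <= hi and upper >= lo


def plan_scan(manifest_list: list[Manifest], lo: int, hi: int) -> list[str]:
    """Return the data files that may contain values in [lo, hi]."""
    selected = []
    for manifest in manifest_list:
        # Level 1: skip a whole manifest using only its summary bounds.
        if not overlaps(manifest.lower, manifest.upper, lo, hi):
            continue
        # Level 2: check the per-file bounds inside the manifest.
        for data_file in manifest.files:
            if overlaps(data_file.lower, data_file.upper, lo, hi):
                selected.append(data_file.path)
    return selected


# Hypothetical table with two manifests and four data files.
manifest_list = [
    Manifest("m1.avro", [DataFile("a.parquet", 0, 99), DataFile("b.parquet", 100, 199)]),
    Manifest("m2.avro", [DataFile("c.parquet", 200, 299), DataFile("d.parquet", 300, 399)]),
]

print(plan_scan(manifest_list, 150, 250))  # ['b.parquet', 'c.parquet']
```

Only the files whose bounds overlap the filter range end up in the scan plan, and none of the Parquet files had to be opened to decide that.
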

Every commercial data warehouse stores additional metadata like upper & lower bounds, statistics, and distinct counts on top of the actual data files to assist the query optimizer.
Iceberg is an open standard for this kind of metadata and provides speedups over plain Parquet.

18.12.2024 07:02 | 👍 0    🔁 0    💬 1    📌 0

Well, ideally it would be complete read and write support.

17.12.2024 12:35 | 👍 0    🔁 0    💬 2    📌 0