Peter Boncz's Avatar

Peter Boncz

@peterabcz.bsky.social

Professor Analytical Data Systems @cwi_da and @VUamsterdam. researcher, systems architect, educator, entrepreneur

306 Followers  |  15 Following  |  28 Posts  |  Joined: 17.11.2024  |  2.0977

Latest posts by peterabcz.bsky.social on Bluesky


Post image Post image Post image

DuckDB was added to the wall of fame of CWI, next to Dijkstra’s shortest path algorithm, the Atlantic Crossing of the Internet & the creation of Python

On the occasion of our 80th (really!) bday

Congrats to @hannes.muehleisen.org & @markraasveldt.bsky.social

@duckdb.org
@cwi-amsterdam.bsky.social

11.02.2026 23:40 β€” πŸ‘ 54    πŸ” 6    πŸ’¬ 3    πŸ“Œ 0
Post image

Azim had a stellar committee which included Daniel Lemire (TELUQ πŸ‡¨πŸ‡¦), Pinar TΓΆzΓΌn (ITU πŸ‡©πŸ‡°) and Viktor Leis (TUM πŸ‡©πŸ‡ͺ).

They gave talks at the Dutch Seminar on Data Systems Design (DSDSD) on SIMD-accelerated parsing, xNVMe and escaping from the insanity of SQL.

(video will be posted on dsdsd.da.cwi.nl)

09.01.2026 23:31 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Today Azim Afroozeh defended his PhD thesis on FastLanes, which earned him a Cum Laude distinction ("top 5%"πŸ‡³πŸ‡±research). Promotors Hannes MΓΌhleisen & me are very proud of his work!

thesis: research.vu.nl/files/447013431/afroozehphdthesis%20-%2069280d90f0bfd.pdf
FastLanes: github.com/cwida/FastLanes

09.01.2026 23:29 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

@duckdb.org 1.4.0 is feature-packed: MERGE INTO, compressed in-mem DBs, Iceberg writes..

PhD students also contributed:
- Laurens Kuiper: new k-way parallel mergesort duckdb.org/2025/09/24/sorting-again.html
- Lotte Felius @ccfelius.bsky.social: on-disk DB encryption
- Denis Hirn: materialized CTEs

26.09.2025 12:55 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Laurens Kuiper from @duckdb.org presented DuckDB's new memory assignment policy to run multi-join pipelines out-of-core with gracious performance degradation when join hash tables increasingly do not fit in RAM.

A well-attended and -delivered talk!

paper: vldb.org/pvldb/vol18/p2748-kuiper.pdf

04.09.2025 14:01 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Tobias Schmidt (TUM) @vldb.bsky.social at VLDB2025 presented SQLStorm, which uses LLMs to generate a huge amount of complex queries

SQLStorm now has 18K different complex queries and runs on a large real-world dataset (stackoverflow)

paper: vldb.org/pvldb/vol18/...
code: github.com/SQL-Storm/SQ...

04.09.2025 13:54 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Very honored to receive the @vldb.bsky.social 2025 Test of Time Award for the Join Order Benchmark (JOB)

Kudos to my very talented TUM co-authors, specifically Viktor Leis who was the driving force & gave a great award talk.

paper: www.vldb.org/pvldb/vol18/p5531-viktor.pdf
JOB: event.cwi.nl/da/job

03.09.2025 16:08 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image

Azim Afroozeh gave a great talk at @vldb.bsky.social VLDB2025 in London on the FastLanes file format.

FastLanes compresses 1.4x better than Parquet/snappy and allows 40x faster reads on the PublicBI dataset!

Paper: vldb.org/pvldb/vol18/p4629-afroozeh.pdf
Code: github.com/cwida/FastLanes

03.09.2025 15:59 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

@sigmod2025.bsky.social Berlin is a wrap. Many πŸ™ to the organizers!

Next stop is @vldb.bsky.social London to present
- github.com/cwida/FastLanes v0.1 of a new big data format
- spilling multi-operator joins (via @duckdb.org)
- the SQLStorm benchmark of 30k LLM-generated complex queries (via TUM)

27.06.2025 13:53 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

Some pics of Leonardo Kuffo presenting his SIGMOD2025 paper on PDX.

PDX is a vertical layout that can accelerate vector search in principle in any vector index technique (it makes the distance calculation faster, using better SIMD + pruning).

ir.cwi.nl/pub/35044/3504…
github.com/cwida/PDX

25.06.2025 14:50 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

And.. Azim Afroozeh put a lot of effort in open-sourcing the ALP floating point compressor (github.com/cwida/ALP). Leonardo Kuffo had written with him the SIGMOD2024 paper which now won a reproducibility award!

+ πŸ™πŸ™ to the reproducibility and artifacts committee - this is a ton of work.

25.06.2025 13:28 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

But @cwi_da has no reason to complain, here in Berlin.

Leonardo Kuffo at the preceding DaMoN2025 workshop won the Best Paper Award for a study that showed that for vector databases, it matters a lot which AWS CPU you pick.

Congratulations to him!

25.06.2025 13:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

SIGMOD2025 for the 1st time used a schedule where most papers are presented as posters only.

Tips for next time:
- gather user interest data prior to deciding poster-or-paper & room assignment.
- present posters in a (high ceiling) room with good acoustics & allot enough presentation space + time.

25.06.2025 13:25 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

🎞️ The stream of β€œDuckLake & The Future of Open Table Formats”, a conversation between Hannes MΓΌhleisen and Jordan Tigani, will start in two hours!

17.06.2025 13:01 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

The opening talk of #systemsdistributed, organized by our friends @tigerbeetle.com in the Eye Film museum in Amsterdam, was given by @hannes.muehleisen.org of
@duckdb.org about:

DuckLake (ducklake.select)

and this was very well received

movie poster refers back to CIDR2025 πŸ˜„

20.06.2025 09:10 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

DuckLake: leverage DB tech for Data Lake metadata.

works on duckdb, postgres, MySQL & SQLite

provides:
- multi-statement &
multi-table transactions
- SQL views
- delta queries
- encryption
- low latency: no S3 metadata &
inlining: store small inserts in-catalog
and more!

28.05.2025 21:47 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Post image

A new preprint from database researchers found DuckDB the most environmentally efficient system: arxiv.org/pdf/2504.18980

01.05.2025 10:04 β€” πŸ‘ 24    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Post image

Introducing the DuckDB Local UI, the easiest way to explore local data files with DuckDB. Built in close partnership with @duckdb.org for the community.

duckdb -ui

Learn more:
duckdb.org/2025/03/12/d...

12.03.2025 17:08 β€” πŸ‘ 24    πŸ” 10    πŸ’¬ 0    πŸ“Œ 0

We strive to ensure that the DuckDB project stays open-source in the long term. That's why we set up the non-profit DuckDB Foundation in 2021. The Foundation owns the intellectual property of the DuckDB project and enshrines the availability of DuckDB as open-source in its notarized statutes.

25.02.2025 08:50 β€” πŸ‘ 69    πŸ” 6    πŸ’¬ 3    πŸ“Œ 0
Preview
Vanessa Evers appointed new director of CWI The Board of NWO-I, the institute organization of NWO, appointed Prof. Vanessa Evers as Director of CWI, the national research institute for mathematics and computer science in the Netherlands. In mid...

Vanessa Evers will be our new director!
Excited we will be able to draw on her expertise in integrating AI&tech in social interactions.
More than ever, it is important technology is advanced in ways it positively contributes to society.
www.cwi.nl/en/news/vanessa-evers-appointed-new-director-of-cwi

14.02.2025 17:05 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image Post image

CIDR2025 is a wrap!

Lived the many interesting papers & discussions, Gong Show, @duckdb reception..

ACM president Yannis Ioannidis gave an inspiring talk on open science.

Proceedings are in ACM DL & VLDB (see cidrdb.org).

πŸ™ all in+outside @cwi-amsterdam.bsky.social who helped organize!!

22.01.2025 17:58 β€” πŸ‘ 14    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

In five days the CIDR2025 conference (cidrdb.org) will start, and we are expecting around 170 attendees from all over the world.

On an unrelated note, the exotic "goldeneye" duck was just spotted in The Netherlands!

See: bit.ly/duck-goldeneye

14.01.2025 21:37 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
DuckPGQ: SQL/PGQ in DuckDB
YouTube video by LDBC Linked Data Benchmark Council DuckPGQ: SQL/PGQ in DuckDB

My @ldbcouncil.org TUC talk is online! πŸŽ₯ Learn about #DuckPGQ and #SQL/#PGQ here:
πŸ‘‰ www.youtube.com/watch?v=Fzci...

Catch me at @fosdem.bsky.social on Feb 1 in the Data Analytics room, where I’ll continue spreading the word about #DuckPGQ and #SQL/#PGQ. Hope to see you there! πŸš€ #FOSDEM2025

14.01.2025 17:24 β€” πŸ‘ 4    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

Many congrats @hannes.muehleisen.org!

A well-deserved award, recognizing the innovations in @duckdb.org - the most successful open-source DB system to come from @cwi-amsterdam.bsky.social

Let me also honor his 1st PhD student, Mark Raasveldt (DuckDB Labs CTO), instrumental in shaping the project.

13.01.2025 21:53 β€” πŸ‘ 16    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@andypavlo.bsky.social's yearly database in review is out, and fun to read. Mentions @duckdb.org in the context of new postgres integrations ("shotgun weddings"?).

Andy will again be in Amsterdam for CIDR2025 (Jan 19-22) & there are 4 days to register for it: cidrdb.org/cidr2025/registration.html

01.01.2025 18:26 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
CIDR2025 will once again be held in the MΓΆvenpick hotel at the waterfront in the city center of Amsterdam, a walkable distance from the Central Station (just take a train there from the airport, no need for a taxi/uber).

CIDR2025 will once again be held in the MΓΆvenpick hotel at the waterfront in the city center of Amsterdam, a walkable distance from the Central Station (just take a train there from the airport, no need for a taxi/uber).

Amsterdam is once again hosting my favorite event, the Conference on Innovative Data Systems (CIDR2025).

Check its exciting program: www.cidrdb.org/cidr2025/pro...

It will be held January 19-22 in the Amsterdam MΓΆvenpick hotel.

Plan your trip quickly, because registration closes on Thursday!

16.12.2024 20:51 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
duckpgq DuckDB Community Extensions Extension that adds support for SQL/PGQ and graph algorithms

Exciting milestone: The DuckPGQ extension for #DuckDB has surpassed 10,000 downloads!πŸŽ‰

A huge thanks to the community for supporting DuckPGQ for graph analytics. Stay tunedβ€”the next update will bring property graph creation over attached databases!

Explore DuckPGQ here: duckdb.org/community_ex...

03.12.2024 09:26 β€” πŸ‘ 23    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Promotional image for DuckCon #6 in Amsterdam, taking place on January 31, 2025, at Pakhuis de Zwijger. The text highlights the talk topic: β€˜Unlocking graph analytics in DuckDB with SQL/PGQ,’ accompanied by a headshot of the speaker, Daniel ten Wolde, and the DuckDB Foundation logo.

Promotional image for DuckCon #6 in Amsterdam, taking place on January 31, 2025, at Pakhuis de Zwijger. The text highlights the talk topic: β€˜Unlocking graph analytics in DuckDB with SQL/PGQ,’ accompanied by a headshot of the speaker, Daniel ten Wolde, and the DuckDB Foundation logo.

Excited to speak at #DuckCon #6 in Amsterdam on Jan 31, 2025!πŸŽ‰

I’ll share how #DuckDB unlocks graph analytics with SQL/PGQ from the SQL:2023 standard using the #DuckPGQ extension.

πŸ“ Free to attend & livestreamed on YouTube!
πŸ“… Details + register: duckdb.org/events/2025/...

πŸš€ Hope to see you there!

03.12.2024 15:39 β€” πŸ‘ 24    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
MotherDuck team photographed in The Netherlands

MotherDuck team photographed in The Netherlands

Happy to see @motherduck.com opening shop in my hometown Amsterdam: bit.ly/motherduck-a...

In reality, they have already been renting offices for 1.5 years close to the Database Architectures research group at CWI, but with a Dutch legal entity, and soon an own office, things are solidifying.

03.12.2024 17:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image Post image Post image

Thanks Andy. That is a big honor and it was an honor to have you in Amsterdam. Enjoy the thxgiving break!

29.11.2024 22:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@peterabcz is following 15 prominent accounts