Arjen P. de Vries's Avatar

Arjen P. de Vries

@arjenpdevries.bsky.social

https://arjenp.dev/ and @arjen@idf.social

206 Followers  |  148 Following  |  21 Posts  |  Joined: 27.09.2024  |  1.6848

Latest posts by arjenpdevries.bsky.social on Bluesky

Preview
Column | De voorzichtige terugkeer van het politieke midden Het politieke midden in Nederland heeft zijn stem hervonden. Dat is en blijft het verhaal van deze verkiezingen. Ja, de meeste mensen stemmen nog altijd rechts. En ja, de formatie wordt gecompliceerd....

Onderzoek van politicologen wijst stelselmatig uit dat kiezers niet verrechtsen. Het is het aanbod dat verrechtst, de positie van politieke leiders. Daarom is de voorzichtige terugkeer van het politieke midden juist nu belangrijk voor de democratie.
Mijn column in @nrc.nl
www.nrc.nl/nieuws/2025/...

01.11.2025 10:19 โ€” ๐Ÿ‘ 118    ๐Ÿ” 64    ๐Ÿ’ฌ 14    ๐Ÿ“Œ 8
Original post on idf.social

โ€˜The Democrats Still May Not Understand What They're Dealing Withโ€™ - POLITICO
https://www.politico.com/news/magazine/2025/10/11/elon-musk-donald-trump-silicon-valley-book-jacob-silverman-00603682

"A simple way to put it is that this is a group of people used to getting everything that they [โ€ฆ]

15.10.2025 06:40 โ€” ๐Ÿ‘ 0    ๐Ÿ” 2    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Preview
NerdVote.NL Ondersteuning voor en informatie over alle kandidaat-kamerleden met en warm hart voor onze privacy en eigen IT-vaardigheden.

Een sterke, competente vaste kamercommissie Digitale zaken.
Dit initiatief van @berthubert.bsky.social vraagt aandacht voor "genoeg kennis in de Tweede Kamer te behouden of te krijgen [zo] dat de commissie ook na de verkiezingen geloofwaardig door kan" #tk2025

nerdvote.nl/diza/

07.10.2025 19:06 โ€” ๐Ÿ‘ 11    ๐Ÿ” 7    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0
Preview
EFF Members' Speakeasy: Portland 2025 Join EFF staff and local online rights supporters for a Speakeasy meet up on Saturday, October 25 in Portland, Oregon at 10 Barrel Brewing!Raise a glass and discover EFF's latest work defending

Portland! We're excited to host a semi-secret members' Speakeasy on October 25 ๐Ÿป We hope to see you there: eff.org/speakeasy-p...

07.10.2025 00:26 โ€” ๐Ÿ‘ 36    ๐Ÿ” 11    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Original post on idf.social

FT: Deloitte issues refund for error-ridden Australian government report that used AI
https://www.ft.com/content/934cc94b-32c4-497e-9718-d87d6a7835ca

"The document contained multiple errors, including references and citations to non-existent reports by academics at the universities of Sydney [โ€ฆ]

07.10.2025 06:29 โ€” ๐Ÿ‘ 1    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - tobilg/duckdb-dns: DNS (Reverse) Lookup Extension for DuckDB DNS (Reverse) Lookup Extension for DuckDB. Contribute to tobilg/duckdb-dns development by creating an account on GitHub.

Created a DNS extension for @duckdb.org today:

github.com/tobilg/duckd...

Hopefully it will be available via the Community Extensions soon!

06.10.2025 17:50 โ€” ๐Ÿ‘ 16    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Airlines Sell 5 Billion Plane Ticket Records to the Government For Warrantless Searching New documents obtained by 404 Media show how a data broker owned by American Airlines, United, Delta, and many other airlines is selling masses of passenger data to the U.S. government.

Insight in the data economy:

https://www.404media.co/airlines-sell-5-billion-plane-ticket-records-to-the-government-for-warrantless-searching/

Seems dodgy! Did I get the option to not consent? I don't recall seeing this on my last flight booking.

17.09.2025 08:36 โ€” ๐Ÿ‘ 1    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Podcast Episode: Building and Preserving the Library of Everything All this season, โ€œHow to Fix the Internetโ€ has been focusing on the tools and technology of freedom โ€“ and one of the most important tools of freedom is a library. Access to knowledge not only creates ...

Building a decentralized, distributed web is vital to finding and preserving information for all, @archive.org founder @brewster.kahle.org tells EFFโ€™s Cindy Cohn and @thejasonkelley.com on the latest episode of โ€œHow to Fix the Internet.โ€

17.09.2025 15:05 โ€” ๐Ÿ‘ 54    ๐Ÿ” 19    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Post image

๐Ÿ•Š๏ธ Lifetime Achievement Award at #ACL2025NLP

A standing ovation for Prof. Kathy McKeown, recipient of the ACL 2025 Lifetime Achievement Award! ๐ŸŒŸ

30.07.2025 13:03 โ€” ๐Ÿ‘ 30    ๐Ÿ” 5    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Post image Post image

๐ŸŒŸReally excited to share the fourth Strategic Workshop on Information Retrieval (SWIRL) report published in SIGIR Forum!

Paper ๐Ÿ‘‰๐Ÿป www.johannetrippas.com/papers/tripp...

More info ๐Ÿ‘‰๐Ÿป sites.google.com/view/swirl20...

#SWIRL2025 #SIGIR2026 #IR #GenAI #Research #CHIIR2026

02.09.2025 12:38 โ€” ๐Ÿ‘ 13    ๐Ÿ” 10    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Barbara Kathmann heeft fantastisch werk gedaan op het gebied van digitalisering en privacy, maar haar indrukwekkende prestaties op deze portefeuille kregen van de kandidatencommissie verrassend genoeg niet de waardering die ze verdienen.

Stem daarom op onze strijdbare anti-Musk! #RaiseTheBar

28.08.2025 01:00 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 2
Original post on idf.social

Recording of the OWI launch event from June 6th:

https://vimeo.com/1112822052

+ Get introduced to the Open Web Index
+ Learn how to access and use the Open Web index
+ Licenses and options for raw data access
+ Use cases for the Open Web Index
+ Hands-on tutorial on our index access tools [โ€ฆ]

28.08.2025 16:05 โ€” ๐Ÿ‘ 1    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Python!

https://youtu.be/GfH4QL4VqJ0?si=lT2Y7QhgusMLNctH

The Python Documentary.
Featuring several CWI ex-colleagues, of course.

The premiere is tomorrow on YouTube at 17:00 UTC (that's 19:00 CET for Europeans). It's an hour and 24 minutes long.

27.08.2025 13:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Zo, lekker dan, alsof die mensen ook maar iets zinvols kunnen zeggen over hoe een zoekmachine werkt! Achtten ze jou daar niet toe in staat en ging de rechtbank daarin mee?!
Alas, het doet er denk ik niet meer toe.

Hansken is trouwens een followup op Xiraf (maar dat wist je natuurlijk al :-)).

11.07.2025 14:58 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Maakt nieuwsgierig naar de context David!
was het leuk om te doen?

11.07.2025 14:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Informagus @ SIGIR & ICTIR 2025

Sunday July 13 โ€“ Tutorials and DC Consortium

Enhancing Knowledge Injection in Large Language Models for Efficient and Trustworthy Responses
Heydar Soudani
Doctoral Consortium Paper - 16:30 - Doctoral Consortium

Monday July 14 โ€“ Main Conference

Score-Fitted Indexes and Constant Length Indexes for Information Retrieval
Djoerd Hiemstra
Short Paper - 14:00 - Short Papers Posters 1
In a Few Words: Comparing Weak Supervision and LLMs for Short Query Intent Classification
Daria Alexander, Arjen P. de Vries
Short Paper - 14:00 - Short Papers Posters 1
RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces
Santiago de Leon-Martinez, Jingwei Kang, Robert Moro, Maarten de Rijke, Branislav Kveton, Harrie Oosterhuis, Maria Bielikova
Resource Paper - 14:00 - Resource Papers Posters 1

Tuesday July 15 โ€“ Main Conference

Optimizing Compound Retrieval Systems
Harrie Oosterhuis, Rolf Jagerman, Zhen Qin, Xuanhui Wang
Full Paper - 14:00 - Search and Ranking 1

Wednesday July 16 โ€“ Main Conference

Learning to Rank with Variable Result Presentation Lengths
Norman Knyazev, Harrie Oosterhuis
Full Paper - 10:30 - Reranking
Adaptive Orchestration of Modular Generative Information Access Systems
Mohanna Hoveyda, Harrie Oosterhuis, Arjen P. de Vries, Maarten de Rijke, Faegheh Hasibi
Perspectives Paper - 12:15 - Perspectives 1

Thursday July 17 โ€“ Workshops

On the Neural Hype and Improving Efficiency of Sparse Retrieval
Djoerd Hiemstra
Keynote - 10:30 - ReNeuIRโ€™25 Workshop
Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation
Junru Wu, Le Yan, Zhen Qin, Honglei Zhuang, Paul Suganthan G.C., Tianqi Liu, Zhe Dong, Xuanhui Wang, Harrie Oosterhuis
Workshop Paper - 14:15 - ReNeuIRโ€™25 Workshop
An Axiomatic Examination of Uncertainty Estimation in RAG Systems
Heydar Soudani, Evangelos Kanoulas, Faegheh Hasibi
Workshop Paper - Time TBA - IR-RAGโ€™25 Workshop
FACE: A Fine-grained Reference Free Evaluator for Coโ€ฆ

Informagus @ SIGIR & ICTIR 2025 Sunday July 13 โ€“ Tutorials and DC Consortium Enhancing Knowledge Injection in Large Language Models for Efficient and Trustworthy Responses Heydar Soudani Doctoral Consortium Paper - 16:30 - Doctoral Consortium Monday July 14 โ€“ Main Conference Score-Fitted Indexes and Constant Length Indexes for Information Retrieval Djoerd Hiemstra Short Paper - 14:00 - Short Papers Posters 1 In a Few Words: Comparing Weak Supervision and LLMs for Short Query Intent Classification Daria Alexander, Arjen P. de Vries Short Paper - 14:00 - Short Papers Posters 1 RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces Santiago de Leon-Martinez, Jingwei Kang, Robert Moro, Maarten de Rijke, Branislav Kveton, Harrie Oosterhuis, Maria Bielikova Resource Paper - 14:00 - Resource Papers Posters 1 Tuesday July 15 โ€“ Main Conference Optimizing Compound Retrieval Systems Harrie Oosterhuis, Rolf Jagerman, Zhen Qin, Xuanhui Wang Full Paper - 14:00 - Search and Ranking 1 Wednesday July 16 โ€“ Main Conference Learning to Rank with Variable Result Presentation Lengths Norman Knyazev, Harrie Oosterhuis Full Paper - 10:30 - Reranking Adaptive Orchestration of Modular Generative Information Access Systems Mohanna Hoveyda, Harrie Oosterhuis, Arjen P. de Vries, Maarten de Rijke, Faegheh Hasibi Perspectives Paper - 12:15 - Perspectives 1 Thursday July 17 โ€“ Workshops On the Neural Hype and Improving Efficiency of Sparse Retrieval Djoerd Hiemstra Keynote - 10:30 - ReNeuIRโ€™25 Workshop Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation Junru Wu, Le Yan, Zhen Qin, Honglei Zhuang, Paul Suganthan G.C., Tianqi Liu, Zhe Dong, Xuanhui Wang, Harrie Oosterhuis Workshop Paper - 14:15 - ReNeuIRโ€™25 Workshop An Axiomatic Examination of Uncertainty Estimation in RAG Systems Heydar Soudani, Evangelos Kanoulas, Faegheh Hasibi Workshop Paper - Time TBA - IR-RAGโ€™25 Workshop FACE: A Fine-grained Reference Free Evaluator for Coโ€ฆ

If you are attending #SIGIR2025 and #ICTIR2025, come check out our research group's presentations! ๐Ÿ˜Š
We have 14 presentations in total, with at least one every day. ๐Ÿ˜„

If you want to know more about our group, e.g., the meaning of the Informagus name ๐Ÿ˜, do reach out to us.

See you in Padua! ๐Ÿ˜Ž โ˜€๏ธ ๐Ÿ‡ฎ๐Ÿ‡น

11.07.2025 08:49 โ€” ๐Ÿ‘ 12    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Column | Carte blanche geven aan Israรซl is een blunder van formaat In Europa: Caroline de Gruyter

Carte blanche geven aan Israรซl is een blunder van formaat
Mijn column in @nrc.nl

www.nrc.nl/nieuws/2025/...

21.06.2025 08:31 โ€” ๐Ÿ‘ 138    ๐Ÿ” 60    ๐Ÿ’ฌ 13    ๐Ÿ“Œ 11
A diagram showing the breakdown of the Common Pile corpus. It shows large chunks coming from Code, wikimedia, stackexchange etc.

A diagram showing the breakdown of the Common Pile corpus. It shows large chunks coming from Code, wikimedia, stackexchange etc.

An 8TB corpus of copyright-free text for training AI models.

https://github.com/r-three/common-pile/blob/main/paper.pdf

I'm glad somebody has finally done this. The "we need to break copyright or AI won't work argument" feels super-dodgy to me and we can just evaluate whether it's true. 1/n

06.06.2025 09:51 โ€” ๐Ÿ‘ 9    ๐Ÿ” 3    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
DIR 2025

#DIR2025, the 22nd Dutch-Belgian Information Retrieval workshop will take place at Radboud University Nijmegen on 27 October 2025!

https://informagus.nl/dir2025/

06.06.2025 09:07 โ€” ๐Ÿ‘ 5    ๐Ÿ” 8    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Welcome to the age of $10/month Lakehouses No, this article is not about buying properties close to lakes...

Welcome to the age of $10/month Lakehouses!

How to build and run a Lakehouse on top of @cloudflare.social R2 , Cloudflare Containers and Neon Postgres, all backed by the new DuckLake "SQL as Lakehouse" format, via @duckdb.org.

tobilg.com/the-age-of-1...

30.05.2025 18:28 โ€” ๐Ÿ‘ 45    ๐Ÿ” 10    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 1
Original post on idf.social

Radboud ICIS research with KU Leuven revealed a huge tracking scandal today: https://localmess.github.io/

Meta and Yandex (ab-)used Android's fuzzy rules on using localhost to track your browsing patterns without your consent. Yandex even owns a domain to hide these access patterns.

Read more [โ€ฆ]

03.06.2025 13:09 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Not really db infra, it can be "just another duckdb" if you like.

But honestly, if you handle iceberg files you are probably running a catalog server too, which you can now drop.

27.05.2025 20:33 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This is genius. If Iโ€™m understanding correctly, metadata is handled by a DuckDB compatible SQL database and the actual data is handled by an open file format of your choice.

You can perform familiar SQL queries and DDL, on highly scalable open format data files. Well done! #databs #dataengineering

27.05.2025 14:25 โ€” ๐Ÿ‘ 13    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Today we're launching DuckLake, an integrated data lake and catalog format powered by SQL. DuckLake unlocks next-generation data warehousing where compute is local, consistency central, and storage scales till infinity. โ ducklake is an open standard and we implemented it in the "ducklake" extension.

27.05.2025 13:12 โ€” ๐Ÿ‘ 143    ๐Ÿ” 40    ๐Ÿ’ฌ 8    ๐Ÿ“Œ 27
Original post on idf.social

DuckLake announced today:
https://ducklake.select/

An integrated data lake and catalog format

"DuckLake delivers advanced data lake features without traditional lakehouse complexity by using Parquet files and your SQL database. It's an open, standalone format from the DuckDB team."

Podcast [โ€ฆ]

27.05.2025 17:47 โ€” ๐Ÿ‘ 1    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

For more on why the answer for all of AI may be 42, check out @emilymbender.bsky.social and @alexhanna.bsky.social โ€˜s new book!

23.05.2025 20:03 โ€” ๐Ÿ‘ 33    ๐Ÿ” 5    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Research on #OpenScience in the Netherlands: Call for proposals

https://www.openscience.nl/en/calls/research-on-open-science

23.05.2025 20:18 โ€” ๐Ÿ‘ 1    ๐Ÿ” 13    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Genoten bij S10 in Roosje vanavond

23.05.2025 21:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Original post on idf.social

Post-quantum crypto finds its use in the real world!

An informative article on the recent RHEL 10 release: https://www.redhat.com/en/blog/post-quantum-cryptography-red-hat-enterprise-linux-10

#Radboud University has a great track record in the research that led to this step, eg the recent PhD [โ€ฆ]

23.05.2025 06:28 โ€” ๐Ÿ‘ 0    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Save the Web?
Save the date!

June 6, 2025, 10am CEST

22.05.2025 22:02 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@arjenpdevries is following 20 prominent accounts