Philippe Noรซl's Avatar

Philippe Noรซl

@philippemnoel.bsky.social

CEO @paradedb.bsky.social โ€ข H'20 โ€ข ๐Ÿ‡ซ๐Ÿ‡ท๐Ÿ‡จ๐Ÿ‡ฆ https://philippemnoel.posthaven.com

2,329 Followers  |  31 Following  |  200 Posts  |  Joined: 31.10.2024
Posts Following

Posts by Philippe Noรซl (@philippemnoel.bsky.social)

Preview
GitHub - paradedb/paradedb: Simple, Elastic-quality search for Postgres Simple, Elastic-quality search for Postgres. Contribute to paradedb/paradedb development by creating an account on GitHub.

4/4. I've long said that search engines and columnar databases have a lot in common... We'll likely write more about this in the future, but in the meantime, I recommend checking out the blog.

And of course don't hesitate to star the project: github.com/paradedb/par...

03.03.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

3/4. To accomplish this, we use two core technologies:

- Columnar storage, for fast, cache-friendly lookups
- Block WAND, which enables early pruning when doing BM25 scoring

Neither are native to Postgres, but ParadeDB supports them in our BM25 index thanks to Tantivy.

03.03.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

2/4. Top K queries are the core workload of any search engine. Whether you use Google Search as a consumer or a complex B2B search engine in a SaaS product, the core goal of search is "find me the K most relevant results." That's Top K optimization.

03.03.2026 21:04 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

1/4. The simplest questions are often the hardest to answer. In this technical blog, our CTO dives into how we optimized Top K queries in ParadeDB.

I *highly* recommend reading. But if you need some more convincing, here's why this matters ๐Ÿงต

03.03.2026 21:04 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

ParadeDB is at 99 contributors. Who will be the 100th? ๐Ÿ‘€

02.03.2026 22:10 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - getomnico/omni: Workplace AI Assistant and Search Platform Workplace AI Assistant and Search Platform. Contribute to getomnico/omni development by creating an account on GitHub.

Exciting project built with ParadeDB on the front page of HN today! github.com/getomnico/omni

02.03.2026 16:49 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

We now support Nix/NixOS thanks to the incredible work of Luc Perkins!

26.02.2026 17:45 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It is now possible to reference the ParadeDB Skill directly via npx:

24.02.2026 22:39 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I've moved to San Francisco! Every time I visit SF, I'm struck by how excited and full of ideas people here always are. The optimism is contagious. It was time to join in on the fun!

If you're around the Bay Area and want to chat databases, hit me up. โœŒ๏ธ

17.02.2026 18:20 โ€” ๐Ÿ‘ 8    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Claude Code is now an expert at ParadeDB code. This is the next step in our current push to integrate with the ecosystem.

Most of our largest customers already use this skill to accelerate their development with ParadeDB. Give it a try, feedback welcome.

10.02.2026 16:34 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

ParadeDB is hiring someone to help build integrations. ORMs, RAG frameworks, PaaS, etc. Remote within US/Canada timezones, preference for PST. You'll work directly with me.

It's the perfect time to join, just ahead of the 1.0 later this year.

08.02.2026 14:57 โ€” ๐Ÿ‘ 8    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The movement of rewriting old Python tools in Rust is phenomenal for development velocity. pre-commit --> prek, and so many others

07.02.2026 16:21 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

If you use LangChain / etc. RAG frameworks today, I'd love to ask you a few questions! DMs open

03.02.2026 23:37 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I am blown away at how much work a small, tight-knit and competent team can accomplish nowadays. The world has truly shifted from "time" to "taste" as the limited currency when striving to do great work

23.01.2026 22:53 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

High-quality search is more than keyword matching. Personalization is what takes search from good to great.

We're investing a lot in building the "unified retrieval stack" for Postgres this year. Expect lots of announcements.

For now, here's how to build personalization today.

22.01.2026 17:57 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Looking to hire/contract someone who is experienced making Kubernetes operator for a ~few weeks project. DMs open

13.01.2026 23:11 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - paradedb/paradedb: The transactional Elasticsearch alternative built on Postgres The transactional Elasticsearch alternative built on Postgres - paradedb/paradedb

6/6. Highly encourage reading this post; it's a particularly interesting one.

And as always, don't hesitate to give us a โญ github.com/paradedb/par.... Back to work!

10.12.2025 19:46 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

5/6. Result --> 15X faster aggregates in Postgres, all native. This blog post goes over the bucketing use case, but another one will follow, showcasing how regular COUNT(*) queries get accelerated the same way. Pretty powerful for:

- Search
- Live dashboards
- RAG
- and more!

10.12.2025 19:46 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

4/6. To enable our facets to be executed in a single index pass against our columnar index, we leveraged two features of Postgres:

- Planner hooks (to intercept the aggregate before Postgres gets it)

- Custom scans (to re-route it to our columnar index/Tantivy)

10.12.2025 19:46 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

3/6. BUT! That's one of the main features of a search engine. "Give me some results, tell me how many results there are, and categorize them." So we had to do something about it.

To solve this, we've leveraged ParadeDB's columnar index. Faceted search is essentially a type of aggregate, after all.

10.12.2025 19:46 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

2/6. Some time ago, there was a project from Cybertec about building better faceting in Postgres: github.com/cybertec-pos...

Needless to say, the interest is there. If you've ever run a COUNT query in Postgres, you know it does not scale well *at all*.

10.12.2025 19:46 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

1/6. Faceted search (read: aggregates) in Postgres, now at the speed of columnar.

We just released a blog post detailing how we built fast aggregates (COUNT, bucketing, etc.) inside Postgres. This has been one of our most requested features, and we're excited to take you through the journey. ๐Ÿงต

10.12.2025 19:46 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

5/5. Together, these changes have made using ParadeDB (we think) significantly easier. You can check it out at docs.paradedb.com. We'd love to hear your feedback on the new API.

And of course, don't hesitate to give us a star โญ

04.12.2025 23:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

4/5. We came up with two ideas:

- Tokenizers as Postgres types. This makes tokenizers a first-class construct in Postgres.

- Custom operators. Inspired by pgvector, we changed a few of our functions to be custom Postgres operators.

04.12.2025 23:02 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

3/5. As our adoption grew, we noticed two things:

- Defining tokenizers (a critical component of full-text search) was confusing

- ORM integrations were difficult

Developer experience is critical to any database, and so we had to fix this.

04.12.2025 23:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

2/5. Our initial API was JSON-based. Inside Postgres, that felt... odd. But there was a reason.

Elastic is JSON-based, and so are Lucene and Tantivy (our underlying search engine). We focused on exposing powerful search features, but not so much on their ergonomics.

04.12.2025 23:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

1/5. We just released V2 of the ParadeDB API. Elastic-quality search and aggregates, SQL-native, and ORM-friendly. Here's how we got there:

04.12.2025 23:02 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
The Transactional Elasticsearch Alternative - ParadeDB An open source, ACID-compliant alternative to Elasticsearch

7/7. We love feedback. Give it a try at docs.paradedb.com and let us know how it goes.

We've got a lot of good stuff coming (did somebody say faster JOINs?) and are excited to build upon these features to deliver a truly SQL-native Elasticsearch alternative.

26.11.2025 19:06 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

6/7. Using the proper tokenizers is critical to all complex search workloads and we've seen a large improvement in developer velocity with this new SQL interface.

It's also laying the ground work for us to integrate with ORMs, which is perhaps our second most requested feature.

26.11.2025 19:06 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

5/7. Last but not least, we've revamped our SQL interface. We've introduced a few new SQL operators to simplify queries, and have made tokenizers a first-party construct in SQL. Stemmers are tokenizers are now defined at the index creation time and can be customized as needed.

26.11.2025 19:06 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0