Thatβs a different concept though, right? As you said, here you have a proxy and you pick a different query engine over the same storage. I think using the term federation in this case will confuse people. I can see how this pattern can work.
18.10.2025 18:46 β π 2 π 0 π¬ 0 π 0
Full scans on different data sources that then need to be joined and a much closer to ETL workload. This will kill every federated query engine.
Plus what do you do when you have different semantic between different query engines? Letβs say how you handle decimal overflows.
18.10.2025 00:34 β π 3 π 0 π¬ 0 π 0
Oh no. Trino tried tried to do that. You really canβt do it. The problem with federated queries is that they work well when you can push computation down to the query engine you federate at and get out a highly reduced dataset. Thatβs not the case with ETL though.
18.10.2025 00:34 β π 3 π 0 π¬ 2 π 0
fenic 0.4.0 brings fenic and its expressive API for working with data, to agents.
With tooling becoming a catalog artifact, MCP servers and toolsets being available with just a cli command you can turn any data set you have into well curated context for your agents.
check it out!
09.09.2025 21:37 β π 0 π 0 π¬ 0 π 0
New episode: chatting with bauplan founders Jacopo Tagliabue and Ciro Greco on shipping AI with real-world data constraints.
Why listen
1. Data pipelines determine model effectiveness, far more than most teams admit.
08.09.2025 14:39 β π 4 π 3 π¬ 1 π 0
20.08.2025 06:02 β π 2 π 0 π¬ 0 π 0
https://github.com/typedef-ai/fenic
7/7
Give it a try, β the repo, open issues and join the community!
π t.co/zDj8rBO5Ce
07.08.2025 18:10 β π 1 π 0 π¬ 0 π 0
6/7
Performance & DX
Rust optimizations plus leaner default configs deliver performance gains and a frictionless setup experience.
so you spend less time tuning and more time building.
07.08.2025 18:10 β π 0 π 0 π¬ 1 π 0
5/7
New Functions & Models
Access built-in summarization, new semantic APIs, and multiple embedding providers (e.g. Cohere, Google Gemini) out of the box.
This broadens your toolkit, so you can prototype and productionize a wider range of AI workflows quickly.
07.08.2025 18:10 β π 0 π 0 π¬ 1 π 0
4/7
Composable Pipelines
Save intermediate DataFrames as persistent views in the fenic catalog.
Reuse and chain complex transformations across jobs without rewriting or rerunning upstream logic, accelerating iteration and collaboration.
07.08.2025 18:10 β π 0 π 0 π¬ 1 π 0
3/7
Typed Semantics
Define your output schema once with Pydantic and get back validated, strongly typed results.
This enforces consistency, surfaces errors early, and eliminates manual parsing of LLM responses.
07.08.2025 18:10 β π 1 π 0 π¬ 1 π 0
2/7
Robust Fuzzy Text Matching
Ground LLM outputs against your existing data: record linkage, deduplication, and typo-tolerant joins become first-class operations.
This improves precision in extraction pipelines and slashes downstream error rates.
07.08.2025 18:10 β π 0 π 0 π¬ 1 π 0
Here's a bit more information on each of the new π¦ fenic π¦ features.
1/7 π§΅
Dynamic Templating
Turn any column struct or array into a live prompt fragment. No more string concatenation hacks. You get per row, data driven prompts with minimal code, boosting relevance and reducing boilerplate.
07.08.2025 18:10 β π 0 π 0 π¬ 1 π 0
Using Jinja templates to dynamically create prompts for semantic filtering in fenic.
fenic v0.3.0 is out and it's a release I'm really excited about!
Here are a few of the things that this release is introducing.
Jinja as a column function
Robust Fuzzy Text Matching
Full Pydantic support in all semantic operators
Persistent views
More Functions & Models
Perf & DX improvements
06.08.2025 21:09 β π 4 π 1 π¬ 1 π 0
@steveklabnik.com Joined us on an episode where we discussed about
Why:
β’ Cargo & friendly errors > benchmarks
β’ 6-week releases > years-long committees
β’ How Rust united Ruby, FP & C++ devs
β’ Next-gen picks
and many more!
Check the episode on your favorite platform!
28.07.2025 14:36 β π 4 π 1 π¬ 0 π 0
Everyoneβs heads down on AI these days, but please take a break and soak in some deep systems wisdom from Josh Howards.
Heβs one of the folks behind R2 at Cloudflare.
After all, whatever you build in AI will sit on top of these foundations.
check @totrrocks.bsky.social for the episode link.
06.06.2025 23:30 β π 2 π 0 π¬ 0 π 0
Startups and new products increasingly prioritize serverless models to reduce user friction and accelerate adoption.
@philippemnoel.bsky.social from ep.12
15.05.2025 23:55 β π 2 π 1 π¬ 0 π 0
The value proposition of formal methods becomes clear when dealing with complex distributed transactions involving multiple independent services.
Jayaprabhakar(JP) Kadarkarai from ep.5
14.05.2025 14:51 β π 1 π 1 π¬ 0 π 0
User experience and developer interaction with complex data abstractions remain a significant challenge beyond the technical integration.
Nikhil Simha & Varant Zanoyan from ep.2
13.05.2025 23:49 β π 2 π 1 π¬ 0 π 0
Successful AI developer tools must balance synchronous co-pilot style assistance with asynchronous autonomous agent workflows.
@ivanburazin.bsky.social from ep.9
12.05.2025 23:39 β π 2 π 1 π¬ 0 π 0
Managing AI access and permissions requires careful role-based controls to prevent over-privileged AI actions in enterprise environments.
Well said, even before hashtag#MCP was as popular as today.
@ivanburazin.bsky.social from ep.9
12.05.2025 14:37 β π 1 π 1 π¬ 0 π 0
I had the rare opportunity to sit down and chat with someone who helped shape that story of Splunk, co-founder Erik Swan.
There's a lot to learn from him but what inspired me the most is his energy. Even after a success like Splunk, still learning and building
listen here @totrrocks.bsky.social
09.05.2025 17:13 β π 1 π 0 π¬ 0 π 0
Incremental materialization has stumped the industry for decades.
Epsio led by Gilad , is changing that: product-first, real-world incremental views.
If real-time data infra matters to you, check out my chat with Gilad on @totrrocks.bsky.social
25.04.2025 17:13 β π 5 π 0 π¬ 0 π 0
You should definitely check the project!
24.03.2025 16:58 β π 0 π 0 π¬ 0 π 0
Lakekeeper is an open source data catalog built on the Apache Iceberg REST catalog API.
If data infrastructure drives you, check out the project and catch Viktor Kessler's insights on the latest @TotrRocks episode!
24.03.2025 16:45 β π 1 π 0 π¬ 1 π 0
Peninsula Data Happy Hour Β· Luma
π₯ An Unmissable Evening of Data & Magic! π₯
π Back by popular demand, it's time for the March edition of our Peninsula Data Happy Hour! This time we've gotβ¦
We'll be hosting another event at our offices in San Mateo. We want to bring together people who are interested in data and infra, from systems engineers who build data platforms, AI engineers, VCs and everything in between.
Connect and have fun while we learn from each other.
lu.ma/2hc1qm1v
14.03.2025 16:36 β π 1 π 0 π¬ 0 π 0
Software engineer. Building things for lego.com. Distributed systems, serverless, event saucing.
Wrote Serverless Development on AWS (OβReilly)
https://lukehedger.dev/
π London
Noncompetitive Introverted Human Artist
don't expect much from me
β² Lowpoly β Pixel art
Founder CTO of https://actioniq.com/
Partner at https://verissimo.vc/
Host of https://techontherocks.show/
ScyllaDB | P99 CONF | "Writing for Developers: Blogs That Get Read" book (https://github.com/scynthiadunlop/WritingForDevelopersBook)
Breaking distributed systems, one fault at a time.
[bridged from https://mastodon.jepsen.io/@jepsen on the fediverse by https://fed.brid.gy/ ]
Conversations with amazingly smart people who are building the next generation of technology, from hardware to cloud. Hosted by
@cpard.bsky.social @nitayj.bsky.social
Creator of https://0x.tools, also a long-time computer performance geek. Perf & troubleshooting blog: tanelpoder.com. All onions are mine.
Mostly posts about PostgreSQL, Snowflake Postgres, and PostgreSQL extensions.
Formerly Crunchy Data, Microsoft, Citus Data, AWS, TCD, VU
Local First, TanStack DB, @pglite.dev and Sync Engines at @electric-sql.com.
More at https://samwillis.uk
Entrepreneur, software engineer, non-conformist.
π₯ Creator of shadowtraffic.io
Building Astral: Ruff, uv, and other high-performance Python tools, written in Rust.
Run linux workloads faster and safer than linux using the https://nanos.org unikernel.
#rustlang, #jj-vcs, atproto, shitposts, urbanism. I contain multitudes.
Working on #ruelang but just for fun.
Currently in Austin, TX, but from Pittsburgh. Previously in Bushwick, the Mission, LA.
Founder Resonate HQ | Distributed Async Await | Thinking in Distributed Systems | https://dtornow.substack.com
Distributed Systems & databases person. Works at Microsoft on Orleans & Aspire
Associate Professor at @cst.cam.ac.uk, researching decentralised systems and security protocols. Advisor to the Bluesky team. Wrote βDesigning Data-Intensive Applicationsβ (OβReilly). he/him
Writer @ davidsj.substack.com
Apache Arrow & DataFusion PMC Member. Original creator of Apache DataFusion.
Founder of SaaS Developer Community and Nile Database.
Research and analysis for experienced programmers at theconsensus.dev.
eatonphil.com