The Missing Layer in Your AI Stack: Context, Not Just State
From SQL to Semantics: The Rise of the Context Graph for AI Agents
As we move from dashboards to autonomous agents, something breaks.
Systems of record capture what happened, not why.
Why data platforms need Truth Registries + Context Graphs for the agentic era
www.dataengineeringw...
#DataEngineering #AgenticAI #Graphs #LLMs
31.01.2026 04:19
Data Engineering Weekly's 254th edition is out. Context Graph is the new talk of the town!!
26.01.2026 04:17
The companies that build the most boring data stack often win the market!!!
Prove me wrong.
23.01.2026 15:30
Data Contracts: A Missed Opportunity
The Conversation We Should Have Had, Before Thought Leadership Replaced System Design
Data Contract: There was no shortage of activity around the topic. Definitions were proposed and refined. Conceptual boundaries were drawn and redrawn.
I have written down my reflections on Data Contracts here:
www.dataengineeringweekly.com/p/data-contr...
20.01.2026 23:02
How to build a scalable shopping agent?
Here's a wild thought:
What if, and hear me out, we let humans click that Buy Now button? Just throwing ideas out there.
14.01.2026 01:04
Data Engineering Weekly #252
The Weekly Data Engineering Newsletter
This week, it is mostly about Multi-Agent Architecture. Do you think the data infrastructure is ready for a multi-agent architecture? Where is the gap?
12.01.2026 02:40
A Critique of Iceberg REST Catalog: A Classic Case of Why Semantic Spec Fails
How a Semantically Correct API Becomes Operationally Unreliable at Scale
Is a semantic spec good enough to run an enterprise system? I list the challenges of adopting the Iceberg REST Catalog.
09.01.2026 06:16
DEW - The Year in Review 2025
From Digital Plumbers to Architects of Intelligence: The 7 Paradigm Shifts That Defined 2025
Continuing Data Engineering Weekly's yearly tradition, we published the 2025 Year in Review. What do you think is the most notable trend of 2025?
23.12.2025 05:04
www.dataengineeringw...
16.12.2025 02:24
12.12.2025 23:27
Look at the tech stack IBM now controls:
Compute: Red Hat (Linux/OpenShift)
IaC: HashiCorp (Terraform)
FinOps: Kubecost
Streaming: Confluent (Kafka)
Vector/AI: DataStax (Cassandra)
Query Engine: Ahana (Presto)
Ingest: StreamSets
08.12.2025 19:06
Data Engineering Weekly #247
The Weekly Data Engineering Newsletter
LinkedIn moves FishDB to Rust, DoorDash builds AI swarms, and Dropbox masters context engineering. 🤯 Data Engineering Weekly #247 is packed with system design deep dives from the best engineering teams.
08.12.2025 01:31
If the Data Catalog is the answer for AI, the question was wrong.
04.12.2025 19:10
The Dark Data Tax: How Hoarding is Poisoning Your AI
Storage is cheap. Attention is finite. Hallucinations are expensive. It's time to stop building Data Lakes and start managing Data Metabolism
We stopped asking if data was useful because storage got cheap. Now, "Dark Data" is actively poisoning your AI context windows with hallucination vectors.
Read about the Data Sustainability index
19.11.2025 15:01
Open-source companies built their success on open-source platforms and benefited from community contributions and adoption, but now must abandon open-source principles to survive commercially.
10.11.2025 02:47
Data Engineering Weekly #244
The Weekly Data Engineering Newsletter
The 244th edition of Data Engineering Weekly dives into:
AI agents as execution engines, LLM inference economics, databases for AI, personalization, and product evidence.
Read more: www.dataengineeringw...
#DataEngineering #AI #LLMs
03.11.2025 09:29
Cricket has been India's greatest force in overcoming centuries of colonial suppression. Today's Women's World Cup win echoes the spirit of 1983, a triumph that will inspire generations to come. 🇮🇳
03.11.2025 00:40
Thinking Like a Data Engineer
A Journey Beyond Code: Toward Systems, Curiosity, and Confidence
This is the most personal essay I have written in Data Engineering Weekly. I share a few key moments in my life and how fortunate I was to meet mentors who shaped my career along my professional journey.
23.10.2025 00:25
Revisiting Medallion Architecture: Data Vault in Silver, Dimensional Modeling in Gold
How to Balance Flexibility and Performance in a Modern Data Platform
Data Vault vs. Dimensional Modeling vs. Medallion Architecture: when viewed through a modern enterprise data lens, these techniques interlock.
I break down how in Part 2 of my "Revisiting the Medallion Architecture" series.
17.10.2025 14:54
Fivetran and dbt form a strong foundation for modern data infrastructure, known for bringing simplicity to complex engineering workflows. That said, calling it "open" data infrastructure feels like a stretch.
17.10.2025 12:02
Should we update the definition of an "Analytical Engineer"?
13.10.2025 17:53
Engineering Growth: The Data Layers Powering Modern GTM
Building privacy-preserving pipelines that unify zero-, first-, second-, third-, and fourth-party data into a coherent GTM ecosystem.
As a data engineer, you can't treat zero-party (consent) and third-party (inferred) data the same way. This distinction is critical for building systems that are scalable, private, and trustworthy.
Here's my guide:
09.10.2025 00:35
Could be. Composable CDP has not gained significant market share, as identity resolution is a key component that is often proprietary.
04.10.2025 16:34
With Census already in with Fivetran, and with dbt, it will most likely evolve into a composable CDP.
04.10.2025 02:11
Airbnb: Real-Time Key-Value Store
Airbnb's next-gen key-value store supports real-time ingestion and bulk uploads with sub-second latency, powering feature stores and fraud detection.
Read the full story here: www.dataengineeringw...
02.10.2025 13:00
Data Engineering Weekly #239
The Weekly Data Engineering Newsletter
Read the full story here:
01.10.2025 13:00
Grab: Partner Gateway Metrics at Sub-Second Speed
Real-time partner analytics at scale is tough. Grab uses Apache Pinot, Kafka-Flink ingestion, partitioning, and Star-tree indexing to cut query latency to <300 ms, enabling efficient API monitoring and fast issue resolution.
01.10.2025 13:00
Data Engineering Weekly #239
The Weekly Data Engineering Newsletter
Read the full story:
30.09.2025 12:33
Netflix Muse: Scaling Analytics at Trillion-Row Scale
Netflix evolved its Muse architecture to handle huge datasets efficiently: HyperLogLog sketches, Hollow in-memory feeds, and Druid optimizations cut query latency by ~50% and reduced concurrency load.
30.09.2025 12:33
Data Engineering Weekly #239
The Weekly Data Engineering Newsletter
Link in bio:
29.09.2025 12:33