Real-time contextual data is no longer optional — it’s foundational.
LLMs demand live, relevant signals.
We’re not building for dashboards anymore. We’re building for agents.
From batch to streaming.
From static to dynamic.
From reports to APIs.
#LLM #AI #DataOps
05.08.2025 18:53 — 👍 2 🔁 1 💬 0 📌 0
The 230th edition of Data Engineering Weekly is out!! Read more
www.dataengineeringw...
28.07.2025 04:06 — 👍 4 🔁 0 💬 0 📌 0
Catch up on all the latest happenings in the data industry this week's DEW edition
www.dataengineeringw...
21.07.2025 02:33 — 👍 1 🔁 0 💬 0 📌 0
The upstream pipeline design and data modeling techniques, leveraging a vector store like S3 Vectors, and the Lakehouse architecture will drive the next-gen evolution of data engineering.
16.07.2025 16:51 — 👍 1 🔁 0 💬 0 📌 0
I truly wish to switch to Claude Code, but the reliability sucks. In a second attempt, I got the error. Dropping it for now.
API Error (529 {"type":"error","error":{"type":"overloaded_error","message":"Overloaded"}}) · Retrying in 1 seconds… (attempt 1/10
@AnthropicAI
16.07.2025 13:44 — 👍 0 🔁 0 💬 0 📌 0
I keep meaning to read all the Atlassian Work Life blog posts, but ironically, they’re stuck in my backlog—and every time I open Jira, my work-life balance dies a little more. 🤷♂️
07.06.2025 11:04 — 👍 0 🔁 0 💬 0 📌 0
What do you think of DuckLake?
02.06.2025 22:01 — 👍 5 🔁 0 💬 0 📌 0
The 222nd edition of Data Engineering Weekly is out!!
www.dataengineeringweekly.com/p/data-engin...
02.06.2025 03:56 — 👍 6 🔁 0 💬 0 📌 0
🚨 Beware of quick fixes!
“Applying a half-baked theory is like using duct tape on a parachute—technically a fix, until gravity files a formal complaint.”
Scaling flawed ideas often scales their consequences too. Always write down your assumptions before shipping.
29.05.2025 22:44 — 👍 0 🔁 0 💬 0 📌 0
The 221st edition of Data Engineering Weekly is out with a fresh set of articles.
www.dataengineeringw...
26.05.2025 01:01 — 👍 1 🔁 1 💬 0 📌 0
Systems like Gluten/ Comet replaces executors with the native implementation, but keep the planner, cluster manager and task scheduler as it is of Spark.
21.05.2025 22:28 — 👍 3 🔁 0 💬 0 📌 0
The 219th edition of Data Engineering Weekly is out. Find the latest trends in AI & data by clicking here 👇🏼
www.dataengineeringw...
04.05.2025 22:42 — 👍 2 🔁 0 💬 0 📌 0
The 218th edition of Data Engineering Weekly is out, featuring the latest AI & Data stories.
www.dataengineeringw...
29.04.2025 00:52 — 👍 1 🔁 0 💬 0 📌 0
Chaos testing is a feature.
25.04.2025 21:34 — 👍 2 🔁 0 💬 0 📌 0
This expert panel could have been replaced with ChatGPT, Claude & Gemini. You’re welcome.
23.04.2025 13:33 — 👍 0 🔁 0 💬 0 📌 0
KIP-1150 in Apache Kafka is a big deal (Diskless Topics)
TL;DR KIP-1150 introduces Diskless Kafka topics that write directly to S3 instead of replicating between brokers. It literally reduces costs by 97% (from $1.8M to $20K annually for a 1GiB/s cluster) a...
“KIP-1150 introduces Diskless Kafka topics that write directly to S3 instead of replicating between brokers.”
“Even using the expensive S3 Express (which a week ago lowered its prices by more than 50%) still saves 73% compared to traditional Apache Kafka.”
/ht @ananthdurai.bsky.social
21.04.2025 03:14 — 👍 18 🔁 9 💬 0 📌 0
The 217th edition of Data Engineering Weekly is out. Read more here
www.dataengineeringw...
21.04.2025 02:27 — 👍 4 🔁 0 💬 0 📌 0
The problem statement is real, and I'm curious to know what Bauplan's secret sauce is to solve this problem.
16.04.2025 21:16 — 👍 1 🔁 0 💬 0 📌 0
This looks like an exciting proposal
cwiki.apache.org/con...
16.04.2025 21:15 — 👍 0 🔁 0 💬 0 📌 0
The 216th edition is fresh and out!! Read more here
www.dataengineeringw...
14.04.2025 00:48 — 👍 2 🔁 0 💬 0 📌 0
Read more about composable data architecture in my recent blog.
www.dataengineeringw...
11.04.2025 20:44 — 👍 2 🔁 0 💬 1 📌 0
The 215th edition of Data Engineering Weekly is out!!
www.dataengineeringw...
06.04.2025 23:13 — 👍 1 🔁 0 💬 0 📌 0
On April 1st, Data Engineering Weekly officially acquired datashitposting.com, pinoner in data shit posting. As a bonus of the acquisition, please enjoy this shit post.
02.04.2025 02:29 — 👍 4 🔁 0 💬 0 📌 0
The 214th edition of Data Engineering Weekly is out!!
www.dataengineeringw...
30.03.2025 22:55 — 👍 7 🔁 0 💬 0 📌 0
I don’t know about others, but manufacturing and operating code are two parts of the software supply chain. We accelerated the manufacturing process, yielding mixed-quality code impacting operational efficiency.
27.03.2025 10:20 — 👍 1 🔁 0 💬 0 📌 0
The 213th edition of Data Engineering Weekly is out!!. Read more
www.dataengineeringw...
24.03.2025 02:42 — 👍 5 🔁 0 💬 0 📌 0
From WSDL to Claude MCP, creating structured, documented interfaces for remote services has been a decades-long problem. Both try to answer the question: "What can this service do?" "How do I talk to it?" and "What should I expect in return?"
19.03.2025 21:32 — 👍 0 🔁 0 💬 0 📌 0
bolt.new
Prompt, run, edit & deploy web apps
I tried to build a mobile app in react native using repl.it and bold.new; Both failed to build the project with react native dependency. It seems no level of intelligence can solve the dependency hell.
17.03.2025 20:55 — 👍 2 🔁 0 💬 0 📌 0
Applied economist working in international development policy
🇹🇴🇸🇧🇻🇺🇫🇯🇦🇺🇲🇲 (etc),
Mostly memes.
Sometimes post about #publicpolicy, research, data science, #PFM and #Rstats
Economist | Independent Consultant
All views my own
👨💼fine time tracking software at nokotime.com
📸 I killed Flash (with script.aculo.us)
🔭 #astrophotography - https://lightfrom.space
🕹️ #retrocomputing
🐈 #cats
Ally. He/him.
Creator, Founder and CEO of @tigerbeetle.com — the financial transactions database designed for mission critical safety and performance.
Fast, Fresh, Actionable Insights at Scale! From the creators of
Apache Pinot. We're growing - Join the movement!
Software Development, Consulting, Staff Augmentation, & Training | #Golang #Rust #Docker #Kubernetes #Blockchain #Terraform
Compiler engineer, eschatologist.
Data/tech guy. Also at https://fosstodon.org/@shrik
Principal Architect @posit.co, GP Composed Ventures, Co-founder Voltron Data. Open source: Apache Arrow, pandas, Ibis. "Python for Data Analysis" book
General Partner of Essence VC (www.essence.vc), all things infra software
Runs oss startup podcast and the infra pod
Hacker News articles with a score over 100.
with the help of https://atproto.blue/ & https://atproto.rocks/
(maintained by @xrkia.org)
Free-range computer scientist living in Evanston, Illinois. I wrote some Python books. If you want to talk code, take a CS course https://www.dabeaz.com/courses.html. I'm mainly here for dogs, bikes, trombones, and other random stuff.
A community of folks looking to put AI and ML into Production
Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD. https://restate.dev
On Bluesky to find Data Twitter
Gain technology and business knowledge and hone skills with learning resources created and curated by O'Reilly experts
PostgreSQL re-engineered for multi-tenant apps
🇳 http://thenile.dev
🌟https://git.new/nile
📹 http://youtube.com/@niledatabase
💬http://discord.gg/8UuBB84tTy
ParadeDB is a modern Elasticsearch alternative built on Postgres. Built for real-time, update-heavy workloads.
⭐ Star us: http://github.com/paradedb/paradedb