Adrian Brudaru 's Avatar

Adrian Brudaru

@datateam.bsky.social

Data engineer & Cofounder @dlthub. Building out the tooling i wish i had.

2,204 Followers  |  1,300 Following  |  152 Posts  |  Joined: 27.10.2024  |  1.4384

Latest posts by datateam.bsky.social on Bluesky

Weโ€™ll go beyond basic ETL:

- Handling nested & evolving schemas
- Accelerating pipeline creation with LLMs
- Moving from scripts to reliable ingestion workflows
- Validating data and schema changes using the dlt dashboard and dlt MCP

12.02.2026 18:05 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
From APIs to Warehouses: AI-Assisted Data Ingestion with dlt ยท Luma This hands-on workshop focuses on building reliable data ingestion pipelines to data warehouses (for example, Snowflake) using dlt (data load tool), enhancedโ€ฆ

From APIs to Warehouses ๐Ÿ“ฆ

On Feb 17 (16:30 CET), together with DataTalks.Club, Aashish Nair will walk through building end-to-end ingestion pipelines with dlt, from raw APIs to production-ready warehouse loads.

Register here ๐Ÿ‘‡

12.02.2026 18:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
The Last Mile is Solved by Slop I didn't vibe-build a product. I wrote a messy scaffold that runs a pipeline, grabs the schema, and forces an agent to build a star schema. It works shockingly well.

What if dimensional modeling didnโ€™t mean hours of boilerplate SQL?

We built an AI workflow that turns raw data into semantic models in minutes, powered by 20 questions.

Rethinking data transformation ๐Ÿ‘‡

12.02.2026 16:40 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Whoโ€™s speaking in Berlin ๐Ÿ‘‡

- Francesco Mucio: integrating 20+ APIs
- Bijan Soltani: real-world analytics
- Nemanja Bibic: ingestion for AI memory
- Ken Schrรถder: analyst-friendly ingestion w/ dlt on AWS
- Violetta Mishechkina: AI agents, data quality & whatโ€™s next

See you there ๐Ÿš€

11.02.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
dltHub Community Meetup in Berlin with Cognee, Untitled Data Company, Gemma Analytics & Babbel ยท Luma Join us for the dltHub Community Meetup in Berlin. This evening is for curious minds who want to learn more about what weโ€™re building at dltHub. Weโ€™ll share aโ€ฆ

Berlin, itโ€™s meetup time!

Join us for the dltHub Community Meetup, an evening of real-world demos, lessons learned, and conversations with builders.

๐Ÿ“ Rosebud, Berlin
๐Ÿ“† Feb 17 | 18:00 โ€“ 21:00

Curious about what weโ€™re building at dltHub? Come by ๐Ÿ‘‹

11.02.2026 21:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Production pipelines donโ€™t fail loudly, they drift.

Feb 12 ยท 16:00 CET - Online
Hands-on workshop on operating pipelines in production:
โ€ข schema changes
โ€ข backfills
โ€ข CI/CD
โ€ข long-term reliability

Register โ†’ https://community.dlthub.com/workshop-maintaining-servicing-production-data-pipelines

10.02.2026 17:27 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ’˜ Data Valentine Challenge started today.

5 days. 5 live data sessions with:ย 
@datarecce.bsky.social, Greybeam, @databasetycoon.bsky.social, @bauplan.bsky.social

Our slot: Wednesday โ†’ Pipelines That Donโ€™t Ghost You

Feb 9โ€“13 | 9am PT | Online

https://reccehq.com/data-valentine-week-challenge/

09.02.2026 22:16 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Create, debug & maintain dlt pipelines in production - dltHub Workspace Go from writing pipeline code to ingesting data and delivering reports via Notebooks - all in one flow. Discover over 10,100 REST API data sources today.

โ€ข Markets (Kalshi, Polymarket, DEX Screener)
โ€ข AI platforms (fal, Jina AI, Kie AI)
โ€ข Macro data (World Bank, Finnhub, Alpha Vantage, Frankfurter)
โ€ข Entertainment (PokรฉAPI, OpenF1)

Explore the contexts ๐Ÿ‘‡

09.02.2026 10:52 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Januaryโ€™s Rising Stars in the dlt ecosystem ๐Ÿ‘‡

Builders are vibe coding pipelines around real-time markets, AI dev platforms, macro data, and more.

Whatโ€™s trending right now:

09.02.2026 10:52 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
3.7x Faster Pipelines: Benchmarking Arrow & ADBC vs. SQLAlchemy for EL Moved 5M rows from DuckDB to MySQL 3.7x faster, reducing time from 344s to 92s by switching from SQLAlchemyโ€™s row-by-row path to Arrow + ADBCโ€™s columnar pipeline.

Arrow + ADBC + dlt just broke the EL speed limit.

5M rows DuckDBโ†’MySQL:
SQLAlchemy 344s
Arrow + ADBC 92s (3.7ร— faster)

One line of code. Columnar end-to-end.

Benchmarks:

03.02.2026 18:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
The Builder: Outliving the Modern Data Stack We were told that democratization meant 'safety,' but all we got were expensive cages. The era of the SaaS hostage is ending; the era of the sovereign Builder has begun.

The Modern Data Stackโ„ข was a comfy lie that turned data engineers into passive consumers, now the bill's due, market's schisming into vendor-locked hell vs builder freedom.

Read the Builder's Manifesto:

28.01.2026 15:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ“ Amsterdam
๐Ÿ—“ Jan 29 ยท 3โ€“6 PM (GMT+1)

Agenda highlights:
โ€ข Vision โ€” Matthaus Krzykowski (@matthausk.bsky.social) & Julian Alves | dltHub
โ€ข Demos โ€” Vincent D. Warmerdam (@koaning.bsky.social) | Marimo, Mehdi Ouazza (@mehdio.com) | MotherDuck
โ€ข First impressions โ€” Thomas in't Veld | Tasman Analytics

24.01.2026 13:08 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Together with @motherduck.com, @duckdb.org, and marimo, weโ€™re bringing together a toolkit built for full-stack data developers:
๐Ÿ”น ingest with dlt
๐Ÿ”น query fast
๐Ÿ”น serve instantly

Built for builders, not enterprise overhead.

24.01.2026 13:08 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
dltHub โค๏ธ Marimo โค๏ธ MotherDuck ยท Luma dltHub and Marimo and MotherDuck are having a child. What are looking at? dltHub provides the ELT, runtime, and execution layer, turning production dataโ€ฆ

Want to influence the tools you use every day?

Weโ€™re hosting a Builderโ€™s Data Stack meetup focused on developer flow, fast iteration, and shaping the roadmap with the community.

Pull up a chair:

24.01.2026 13:08 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
The Plutonium Protocol: Engineering Safety for the LLM Intern Era The โ€œdata is oilโ€ era is over. With LLMs, data is plutonium: powerful, toxic. Shift left and secure the reactor with 5 quality pillars.

An AI agent ignored a code freeze, wiped a prod DB, then hallucinated data to cover it up.

Data quality in the LLM era isnโ€™t optional, itโ€™s a safety problem.

We call it data as plutonium - powerful and dangerous without containment.

21.01.2026 18:55 โ€” ๐Ÿ‘ 2    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐ŸŽค Call for Speakers
Using dlt in your projects? Weโ€™re opening the mic to the community for short 10โ€“15 min talks sharing:
๐Ÿ› ๏ธ real use cases
๐Ÿ“š lessons learned

If this sounds like you, reach out via the event page.

Letโ€™s learn from each other in Paris!

15.01.2026 15:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
dlt Paris Community Meetup #2 with dltHub & Polycea ยท Luma Join us for an evening of community and conversation! Co-hosted by dltHub and Polycea, this meetup brings people together for short talks and networking withโ€ฆ

๐Ÿ‡ซ๐Ÿ‡ท Paris data folks ๐Ÿ’›

Weโ€™re hosting a dlt Community Meetup in Paris on Feb 4th (6โ€“9 PM) together with Polycea.

A community meetup focused on practical takeaways, shared learnings, and conversations with people using dlt hands-on.

Join us here:

15.01.2026 15:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Data quality is the vegetables of data engineering: everyone agrees it's important, but nobody wants to implement it.

To increase your ๐šŸฬถ๐šŽฬถ๐šฬถ๐šŽฬถ๐šฬถ๐šŠฬถ๐š‹ฬถ๐š•ฬถ๐šŽฬถ ฬถ๐š’ฬถ๐š—ฬถ๐šฬถ๐šŠฬถ๐š”ฬถ๐šŽฬถ test coverage, check out these 11 delicious recipes.

https://dlthub.com/blog/practical-data-quality-recipes-with-dlt

11.01.2026 17:07 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This gives you a self-healing system that keeps your semantic layer in sync as your data changes.

Huge thanks to Julien Hurault and Hussain Sultan for their contributions.

07.01.2026 21:31 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The blueprint:
1๏ธโƒฃ ingest with dlt to capture metadata
2๏ธโƒฃ use LLMs to infer semantic relationships
3๏ธโƒฃ auto-generate the BI layer

We went from a raw Sakila DB to a governed semantic layer automatically. Moving beyond manual mappings to automate the context itself.

07.01.2026 21:31 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Autofiling the Boring Semantic Layer: From Sakila to Chat-BI with dltHub Build one semantic model and reuse it across APIs, chatbots, and apps. Let LLMs handle the tedious mapping so you can ship data products that quietly just work.

Semantic layers are the most important "boring" part of data.ย 

Building them manually is a bottleneck for Chat-BI. dlt is changing the game by "autofilling" the metadata gap, turning months of modeling into minutes of automation.ย 

07.01.2026 21:31 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Most data quality failures happen because checks come too late.

dlt + dltHub treat quality as a lifecycle: in-flight checks, safe staging, and production monitoring.

Catch issues earlier, fix less and trust your data more.

docs: https://dlthub.com/docs/general-usage/data-quality-lifecycle

04.01.2026 19:28 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Dlt+ClickHouse+Rill: Multi-Cloud Cost Analytics, Cloud-Ready FinOps Made Easy: A Starter Repo to Oversee Cloud Costs from Different Hyperscalers.

@ssp.shย drops another great deep-dive, a declarative data stack dlt + @clickhouse.comย + @rilldata.comย that simplifies tracking cloud spend across multiple platforms.

A complete cloud-native FinOps setup in minutes.

๐Ÿ”— Explore the full walkthrough

05.12.2025 19:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Snowflake for Startups Pitch Night ยท Luma Join Snowflake for Startups and Proving Ground for an evening of innovation and networking at the SVAI Hub. This pitch night is a chance to see some of the Bayโ€ฆ

We're thrilled to take the stage at the Snowflake for Startups Pitch Night!

Join us at the Silicon Valley AI Hub to see how dlt's code & LLM-first infra-native data ingestion library is the fastest way to get compliant data into Snowflake.

05.12.2025 10:29 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Convergence: The Anti-Entropy Engine Most LLM runs donโ€™t fail. They converge fast, and the secret isnโ€™t smarter models but better scaffolds that guide the work instead of forcing it.

AI workflows break on LLM updates?

Our Anti-Entropy pattern fixes this with declarative scaffolds, error loops, and dashboards for antifragile convergence.

Saves time, acts like an invisible senior engineer.

Ditch crutches, skip the โ€˜Deer in Headlightsโ€™ panic!

28.11.2025 19:29 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Each integration came together quickly: Stripe with just a token, AWS CUR from S3, and GCP billing straight from BigQuery.

This powerful tool stack combined with dlt's connectors, @duckdb.orgย and @rilldata.com's interactive dashboards, deliver a real-time, consolidated analytics view.

25.11.2025 16:00 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image

At the core is a single incremental pipeline powered by dlt, loading everything into Parquet & DuckDB for fast analysis.

Handling auth, pagination, and schema changes, the pipeline remains simple end to end, and because itโ€™s fully pluggable, adding Azure or Cloudflare is easy.

25.11.2025 16:00 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Multi-Cloud Cost Analytics: From Cost-Export to Parquet to Rill Learn how to unify AWS and GCP costs with revenue data in a single dashboard. Step-by-step guide using dlt, Parquet, and Rill. Clone and run immediately.

To overcome complex cloud cost analysis, @ssp.shย showed how dlt can ingest and normalize AWS, GCP, and Stripe data into a unified cost dashboard.

The result is a single view for ROI analysis powered by a simple ELT.

Check out the full solution! ๐Ÿ‘‡

25.11.2025 16:00 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Motherduck Europe & dlt DuckLake support MotherDuck lands in Europe with serverless DuckDB warehousing. dlt adds DuckLake support, giving EU teams a fast, modern data stack.

European data teams can enjoy lightning-fast analytics & production-ready pipelines with @motherduck EU region fully available.

Choose loading via MotherDuck or dlt's native DuckLake destination โ†’ support for Postgres, DuckDB, SQLite, MySQL.

13.11.2025 15:23 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Ever launched a data pipeline & wondered whatโ€™s happening under the hood? The dlt Workspace Dashboard gives real-time visibility into pipeline state, schemas, live dataset queries, run traces โ†’ all in one web app. Built withย 
@marimo.io.ย 
Try it now: ๐Ÿ‘‰ https://dlthub.com/docs/general-usage/dashboard

12.11.2025 15:39 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@datateam is following 20 prominent accounts