Last Friday I published my most personal Substack post so far. I tell the tale of my job hopping in Data Engineering early in my career.
Check it out:
thedatasitter.substack.com/p/my-history...
@thedatasitter.com.bsky.social
Taming Data Tantrums. Senior Data Engineer @ Caylent (2024 AWS Partner of the Year) - subscribe to https://thedatasitter.substack.com/welcome 🇧🇷 - opinions are my own
Let's talk about Partitioning and Bucketing as a performance strategy for Spark!
open.substack.com/pub/thedatas...
Friday's post will be about the Architect role. Hope you guys like it!
12.03.2025 11:58
After some good years of job hopping, today is the first time I have a one-year anniversary at a company. Gotta say, I love my job at Caylent
12.03.2025 11:58
How can AI kill software engineering jobs if it can't even kill SQL? lol
11.03.2025 16:45
How do you guarantee you won't have any bad rows tainting your source of truth?
Your answer is here:
thedatasitter.substack.com/p/netflixs-w...
Nevertheless, the spirit of the division is the same. You'll have your go-to production data at the end - the gold or analytics layer.
These are supposed to be the most trusted, up-to-date, clean, and good-looking tables.
AWS names them “raw, staging, and analytics.”
Databricks likes the medallion convention of “bronze, silver, and gold.”
In some scenarios, you would add a layer for events. Some call them “transient,” others “landing.”
If you put 10 data engineers in a room and ask them the correct naming convention for the layers of a data lake, odds are you'll receive 11 different answers.
11.03.2025 13:56
Also, if you missed it, this week I wrote about Parquet files and vectors. You can take a look at thedatasitter.com
06.03.2025 12:09
Just finished preparing The Data Sitter's Friday post. I'll discuss memoir books. Hope it doesn't feel off, as it is a tech blog
06.03.2025 12:09
“My kid never got vaccinated and he’s fine”
The kid:
The most “going to the gym in a small Brazilian town” pic you’re going to see today:
28.02.2025 17:00
preach
28.02.2025 14:16
So far, this has been the book of the year for me. I had no idea how messy Elon’s management of Twitter/X is.
There was a time when he ended contracts with the companies that cleaned the Twitter offices lol
Employees had to bring toilet paper from home or go to bathrooms in nearby coffee shops
The only thing that can pacify the world nowadays
27.02.2025 16:55
I've tried and failed to study math for ML for 3 years, and finally, now that treatment has started, I can concentrate on these complex topics and actually learn lol
27.02.2025 16:37
Earlier this year I was diagnosed with ADHD. I started medication w/ Concerta on Monday and it's been life-changing.
27.02.2025 16:37
What's the best `hello_world` for Bluesky?
27.02.2025 16:33