
Juan Ramos

@jjuanramos.bsky.social

95 Followers  |  43 Following  |  18 Posts  |  Joined: 25.11.2024  |  1.7481

Latest posts by jjuanramos.bsky.social on Bluesky

Hey, thank you so much for your answer - forgot to get back to you.

09.01.2025 16:52  |  👍 0  |  🔁 0  |  💬 1  |  📌 0

Absolutely - I’m thinking about that, too: how does one make sure that the analytics or data engineer has explored the data of the model they have built?
Asking for descriptive stats seems like a good way to do so, as even in new tables there aren’t that many “different-from-usual” columns

20.12.2024 19:40  |  👍 1  |  🔁 0  |  💬 0  |  📌 0

Thanks for the recommendation!! Currently in the process of implementing dbt-checkpoint in our CI to make sure that our style guide is followed

20.12.2024 15:11  |  👍 1  |  🔁 0  |  💬 0  |  📌 0

@nicoritschel.com hey Nico! I’m finding out about sidequery and it looks so cool.
Is it possible to query s3 parquet files? Thinking it might be a good way to explore data transformations that come out of dbt in a qualitative way

19.12.2024 08:17  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

I can think of adding a PR template that makes the developer say “yeah, I audited the data”, and that might be a good first practice. It’s just so contingent on the business requirements to make sure that the data can be useful

19.12.2024 08:15  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

How do folks handle developers validating data quality before opening a PR? I’m talking about the small things, such as not naming a string column with unrelated contents “something_id”, and such.
Impossible to check programmatically (afaik), but it has a huge impact on data consumption #databs

19.12.2024 08:15  |  👍 0  |  🔁 0  |  💬 3  |  📌 0

"I meet lots of idealistic folks who think that all they’re missing is money, or credentials, or access to the levers of power. More often, what they’re really missing is friends."

14.12.2024 15:39  |  👍 99  |  🔁 23  |  💬 1  |  📌 0
Is this a career? The trouble with “working in data.”

After reading Benn’s latest piece, open.substack.com/pub/benn/p/i..., I’m curious: what cliffs are there for data people to climb?
I can see growth roles, as pointed out by Abhi, product roles, finance roles.
I guess it’s really open, the harder question being: how does one get there? #databs

14.12.2024 11:42  |  👍 1  |  🔁 0  |  💬 0  |  📌 0

I’m more convinced each time that, at sales-driven companies, departments other than Sales end up becoming cynical about the product. There’s this sense that you’re bluffing the customer, as if in the process of zeroing in on selling more you expel the care and soul from the product

13.12.2024 13:05  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

Alright so now most of my followers are porn bots.
I wanted to become Bluesky famous but not like this 😫

04.12.2024 08:57  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

Anyone using the Activity Schema in practice? How does it hold up against better-known ones such as Dimensional Modeling?

Asking because we are starting a project from scratch and I’m curious about how it holds up

#databs

02.12.2024 17:18  |  👍 1  |  🔁 0  |  💬 1  |  📌 0

New job starting today. Super excited.

Wish me luck!

02.12.2024 07:25  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

So glad you did

02.12.2024 07:25  |  👍 1  |  🔁 0  |  💬 0  |  📌 0

Data Work would be akin to using the Hunting Horn in Monster Hunter: you’re not essential for the hunt, but you get to enjoy being the coolest while gifting buffs to the ones that can hit the hardest

28.11.2024 19:54  |  👍 2  |  🔁 0  |  💬 1  |  📌 0

well this looks super cool! congrats, will get it for sure :)

27.11.2024 09:59  |  👍 0  |  🔁 0  |  💬 1  |  📌 0

What I'm most excited about though is the focus on delivering value & the concrete ways to do so brought by people such as @cedricchin.bsky.social + XmR Charts & @abhisivasailam.bsky.social + Metric Trees.

"Identifying the levers of the business & helping pull them" is so hot for the data people

25.11.2024 17:52  |  👍 2  |  🔁 0  |  💬 0  |  📌 0

little update: uv added ~3s of overhead, so 4s it is.

Point remains, though: `sdf lint` seems an actual contender to displace sqlfluff in the linter category just because of how fast it is

25.11.2024 17:43  |  👍 0  |  🔁 0  |  💬 0  |  📌 0

and I say this without wanting to throw any shade at sqlmesh at all. They've made so many amazing design choices.
However, following JS's trend of using Rust (or Zig or whatever) as the tooling language seems like the right decision. Speed matters. It seems to unlock other cool stuff along the way too.

25.11.2024 17:29  |  👍 0  |  🔁 0  |  💬 1  |  📌 0

What's cool about sdf is that it is the first transformation tool in the data space that feels fast. A pity that it's not open source.

Sqlmesh, in comparison, is really slow. Running `uv run sqlmesh plan` in a small personal project takes ~7s, which is great compared to dbt, but still.

25.11.2024 17:29  |  👍 1  |  🔁 0  |  💬 1  |  📌 0
