How big should your data team be?
Data teams are often oversized. A company of 200 people rarely needs 15+ data staff, usually 5% of org size is enough
dataactionmentor.com/knowledge-ba...
@jayatillake.bsky.social
Writer @ davidsj.substack.com
How big should your data team be?
Data teams are often oversized. A company of 200 people rarely needs 15+ data staff, usually 5% of org size is enough
dataactionmentor.com/knowledge-ba...
Try to find a non-traditional role that is more suited to a future where engineering is very cheap. If you have an idea, try build it yourself. The experience of trying to found is more valuable than employee experience now and even more so in the coming years.
28.09.2025 08:53 β π 2 π 0 π¬ 0 π 0The amount you love someone is proportional to how often you Ghiblify their pictures.
29.07.2025 16:59 β π 1 π 0 π¬ 0 π 0This week I look at agents.
I think this is a new way to build where we donβt intentionally build code-based software.
open.substack.com/pub/davidsj/...
BERT and ERNIE! π
tracking.tldrnewsletter.com/CL0/https:%2...
I don't usually share photos of my family on social media for good reason, but I'm happy to share these ones!
20.06.2025 16:00 β π 5 π 0 π¬ 0 π 0This post encapsulates how I feel about the current state of LLMs and doomers etc. Really great read:
fly.io/blog/youre-a...
So when I've attended Snowflake summit before, I've usually written a blog post talking about the new features released, etc. Is someone going to do that this year, given I didn't go? π
#datasky #databs
It is possible to build machine learning systems which punch up instead of punching down.
06.06.2025 01:52 β π 691 π 128 π¬ 9 π 3Got a cool story about something in the data engineering space? You should π― submit it as a talk to Current 2025 in New Orleans π
Do it! Now! CfP is open until 15th June.
sessionize.com/current-2025...
(Pro-tip: you only need an abstract at this point; writing the talk can be later π
)
#dataBS
This is genuinely one thing you can rely on AI for.
23.05.2025 17:16 β π 2 π 0 π¬ 1 π 0It was actually very impressive. Lots of stuff I want to try.
21.05.2025 20:11 β π 1 π 0 π¬ 0 π 0At the London Data Practitioners Meetup with @pedramnavid.com @jayatillake.bsky.social @rittmananalytics.bsky.social and the London Dagster community
14.05.2025 17:15 β π 2 π 1 π¬ 0 π 0I also think people donβt use the tags as we have found each other. I almost exclusively use the popular with friends feed.
14.05.2025 07:06 β π 2 π 0 π¬ 0 π 0Itβs not but you donβt have to keep declaring ctes. May be able to have partial queries too.
14.05.2025 06:54 β π 3 π 0 π¬ 0 π 0Theyre still here just quieter than at the start. More of them though
14.05.2025 06:51 β π 2 π 0 π¬ 1 π 0Doctorβs orders π«‘
27.04.2025 12:54 β π 4 π 0 π¬ 1 π 0I still think this is the biggest prize in AI. If Siri could actually do most things you do on a phone manually...
9to5mac.com/2025/04/22/s...
Haha yes but he fits the bill.
23.04.2025 07:10 β π 0 π 0 π¬ 0 π 0@petefein.bsky.social
22.04.2025 22:41 β π 0 π 0 π¬ 1 π 0I wonder what the limit difference between CSV and Parquet would be under real conditions, where most queries only need a tiny subset of large datasets. You could probably handle >petabyte datasets on that EC2 machine with good partitioning of Parquet or using Iceberg.
22.04.2025 22:37 β π 3 π 0 π¬ 0 π 0Well, if it works, the real engineers can tidy it up or more likely do nothing and talk about code standards.
22.04.2025 12:09 β π 1 π 0 π¬ 1 π 0Has anyone tried Llama 4 Maverick yet? How big a machine does it need to run locally?
@simonwillison.net
Looks like Nintendo became the best at console FPS.
02.04.2025 13:30 β π 0 π 0 π¬ 0 π 0Oh no! Iβve been enjoying bluesky for the data stuff but can imagine that itβs swung very radically left on other topics.
01.04.2025 08:41 β π 0 π 0 π¬ 0 π 0@windsurfai.bsky.social
24.03.2025 16:07 β π 1 π 0 π¬ 0 π 0I've seen many blog posts and social posts by these supposed true artisans saying that they tried this method, and the output was subpar.
Well, maybe it would have taken just as long if you had just written the code, but for the rest of us, we now have an option to build without you.
Once again, we've devised a derogatory name for something many of us are doing: "Vibe coding".
Just like "Citizen Data Scientist", "Excel Data Analyst", and many other terms made to belittle by the supposed true artisans that came before.
open.substack.com/pub/davidsj/...
yeah but was there coffee down there, and if so was it any good?
17.03.2025 23:22 β π 4 π 0 π¬ 1 π 0