Kirsten Lum

@machsci.bsky.social

🌲🌲 Applied ML/AI, data science, MLOps | Wife of 1, mom of 2 | Co-Founder and CTO of http://storytellers.ai python 🐍 AI 🤖 cloud ☁️ data 📊 I also talk about Jesus here: @itskirstenlum.bsky.social

8,073 Followers  |  208 Following  |  451 Posts  |  Joined: 01.07.2023

Posts by Kirsten Lum (@machsci.bsky.social)

Faaaaascinating. Looking forward to your analysis of the results. Crossing my fingers for a Simpson's paradox

25.05.2025 13:25 - 👍 2    🔁 0    💬 0    📌 0

Those checks are run in-environment (rather than passing raw data to a service). That part was hard

11.02.2025 18:21 - 👍 0    🔁 0    💬 0    📌 0

Yes! We developed a way to perform statistical and natural language checks to identify joinable columns in real-world (that is, fubared) data. Where brute-forcing it would cost ~$100ks and take weeks of runtime, we do it for ~$10 to $100s, in minutes or hours

11.02.2025 18:19 - 👍 1    🔁 0    💬 1    📌 0

Yes, it is that, and it is beyond that! It works even in cases where your pk/fk are malformed: mismatched types, mismatched column names, added prefixes/suffixes, etc.

11.02.2025 18:11 - 👍 1    🔁 0    💬 1    📌 0
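
To make the idea in the two posts above concrete, here is a minimal sketch of one kind of statistical check: normalize key values, then measure how much of one column's values appear in another. It is not the tool described in the posts; it compares every pair of columns, which is exactly the brute-force approach the posts say is too expensive at scale. The names `normalize_key`, `containment`, `candidate_joins`, the 0.8 threshold, and the sample tables are all assumptions made for illustration.

```python
import re


def normalize_key(value):
    """Coerce a raw key to a comparable string.

    Assumption: keys carry a numeric core, so 101 and "CUST-0101" should match.
    Values with no digits fall back to a trimmed, lowercased string.
    """
    text = str(value).strip().lower()
    match = re.search(r"\d+", text)
    if match:
        # Drop leading zeros so "0101" and "101" compare equal.
        return match.group().lstrip("0") or "0"
    return text


def containment(left, right):
    """Fraction of distinct normalized left values that also appear in right."""
    left_keys = {normalize_key(v) for v in left if v is not None}
    right_keys = {normalize_key(v) for v in right if v is not None}
    if not left_keys:
        return 0.0
    return len(left_keys & right_keys) / len(left_keys)


def candidate_joins(tables, threshold=0.8):
    """Score every cross-table column pair; keep those at or above threshold."""
    columns = [
        (table, column, values)
        for table, cols in tables.items()
        for column, values in cols.items()
    ]
    found = []
    for left_table, left_col, left_vals in columns:
        for right_table, right_col, right_vals in columns:
            if left_table == right_table:
                continue
            score = containment(left_vals, right_vals)
            if score >= threshold:
                found.append(
                    (f"{left_table}.{left_col}", f"{right_table}.{right_col}", score)
                )
    # Highest-scoring candidates first.
    return sorted(found, key=lambda item: -item[2])


# Hypothetical sample data: mismatched types, names, and a prefix on one side.
tables = {
    "orders": {
        "customer_id": [101, 102, 103, 103],
        "total": [9.5, 20.0, 3.25, 7.0],
    },
    "customers": {
        "cust": ["CUST-0101", "CUST-0102", "CUST-0103", "CUST-0104"],
        "name": ["Ann", "Bo", "Cy", "Di"],
    },
}

for left, right, score in candidate_joins(tables):
    print(f"{left} -> {right}: {score:.2f}")
```

The score is directional on purpose: every order points at a known customer, but not every customer has an order, which is the usual shape of a foreign key pointing at a primary key.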

Cube gets it. We SHOULD be able to build a semantic layer in the data platform but we can’t (not one that actually helps the analytics workflow anyway). Thus these tools that fill the gap!

11.02.2025 17:51 - 👍 1    🔁 0    💬 2    📌 0

I think it’s easy for math folks because math operates logically, but what I think they miss is reality operates logically too!

11.02.2025 17:49 - 👍 1    🔁 0    💬 0    📌 0

Well grain of salt, I managed to get out of 100% of math classes in my undergrad. But even among math heavy degree havers like engineers, when they’d ask me how I was able to understand/convince across disciplines/levels, I’d tell them to read the textbook from my logic class!

11.02.2025 17:48 - 👍 1    🔁 0    💬 1    📌 0

Ah yes, going the other way! Uhhhh I’m going to make a mental note that we could probably reverse this process 🤔🤔

11.02.2025 17:12 - 👍 0    🔁 0    💬 0    📌 0

This exact impulse was what inspired this tool: bsky.app/profile/mach...

11.02.2025 15:51 - 👍 1    🔁 0    💬 1    📌 0

This dashboard could have been a Google sheet

11.02.2025 15:41 - 👍 3    🔁 0    💬 0    📌 0

Formal logic was the most useful course I took in college. Hard to explain that there is a style of thinking that helps you to quickly and precisely understand and explain what’s going on in any situation.

11.02.2025 14:56 - 👍 2    🔁 0    💬 1    📌 0

A much more hopeful picture!

01.02.2025 15:17 - 👍 3    🔁 0    💬 0    📌 0

One of my professional “worry stones” is that the orgs who were able to set up data infrastructure tend to be bigger for-profits. If AI is revolutionary, that means orgs in education, non-profits, etc. are left behind. It still takes way too long to set up a basic DW. Wish I knew the solution

01.02.2025 14:51 - 👍 9    🔁 0    💬 2    📌 0

*rolls up sleeves* on it!

31.01.2025 19:52 - 👍 2    🔁 1    💬 0    📌 0

Finally able to prove I haven’t been just complaining this whole time!! 😭

31.01.2025 19:48 - 👍 1    🔁 0    💬 0    📌 0

And not a dumb question 😊😊

31.01.2025 19:45 - 👍 1    🔁 0    💬 0    📌 0

What's the downside to doing both of these in SQL?

28.01.2025 17:12 - 👍 0    🔁 0    💬 2    📌 0

Trying to make each and every transform efficient in the micro leads to inefficiencies in maintenance in the macro

27.01.2025 21:46 - 👍 4    🔁 1    💬 1    📌 0

That is, by default, do the transform in SQL, and only think about whether to do it in the database or the client if you run into a blocker doing it in SQL.

No need to litigate every transform: scarcity mentality in an era of compute riches

27.01.2025 21:49 - 👍 0    🔁 0    💬 1    📌 0
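
As a rough illustration of the "SQL by default" rule in the post above, the sketch below pushes an aggregation into the database and leaves the client to do nothing but fetch rows. The `orders` table, its columns, and the `revenue_by_region` helper are hypothetical, and an in-memory SQLite database stands in for a real warehouse.

```python
import sqlite3

# In-memory SQLite stands in for the data warehouse (assumption for the sketch).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("west", 10.0), ("west", 5.0), ("east", 7.5)],
)


def revenue_by_region(connection):
    """Do the whole transform in SQL; the client only receives finished rows."""
    query = """
        SELECT region, SUM(amount) AS revenue
        FROM orders
        GROUP BY region
        ORDER BY region
    """
    return connection.execute(query).fetchall()


rows = revenue_by_region(conn)
print(rows)
```

Only if a transform like this hits a real blocker in SQL would it be worth debating where else it should run.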

Does using the reports the service provides count? Like the GA dashboards inside GA? Or are you thinking more like grabbing some SQL/Tableau templates to run on the data in the DW?

22.01.2025 16:42 - 👍 2    🔁 0    💬 1    📌 0

Pivots 🥲🥲

22.01.2025 04:43 - 👍 2    🔁 0    💬 1    📌 0

And isn’t it wild that on the receiving side of the message, appreciation is one of the greatest gifts?

Feels small to give, but immense to receive

21.01.2025 18:34 - 👍 1    🔁 0    💬 1    📌 0

Sorry I literally just saw your name got autocorrected @vickiboykis.com. I assume this is a similar experience as when people call me Kristin, and I am proportionately appalled/apologetic.

20.01.2025 15:41 - 👍 1    🔁 0    💬 0    📌 0

[GIF] ALT: a man in a suit and tie is dancing with his hands in the air

19.01.2025 05:23 - 👍 1    🔁 0    💬 0    📌 0

Yes! Like a daydream poking into reality

19.01.2025 05:04 - 👍 0    🔁 0    💬 1    📌 0

Hmm like versus an app? For the audience I had in mind (like <12), I would say it’s at least preferable!

19.01.2025 05:02 - 👍 0    🔁 0    💬 0    📌 0

Yess this is exactly the vibe that I had in mind. Something like this crossed with Teenage Engineering for the paperback sci fi vibe

19.01.2025 04:59 - 👍 1    🔁 0    💬 0    📌 0

and maps analog inputs/outputs.

19.01.2025 00:01 - 👍 0    🔁 0    💬 1    📌 0

Startup idea: a toy that looks like a generic hand-held computer with a few analog inputs/outputs (buttons, switches, lights) and a screen. Kid can use voice to describe an adventure they want to have (exploring a jungle, space rescue mission, vet on Mars). Toy generates a relevant UI onscreen…

19.01.2025 00:01 - 👍 9    🔁 0    💬 5    📌 0