Kirsten Lum's Avatar

Kirsten Lum

@machsci.bsky.social

๐ŸŒฒ๐ŸŒฒ Applied ML/AI, data science, MLOps | Wife of 1, mom of 2 | Co-Founder and CTO of http://storytellers.ai python ๐Ÿ AI ๐Ÿค– cloud โ˜๏ธ data ๐Ÿ“Š I also talk about Jesus here: @itskirstenlum.bsky.social

7,934 Followers  |  208 Following  |  451 Posts  |  Joined: 01.07.2023  |  1.6295

Latest posts by machsci.bsky.social on Bluesky

Faaaaascinating. Looking forward to your analysis of the results. Crossing my fingers for a Simpsons paradox

25.05.2025 13:25 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Those checks are run in-environment (rather than passing raw data to a service). That part was hard

11.02.2025 18:21 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Yes! We developed a way to perform statistical and natural language checks to identify joinable columns in real-world (that is, fubared) data. If brute-forcing it would be ~$100ks and weeks of runtime, we do it in ~$10 - $100s in minutes or hours

11.02.2025 18:19 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Yes โ€” it is that, and it is beyond that! Even in cases where your pk/fk are malformed, like mismatched types, mismatched column names, contain prefixes/suffixes, etc.

11.02.2025 18:11 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Cube gets it. We SHOULD be able to build a semantic layer in the data platform but we canโ€™t (not one that actually helps the analytics workflow anyway). Thus these tools that fill the gap!

11.02.2025 17:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

I think itโ€™s easy for math folks because math operates logically โ€” but what I think they miss is reality operates logically too!

11.02.2025 17:49 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Well grain of salt, I managed to get out of 100% of math classes in my undergrad. But even among math heavy degree havers like engineers, when theyโ€™d ask me how I was able to understand/convince across disciplines/levels, Iโ€™d tell them to read the textbook from my logic class!

11.02.2025 17:48 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Ah yes, going the other way! Uhhhh Iโ€™m going to make a mental note that we could probably reverse this process ๐Ÿค”๐Ÿค”

11.02.2025 17:12 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This exact impulse was what inspired this tool: bsky.app/profile/mach...

11.02.2025 15:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This dashboard could have been a Google sheet

11.02.2025 15:41 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Formal logic was the most useful course I took in college. Hard to explain that there is a style of thinking that helps you to quickly and precisely understand and explain whatโ€™s going on in any situation.

11.02.2025 14:56 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

A much more hopeful picture!

01.02.2025 15:17 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

One of my professional โ€œworry stonesโ€ is that the orgs who were able to set up data infrastructure tend to be bigger for-profits. If AI is revolutionary, that means orgs like education, non-profits, etc are left behind. It still takes way too long to set up a basic DW. Wish I knew the solution

01.02.2025 14:51 โ€” ๐Ÿ‘ 9    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

*rolls up sleeves* on it!

31.01.2025 19:52 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Finally able to prove I havenโ€™t been just complaining this whole time!! ๐Ÿ˜ญ

31.01.2025 19:48 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

And not a dumb question ๐Ÿ˜Š๐Ÿ˜Š

31.01.2025 19:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

What the downside to doing both of these in SQL?

28.01.2025 17:12 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Trying to make each and every transform efficient in the micro leads to inefficiencies in maintenance in the macro

27.01.2025 21:46 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

That is, by default, do the transform in SQL, and only think about whether to do it in the database or the client if you run into a blocker doing it in SQL.

No need to litigate every transform โ€” scarcity mentality in an era of compute riches

27.01.2025 21:49 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Trying to make each and every transform efficient in the micro leads to inefficiencies in maintenance in the macro

27.01.2025 21:46 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Does using the reports the service provides count? Like the GA dashboards inside GA? Or are you thinking more like grabbing some SQL/Tableau templates to run on the data in the DW?

22.01.2025 16:42 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Pivots ๐Ÿฅฒ๐Ÿฅฒ

22.01.2025 04:43 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

And isnโ€™t it wild that on the receiving side of the message, appreciation is one of the greatest gifts?

Feels small to give, but immense to receive

21.01.2025 18:34 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Sorry I literally just saw your name got autocorrected @vickiboykis.com. I assume this is a similar experience as when people call me Kristin, and I am proportionately appalled/apologetic.

20.01.2025 15:41 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
a man in a suit and tie is dancing with his hands in the air ALT: a man in a suit and tie is dancing with his hands in the air
19.01.2025 05:23 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Yes! Like a daydream poking into reality

19.01.2025 05:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Hmm like versus an app? For the audience I had in mind (like <12), I would say itโ€™s at least preferable!

19.01.2025 05:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Yess this is exactly the vibe that I had in mind. Something like this crossed with Teenage Engineering for the paperback sci fi vibe

19.01.2025 04:59 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

and maps analog inputs/outputs.

19.01.2025 00:01 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Startup idea โ€” a toy that looks like a generic hand-held computer with a few analog inputs/outputs (buttons, switches, lights) and a screen. Kid can use voice to describe an adventure they want to have (exploring a jungle, space rescue mission, vet on Mars). Toy generates a relevant UI onscreenโ€ฆ

19.01.2025 00:01 โ€” ๐Ÿ‘ 10    ๐Ÿ” 0    ๐Ÿ’ฌ 6    ๐Ÿ“Œ 0

@machsci is following 19 prominent accounts