Trying to get back into writing. In my latest, I look at NBA rookie minutes and the Wizards and cacti. Like and subscribe!๐ต open.substack.com/pub/wizardsp...
01.08.2025 21:09 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0@jacobpstein.bsky.social
Evidence-based data science, vibes-based basketball fan. Here for #tidytuesday, mostly. Code here: https://github.com/jacobpstein
Trying to get back into writing. In my latest, I look at NBA rookie minutes and the Wizards and cacti. Like and subscribe!๐ต open.substack.com/pub/wizardsp...
01.08.2025 21:09 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0If you're on the data science job hunt and feeling discouraged just know that there are terrible clustering algos out there, in production, and you can do much better. Like, look at these 'similar shoes' from DSW. If you're reading this, you can get better results. I believe in you!
10.07.2025 18:10 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Still hurts to see the Wizards at 6 after seeing the Wizards at 18-64.
25.06.2025 23:32 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Say what you will about Poole as a player, but he went from laughing stock to really winning fans over in DC this past season
24.06.2025 19:27 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0In my old day job, I pushed for more simulations to inform causal design, check methods, and help us learn. I posted some code and did a little write up on LinkedIn with a colleague about a case of propensity score matching that crossed our desks. github.com/jacobpstein/...
23.06.2025 18:51 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Sometimes you accidentally write a recursive loop and that's when the fun really starts.
18.06.2025 18:16 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0If only someone would write a whole book about this!!!
16.06.2025 14:50 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1What a difference a default makes: on viz buzz last week I used theme_minimal, which defaults to a clear background. My viz was 7% similar to target viz due to transparency @libbyheeren.bsky.social & @nickwan.bsky.social checked again with a white background and wellโฆ m.twitch.tv/nickwan_data...
16.06.2025 14:13 โ ๐ 2 ๐ 1 ๐ฌ 1 ๐ 0Very cool data viz showing game flow in 3D for the NBA finals vsueiro.com/hoop-hills/
07.06.2025 17:17 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Trolli also had the sour Harden gummies back in the day
07.06.2025 02:21 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Thanks! So I think there is a serious selection problem in the data since Project Gutenberg doesnโt have access to a lot of modern literary copyrights. That said, I wouldnโt be surprised if the 19th century was the peak (at least adjusting for overall population)
05.06.2025 00:01 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0I barely had any time for #TidyTuesday this week and want to revisit these Gutenberg data sets with some LLM tools at some point. I looked at life spans but kept it to the period since the modern novel was born. This could be a good interactive if I were doing a quarto presentation
04.06.2025 17:36 โ ๐ 5 ๐ 1 ๐ฌ 2 ๐ 0Congrats on this! It feels like a big jump for the whole R multiverseโจcanโt wait to test it out
30.05.2025 22:32 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0I think ChatGPT is actually a pretty great learning tool! I still prefer Stackoverflow because I am an old man. The recent Posit session on LLMs had a nice overview of what theyโre good at and integrating them into workflows
30.05.2025 11:46 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0I meant to mention that I used Polars this week and found variable creation way easier than in pandas. Also, conditional, ifelse style creation is pretty smooth. Hereโs my notebook if itโs of interest: github.com/jacobpstein/...
30.05.2025 11:43 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0Bennedict Mathurin flies while Josh Hart tries to block him
I know it's lame to highlight a corporate-y Getty photo, but this is one of those cool basketball pics that highlights how these guys are so good at doing otherworldly stuff--like somehow shooting a ball while seemingly falling and being blocked
30.05.2025 00:03 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Love it. geom_point, quiet hero.
29.05.2025 23:31 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Yeah! It would be great to hear how other people approach these
29.05.2025 02:04 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0I donโt like live codingโitโs kind of like when someone asks if you know any jokes and you canโt think of a single funny thing youโve ever heard in your life. But itโs also probably good to live code occasionally so at least you know where you get stuck, what makes you nervous, etc.
29.05.2025 01:49 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0This week's #TidyTuesday was a tough one! Lots of correlated values, no domain knowledge, and small-n groups. I spent a long time flailing around trying to figure out what might be interesting. Predicting hit points based on the other data seemed like a good way to compare model types.
28.05.2025 14:30 โ ๐ 13 ๐ 2 ๐ฌ 1 ๐ 0Like what you did there with the ggplot theme๐ฒ
27.05.2025 22:20 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0I am also using this weekโs data to kick the tires on polars! It isโฆfine. The .to_pandas function call is getting a lot of use
27.05.2025 18:55 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Weird, don't know why the URL didn't render as a hyperlink. Here ya go! 0196f5d5-dc61-3977-66b7-ccd1e7b9cead.share.connect.posit.cloud#/title-slide
26.05.2025 17:16 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0The story I told to accompany the slides was basically about not being able to shake a stat and how even when you understand something rationally, the irrational hold it can have on you lingers
26.05.2025 17:15 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0I didn't get around to doing #TidyTuesday last week because I was hustling to finish slides for a presentation to the DC Data Viz meetup. Here are the slides--https://0196f5d5-dc61-3977-66b7-ccd1e7b9cead.share.connect.posit.cloud/#/title-slide
26.05.2025 17:15 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0@owenphillips.bsky.social don't quite know what to make of this, but the correlation between two point attempts and shot quality went positive on average for the first time this season. Could be spurious, could be mid-range theory at play
22.05.2025 02:48 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0I did econ, but a lot of this comes from putting in reps. A new data set is like walking into a restaurant you've never been to before--you probably have a sense of where the bathroom might be, or at least what to look for, and you'll know if you've accidentally wandered into the kitchen
18.05.2025 14:02 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Totally reasonable question! It's a scatter plot, but instead of doing two continuous variables on the x and y axes, you make one of the axes factor (or character or some other class). Code is here! github.com/jacobpstein/...
17.05.2025 12:07 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0thanks! I'm glad I finally took the plunge into Tidy Tuesday!
17.05.2025 02:34 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0I have been re-reading Ferrante's Neapolitan Novels so this week's #TidyTuesday felt very much on theme. I started to go down a rabbit hole of spatial modeling, but decided that for getting this done while I have a little time, it's better just to make a nice descriptive plot.
16.05.2025 17:54 โ ๐ 9 ๐ 0 ๐ฌ 1 ๐ 1