🚨Free data alert!! 🚨 Please share.
Large new dataset of Amazon product reviews, including full text and photos and product characteristics, with individual *reviews labeled as fake reviews*.
I believe this is the first publicly available data of this kind.
github.com/bretthollenb...
Histograms are incredibly useful, interpretable, and common in BI. But building histograms that work well out of the box — no matter the data — is trickier than it sounds. We share some of the challenges faced, and decisions made, when designing histograms for Observable Canvases:
What cyborg work looks like as an academic, by Robert Ghrist, a mathematician and associate dean of undergraduate education at the University of Pennsylvania.
📚😅🎉
Yay!! I just submitted the complete manuscript of my upcoming book to the publisher!
Learn to easily and clearly interpret (almost) any stats model w/ R or Python. Simple ideas, consistent workflow, powerful tools, detailed case studies.
Read it for free @ marginaleffects.com
#RStats #PyData
We streamlined six new DID-like estimators and created this tutorial for implementation in R.
yiqingxu.org/packages/fec...
Hope you no longer need to spend months figuring out what these estimators are and how to use them.
People Pay for the Right to Bid — and Then Overbid anderson-review.ucla.edu/people-pay-f...
🚨 New blog post! 🚨
If you want to learn about:
🎨 Monochrome colour palettes
📊 Designing better black & white visualisations
🛠️ Rethinking single-colour chart design
Read this ➡️ nrennie.rbind.io/blog/monochr...
#RStats #DataViz #ggplot2 #RLadies
We've added an article about RStudio's Rproj files and how to adapt related workflows, if you're starting to kick the tires on Positron. If this interests you, check it out 👀
positron.posit.co/rstudio-rpro...
#rstats #rstudio #positron
📊 vs. 🥧
I made a tiny teaching tool to help me interactively demo + share differences between 📊 and 🥧
Play: I find that tinkering with data + visuals in class reinforces understanding far more than slides or readings
Save + share: Copy the url to link the current data
👉 barvpie.netlify.app
My PhD syllabus for Introduction to Quantitative Marketing @rotmanschool. Updated for 2025. Comments welcome.
Feel free to suggest additional papers. Self promotion encouraged! All University of Toronto PhD students welcome to audit. Please get in touch.
In 24,678 people in 176 rural Honduras villages, we found that villagers have an average of 6.89 (SD 3.79) friends, and these friends have 8.40 (SD 2.52) friends.
Villagers have an average of 1.26 (SD 1.70) enemies, and these enemies have 3.40 (SD 2.11) enemies.
www.nature.com/articles/s41... 7/
From the archives: Have you (like me!) wondered what the ATT means and how it's different from average treatment effects? I use #rstats to explore why we care about (and how to calculate) the ATE, ATT, and ATU #polisky #episky #econsky www.andrewheiss.com/blog/2024/03...
Foursquare just open sourced their 100 million place point of interest dataset! Some notes on poking around with it using DuckDB (it's Parquet files on S3) simonwillison.net/2024/Nov/20/...
A few things I've been working on lately:
elmer, elmer.tidyverse.org, is a new package to make it easier to work with LLMs (hosted and local) from #rstats. It includes helps for structured data extraction and tool calling, and an easy way to upload a plot. Joint work with Joe Cheng.