Models as Prediction Machines: How to Convert Confusing Coefficients into Clear Quantities
Abstract
Psychological researchers usually make sense of regression models by interpreting coefficient estimates directly. This works well enough for simple linear models, but is more challenging for more complex models with, for example, categorical variables, interactions, non-linearities, and hierarchical structures. Here, we introduce an alternative approach to making sense of statistical models. The central idea is to abstract away from the mechanics of estimation, and to treat models as “counterfactual prediction machines,” which are subsequently queried to estimate quantities and conduct tests that matter substantively. This workflow is model-agnostic; it can be applied in a consistent fashion to draw causal or descriptive inference from a wide range of models. We illustrate how to implement this workflow with the marginaleffects package, which supports over 100 different classes of models in R and Python, and present two worked examples. These examples show how the workflow can be applied across designs (e.g., observational study, randomized experiment) to answer different research questions (e.g., associations, causal effects, effect heterogeneity) while facing various challenges (e.g., controlling for confounders in a flexible manner, modelling ordinal outcomes, and interpreting non-linear models).
Figure illustrating model predictions. On the X-axis the predictor, annual gross income in Euro. On the Y-axis the outcome, predicted life satisfaction. A solid line marks the curve of predictions on which individual data points are marked as model-implied outcomes at incomes of interest. Comparing two such predictions gives us a comparison. We can also fit a tangent to the line of predictions, which illustrates the slope at any given point of the curve.
A figure illustrating various ways to include age as a predictor in a model. On the x-axis age (predictor), on the y-axis the outcome (model-implied importance of friends, including confidence intervals).
Illustrated are
1. age as a categorical predictor, resultings in the predictions bouncing around a lot with wide confidence intervals
2. age as a linear predictor, which forces a straight line through the data points that has a very tight confidence band and
3. age splines, which lies somewhere in between as it smoothly follows the data but has more uncertainty than the straight line.
Ever stared at a table of regression coefficients & wondered what you're doing with your life?
Very excited to share this gentle introduction to another way of making sense of statistical models (w @vincentab.bsky.social)
Preprint: doi.org/10.31234/osf...
Website: j-rohrer.github.io/marginal-psy...
25.08.2025 11:49 — 👍 942 🔁 283 💬 49 📌 19
Tracking Census Dataset Changes
See when the U.S. Census Bureau adds or removes datasets.
🚀 I made a tracker that shows when the Census Bureau adds or removes datasets from their APIs.
Dashboard at www.hrecht.com/census-api-d... and follow @censusapitracker.bsky.social for major updates.
I've been wanting to build this for years but now seemed an especially important time to keep track.
11.08.2025 17:09 — 👍 58 🔁 29 💬 4 📌 3
Academics responding to referee reports
08.07.2025 19:20 — 👍 6 🔁 1 💬 0 📌 0
TLDR: AI caused a lot more job posts, but no more jobs!
While the intervention did save employers time, the welfare loss to jobseekers from time wasted on applications is 6x the size of the welfare gain to employers. Suggests widespread use of LLMs can harm market efficiency.
24.03.2025 17:00 — 👍 20 🔁 5 💬 0 📌 1
🚨 New paper alert, with @johnjhorton.bsky.social! Using an experiment run on a large online labor market, we provide evidence that providing employers access to an AI-written first draft of a job post harms the efficiency of the market.
emmawiles.github.io/storage/jobo...
24.03.2025 17:00 — 👍 90 🔁 29 💬 8 📌 2
How Software Engineers Actually Use AI
We surveyed 730 coders and developers about how (and how often) they use AI chatbots on the job. The results amazed and disturbed us.
Is AI liberating human programmers—or programming them right out of their jobs?
At @wired.com we kept hearing conflicting accounts, so we surveyed 730 coders and developers at every career stage about how (and how often) they use AI chatbots on the job:
www.wired.com/story/how-so...
24.03.2025 15:37 — 👍 86 🔁 25 💬 4 📌 1
Tagging one of the three authors here: @joakimweill.bsky.social
14.04.2025 21:21 — 👍 1 🔁 0 💬 1 📌 0
This paper leverages novel data on 15 million home insurance policies to understand how credit scores and natural disaster risks affect the price of home insurance. We find that individuals with subprime credit pay 30% more for home insurance than individuals with super-prime credit. These differences are not explained by property or contract-level characteristics, and likely reflect how insurers use credit score to price anticipated future claim filing behavior. Leveraging a house-level match with a proprietary disaster risk model, we find that the passthrough of expected annual disaster losses is close to one on average, higher than one for individuals with lower credit scores, and below one in high insurance-regulation states like California. We show that despite a passthrough rate for disaster risk close to one, disaster risks account for less than a quarter of premiums on average, and around a third in high-risk states like Florida and Louisiana. These findings reveal that at the policy-level, the disaster risk component of insurance prices is on average smaller than the discount given to people with high credit scores. Insurance prices often depend more on who lives in a home than on the disaster risk a home is exposed to
Important paper on the home insurance market. Authors use novel data on 15 million insurance policies + document many new descriptive facts.
For instance, "the average person’s homeowners insurance [costs] 17% of their monthly principal and interest payment."
papers.ssrn.com/sol3/papers....
14.04.2025 21:20 — 👍 19 🔁 3 💬 1 📌 0
Deportation Data Project
The Deportation Data Project collects and posts public, anonymized U.S. government immigration enforcement datasets. A public good provided by academics and lawyers at Berkeley, UCLA, Stanford, Yale, and Columbia.
07.03.2025 01:57 — 👍 84 🔁 44 💬 1 📌 0
Image reads: "Recognizing that the system allows municipalities to adjust variables to stabilize revenue, the researchers wondered if it also created unintended opportunities for manipulation and self-dealing. To answer that question, they collected data on the homes of local tax assessors. They found that, on average, assessments on those properties grew 0.7 percentage points more slowly each year than those of similar homes nearby. This slight annual difference adds up to a significant 20-to-24 percent gap in the assessed values of the two sets of properties over the study period."
Also, the authors "find that local tax assessors: 1) have tax assessments on their own properties significantly lower than neighboring properties; and 2) these tax assessments grow significantly slower than neighbors – lowering their tax bills." 👀
Image via: www.library.hbs.edu/working-know...
09.04.2025 19:46 — 👍 3 🔁 0 💬 0 📌 0
image shows abstract of the nber working paper. here's the abstract: "Property tax revenues – the largest discretionary source of revenue for local governments - adjust at a pace that is inconsistent with property values in the US. We show that this form of revenue smoothing may be rooted in the political economy of municipalities. Measures of local budget stressors are positively related to upward assessments of a property’s value. Moreover, municipalities are significantly more likely to reassess in up markets as opposed to down – consistent with maximizing tax base and revenue collected. Using micro-level evidence from justpassing school referenda in Illinois, these shocks to municipal liabilities lead to significant increases in property assessments without any associated increases in market values or transactions. Passing a referendum over the prior 3 years increases the probability that a house is reassessed upward by 23%. This flexible form of revenue smoothing creates avenues for personal rent extraction. We find that local tax assessors: 1) have tax assessments on their own properties significantly lower than neighboring properties; and 2) these tax assessments grow significantly slower than neighbors – lowering their tax bills. We further document a significant connection between the underassessment of tax assessors’ own properties and the tax-maximizing assessment gaps documented in the districts they operate."
TIL "property tax revenues adjust at a pace that is inconsistent with property values in the US."
And this revenue smoothing is not symmetric: "municipalities are significantly more likely to reassess in up markets as opposed to down."
H/t @nber.org WP by Chen & Cohen: www.nber.org/papers/w33238
09.04.2025 19:42 — 👍 6 🔁 0 💬 1 📌 0
🚨🚨
What happens when you take a bunch of human capital out of a local economy? Is brain drain a real thing? And what are the consequences? In a new paper, we shed light on this in the context of highly developed economies: Sweden and Norway. #Econsky
08.04.2025 17:37 — 👍 13 🔁 6 💬 1 📌 1
Migration data is critical in the health, environmental, and social sciences.
We're releasing a new dataset, MIGRATE: annual flows between 47 billion pairs of US Census areas. MIGRATE is:
- 4600x more granular than existing public data
- highly correlated with external ground-truth data
1/2
28.03.2025 16:04 — 👍 28 🔁 4 💬 2 📌 0
Grateful for small joys these days, like a little more sunlight every day. Fun little map I made last week 📊
18.03.2025 23:50 — 👍 38 🔁 8 💬 2 📌 0
NEW PAPER: "Examining the role of training data for supervised methods of automated record linkage: Lessons for best practice in economic history" with Jonas Helgertz and Joe Price now forthcoming (and nicely proofed) at Explorations in Economic History 🧵 authors.elsevier.com/c/1kiv03I~dW...
05.03.2025 14:59 — 👍 26 🔁 7 💬 2 📌 0
If you:
- received your PhD in 2017-2024 &
- have early-stage research you're looking to present
=> submit to the ✨Early Career Workshop✨!
20.02.2025 21:28 — 👍 3 🔁 1 💬 0 📌 0
Call for Papers | Opportunity & Inclusive Growth Institute
Hey #econsky! FYI: our @oiginstitute.bsky.social annual research conference is Nov 13-14, 2025!
- Nov 13 is the main conference day ft. keynote by David Card
- Nov 14 is a workshop day for early career researchers
Learn more + submit here (deadline: 4/11): www.minneapolisfed.org/institute/co...
20.02.2025 21:17 — 👍 10 🔁 4 💬 1 📌 1
Happy to share that @annabindler.bsky.social & I will organize the 16th #TWEC Transatlantic Workshop on the Economics of Crime in Berlin.
Save the date: 26-27 September 2025.
CfP follows next month #econsky
29.01.2025 14:01 — 👍 31 🔁 21 💬 2 📌 2
CeMENT: Mentoring for Junior Faculty
Based on feedback from past participants, potential applicants, and mentors, the CeMENT workshop will now be held June 30 - July 2, 2025, at the Federal Reserve Bank of Chicago. To apply, go to: www.aeaweb.org/about-aea/co...
Deadline to apply is March 15.
28.01.2025 19:55 — 👍 11 🔁 12 💬 0 📌 5
📣 Submit your crime paper by Feb 14.
🔸 4th Workshop on the Economics of Crime for Junior Scholars
🔸 29-30 May, 2025
🔸 Quattrone Center, UPenn
🔸 Keynote: @amandayagan.bsky.social
🔹 Send your paper to:
WEC.Jr.Econ@gmail.com
@emmarackstraw.bsky.social
16.01.2025 16:25 — 👍 13 🔁 12 💬 0 📌 5
A bar/area chart with the title, "20% of American adults do 75% of the country's book reading."
2/ If you combine most Americans not reading much and a few reading a lot, you've got some Power Law nonsense developing! The 4% of adults who read 50 or more books did at least 28% of ~all the country's reading~ — more than the bottom 80% combined.
The top 20% read 75% of all books.
21.01.2025 16:18 — 👍 28 🔁 9 💬 2 📌 8
Bar chart with the title "Most Americans didn't read many books last year"
At the end of the year, I ran a poll for @today.yougov.com asking people how many books they read in 2024. The answer is not many! The median American read ~2 books~ last year; 38% of Americans didn't read ANY.
But... 1/
21.01.2025 16:15 — 👍 36 🔁 8 💬 5 📌 6
In economics, editors, referees, and authors often behave as if a published paper should reflect some kind of authoritative consensus.
As a result, valuable debate happens in secret, and the resulting paper is an opaque compromise with anonymous co-authors called referees.
1/
24.12.2024 14:44 — 👍 556 🔁 161 💬 28 📌 33
Congestion Pricing Tracker | Benjamin and Joshua Moshes
This project is run by Joshua Moshes and Benjamin Moshes, under the supervision of Brown University Professor Emily Oster
Students at Brown have used the Google Maps API to create a live tracker for traffic flows under congestion pricing in NYC. It includes a real-time diff-in-diff type comparison between NYC, Boston, and Chicago:
www.congestion-pricing-tracker.com
05.01.2025 17:39 — 👍 425 🔁 112 💬 15 📌 15
As I am now handling papers at AEJ:Policy again, I want to encourage authors of papers that identify partial equilibrium effects to consider (in a rigorous manner) how they relate to policy effects or general equilibrium effects.
11.01.2025 16:10 — 👍 90 🔁 34 💬 3 📌 5
Nice! Will def check out your post if I ever switch to #quarto 😅
15.01.2025 17:02 — 👍 1 🔁 0 💬 0 📌 0
Me (today) revisiting some old code written by me (years ago)
@xkcd.com
15.01.2025 01:39 — 👍 38 🔁 1 💬 1 📌 0
PSA: Folks are porting Hugo/blogdown sites over to #Quarto nowadays.
H/t @bharrap.bsky.social for the post on how to do that.
bsky.app/profile/bhar...
15.01.2025 01:27 — 👍 5 🔁 0 💬 0 📌 0
Let Me Upgrade You | The Little Dataset
Updated my blog for the first time since 2023. It had been so long that I needed to read one of my old blog posts to even remember *how* to update it!
Thank you, 2018-self for the RStudio + {blogdown} workflow refresher: thelittledataset.com/2018/08/26/l...
#rstats #blogdown
15.01.2025 01:25 — 👍 17 🔁 0 💬 2 📌 0
Data and graphics journalist. Census, health policy, R and open source.
Now: freelancing. Past work for The Washington Post, KFF Health News, Bloomberg News.
https://www.hrecht.com/
Assistant Professor @ Carlson School
Labor, history, gender & family
Economist. Researches the criminal justice system at Public Policy Institute of California. PhD from UC Berkeley. Undergrad at Iowa State. Thoughts are my own. #EconSky
https://sites.google.com/view/deepakpremkumar
Economist studying crime policy. Assistant Professor at the University of Pennsylvania, Criminology Department.
Climate economist at the Federal Reserve Board. Views are my own, website https://joakimweill.github.io
Dallas Fed Economist, CESifo, IZA // Labor institutions, imperfect labor markets, public econ // Volleyball and music enthusiast
Views my own
PhD student at Cornell Tech | he/him | cities + equity + spatial everything | fan of cats and Taylor Swift | gsagostini.github.io
CS PhD student at Cornell Tech. Interested in interactions between algorithms and society. Princeton math '22.
kennypeng.me
Assistant professor of CS at UC Berkeley, core faculty in Computational Precision Health. Developing ML methods to study health and inequality. "On the whole, though, I take the side of amazement."
https://people.eecs.berkeley.edu/~emmapierson/
Data junkie, Clemson Tiger 🐅, very amateur golfer ⛳, husband, father. Star Wars fan. Sometimes I run 🏃 Views are my own. **Rebellions are built on hope**
On the faculty job market!
Presently PhD candidate, @hcii.cmu.edu. Formerly: Adobe, Highsoft, Apple, Visa, + others.
I care a lot about accessible data interaction 📊 (:
Disabled, getting into trouble, & making a ttrpg.
He/him
www.frank.computer
Visual journalist and engineer at Axios. Builder of websites, data visualizations, and pixel cats. She/her.
Research Economist at the World Bank’s Development Research Group
PhD Harvard Econ
https://www.louisepauldelvaux.com/
economist @ university of minnesota
Research Fellow at IFPRI. Trying this out!
Economist at U.S. Census Bureau, researching income measurement, aging.
Assistant Professor at Cornell. Macro, labor, inequality. "Bloesch" rhymes with "mesh".
Economist @ UChicago. Drummer. Dad.
https://www.guglielmobriscese.com
The Opportunity & Inclusive Growth Institute at the Minneapolis Fed supports research to expand economic opportunity and inclusive growth for all. Reposts ≠ endorsement.
Economist, Sr. Econ Advisor at Cornell -ILR.
14th Commissioner of BLS.
Formerly at NY & Cleveland Federal Reserve Banks. Now in Seattle!