Jeroen Mahieu

@jmahieu.bsky.social

Assistant professor at Utrecht University School of Economics || Economics of Entrepreneurship

98 Followers  |  187 Following  |  26 Posts  |  Joined: 26.09.2023  |  2.092

Latest posts by jmahieu.bsky.social on Bluesky

One clear pattern emerging from students’ #LLM usage for their papers is that they all rely on the same theories. Like 90% use the “resource-based view” of the firm. Before, they often relied on weird, exotic frameworks, but now it has all become so boring and repetitive.

08.07.2025 11:41 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

On the verge of declaring defeat with chatgpt in my asynchronous online dataviz class. Something changed this semester compared to past ones and SO MANY assignments are essentially 100% LLM output.

08.05.2025 00:44 | 👍 543 | 🔁 60 | 💬 55 | 📌 30

Even the White House is using GenAI to cheat on its homework 🤷‍♂️

03.04.2025 06:21 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

The State of Personal Online Security and Confidentiality | SXSW LIVE (YouTube video by SXSW)

Meredith Whittaker on the meaning of privacy, Signal vs WhatsApp/Telegram/iMessage/…, the value proposition of OS tech, why being a nonprofit is mission critical, and much more. Besides the privacy part, this is a deep discussion on strategy in the current tech ecosystem.

youtube.com/live/AyH7zoP...

29.03.2025 17:39 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Childbirth affects firm performance for female Norwegian entrepreneurs: -30 percent profit 10 years post-childbirth. No such decline for male-owned businesses is found, from John Bonney, Luigi Pistaferri, and Alessandra Voena https://www.nber.org/papers/w33448

12.02.2025 22:00 | 👍 6 | 🔁 2 | 💬 0 | 📌 0

Post image

04.02.2025 15:14 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Musk's Junta Establishes Him as Head of Government: Imagining how we'd cover overseas what's happening to the U.S. right now

Having watched with growing alarm the developments of the last 24 and 36 hours in Washington, I thought I’d take a stab at how the US media would cover this story if it was happening in a foreign country. Here’s that story that should be written this weekend: www.doomsdayscenario.co/p/musk-s-jun...

01.02.2025 15:53 | 👍 6979 | 🔁 3620 | 💬 352 | 📌 810

Oh, the irony

29.01.2025 04:30 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

All Day TA: All Day TA is an AI EdTech company focused on higher education that enables professors to build customized AI teaching assistants for their courses. Available 24/7, it provides students with instant, ...

The big black box for us as teachers remains how students are actually using LLMs and how we can help them use LLMs in a way that helps them *learn*. I currently see very few efforts in this direction. Tools like alldayta.com are a first step but not the best solution for this problem imo

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Because of this lack of expert knowledge, learning *from* LLMs and learning *how* to generate high-quality text with LLMs is impossible. Students remain trapped in mediocre text that may *look* good but earns a “sufficient” grade at best.

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

Students lack expert knowledge, which is complementary to LLM output. Without such knowledge it is very hard to direct LLMs to consistently produce output that is better than mediocre. You see this very clearly in theory sections that require careful logical argumentation and “connecting the dots”.

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

Student effort is going down. LLMs typically produce text that looks good at first sight and might trick a non-expert into thinking they can do the job with less effort on their side. However, the actual quality of such first attempts is, most of the time, mediocre at best.

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

However, I see little improvement among the mediocre and good proposals, despite more than 2 years of model improvements since the ChatGPT launch. My guess is that this is due to several reasons:

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

The quality of the worst proposals has improved, mainly in terms of writing. Nobody submits terrible text anymore. My standards on this also have increased; submitting text with grammar or spelling mistakes is not done and will be penalised. 0 cost to write w/o grammar mistakes, students know this

28.01.2025 21:30 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

Reading through the research proposals submitted by my students, I do not believe the claim that “student papers are dead since ChatGPT”. Despite a basically 100% adoption rate of LLMs to develop their papers, there’s no sign the student paper problem is “solved”. A couple of observations:

28.01.2025 21:30 | 👍 2 | 🔁 0 | 💬 1 | 📌 0
👀

21.01.2025 10:58 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

um

20.01.2025 19:58 | 👍 44835 | 🔁 11446 | 💬 7005 | 📌 8071

Post image

18.01.2025 13:08 | 👍 70 | 🔁 4 | 💬 0 | 📌 0

Does ChatGPT use 10x more energy than a standard Google search? A journey down the rabbit hole of viral AI energy claims. It's probably true in relative terms, but that's not what matters.

That’s a good article, but the figures quoted for ChatGPT vs. Google search are outdated, wrong & too high.

Best current energy estimate for a day of ChatGPT use is equiv. to driving an average car the length of a tennis court:

engineeringprompts.substack.com/p/does-chatg...

19.01.2025 17:00 | 👍 10 | 🔁 2 | 💬 0 | 📌 0

Before, I would never have even considered searching, hiring and training a TA (or two) for this project given it is too small and there is no funding. Now it costs me 10 dollars for the API and three hours to debug the code myself + some minor manual cleaning for the “special” cases. Bonkers

17.01.2025 23:13 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

E.g.: in less than half a day, I had GPT-4o code and run a script to extract and interpret text from unstructured scanned company PDFs in Dutch and French & return structured data based on the information from the docs (“give the gender of all the founders of the firm”). This would have taken an RA weeks.

17.01.2025 23:13 | 👍 0 | 🔁 0 | 💬 1 | 📌 0
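
The post does not show the actual script, so the following is only a minimal sketch of what such an extraction step could look like. Everything here is an assumption for illustration: the `ask_llm` callable (a stand-in for whatever API client sends a prompt and returns the model's text), the prompt wording, and the JSON shape are all hypothetical. Documents whose answers cannot be parsed are returned as `None`, which corresponds to the “special” cases left for manual cleaning.

```python
import json
from typing import Callable, Optional

# Hypothetical prompt; the wording and the JSON shape are assumptions.
PROMPT_TEMPLATE = (
    "The text below was extracted from a scanned company document "
    "(Dutch or French). Give the gender of all the founders of the firm. "
    'Answer only with JSON shaped like {{"founders": [{{"name": "...", "gender": "..."}}]}}.\n\n'
    "Document text:\n{document_text}"
)


def extract_founders(document_text: str, ask_llm: Callable[[str], str]) -> Optional[list]:
    """Ask the model for structured founder data.

    `ask_llm` is a hypothetical stand-in for a real API call.
    Returns a list of founder records, or None when the answer
    cannot be parsed and the document needs manual review.
    """
    prompt = PROMPT_TEMPLATE.format(document_text=document_text)
    raw_answer = ask_llm(prompt)
    try:
        parsed = json.loads(raw_answer)
    except json.JSONDecodeError:
        return None  # model did not return valid JSON
    if not isinstance(parsed, dict) or not isinstance(parsed.get("founders"), list):
        return None  # valid JSON, but not the requested shape
    return parsed["founders"]


def fake_llm(prompt: str) -> str:
    """Canned answer so the sketch runs without any API access."""
    return '{"founders": [{"name": "Founder A", "gender": "female"}]}'


print(extract_founders("(OCR text of a scanned PDF would go here)", fake_llm))
```

Passing the model call in as a function keeps the parsing and fallback logic testable without spending API budget.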

An underappreciated fact about the massive cost reduction potential of LLMs for data collection and cleaning is that some projects which in the past were too costly to do compared to the possible contribution/impact are now interesting cost-benefit wise. A whole new problem space is now within reach

17.01.2025 23:13 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

New randomized, controlled trial by the World Bank of students using GPT-4 as a tutor in Nigeria. Six weeks of after-school AI tutoring = 2 years of typical learning gains, outperforming 80% of other educational interventions.

And it helped all students, especially girls who were initially behind.

15.01.2025 20:58 | 👍 355 | 🔁 89 | 💬 15 | 📌 27

"I have no hopes for 2025. Humanity is disappointing. We killed the Earth. Villains triumph and the innocents suffer. I imagine these trends will continue."

incredible -- the NYT ran fluff "what i hope to see in 2025" blurbs from CEOs and economists, and then this guy

www.nytimes.com/2025/01/02/o...

04.01.2025 04:52 | 👍 6763 | 🔁 1708 | 💬 90 | 📌 214

To conclude: one effect being “significant” and the other “not significant” is rarely enough to conclude a meaningful difference of differences.

01.01.2025 13:02 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

In fact, with a moderate correlation (most likely scenario), the difference of 0.11 on an 11-point PTV scale is probably not large enough to be statistically significant and the main claim of the paper is not valid.

01.01.2025 13:02 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

For both groups the effect of inclusion is positive, and the "difference-in-differences" is 0.11 (0.20 − 0.09). Only with an extremely high positive correlation between VB and N-VA PTV will that difference be significant. That’s not impossible, but it’s quite speculative without the underlying data.

01.01.2025 13:02 | 👍 0 | 🔁 0 | 💬 1 | 📌 0
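
To make the role of the correlation concrete, here is a rough back-of-the-envelope sketch using only the numbers quoted in this thread. It rests on several assumptions that are not confirmed by the paper: that the reported p-values come from two-sided z-tests (so standard errors can be backed out from them), that 0.20 belongs to VB and 0.09 to N-VA, and that `rho` is the unknown correlation between the two effect estimates. The function names are made up for illustration.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, sd 1


def se_from_p(effect: float, p_two_sided: float) -> float:
    """Back out a standard error from an effect and its two-sided p-value.

    Assumes the p-value comes from a two-sided z-test.
    """
    z = STD_NORMAL.inv_cdf(1 - p_two_sided / 2)
    return abs(effect) / z


def p_value_of_difference(effect_a, se_a, effect_b, se_b, rho):
    """Two-sided p-value for (effect_a - effect_b), where rho is the
    assumed correlation between the two effect estimates."""
    se_diff = sqrt(se_a**2 + se_b**2 - 2 * rho * se_a * se_b)
    z = abs(effect_a - effect_b) / se_diff
    return 2 * (1 - STD_NORMAL.cdf(z))


# Numbers quoted in the thread; pairing 0.20 with VB and 0.09 with N-VA is assumed.
EFFECT_VB, P_VB = 0.20, 0.045
EFFECT_NVA, P_NVA = 0.09, 0.445

SE_VB = se_from_p(EFFECT_VB, P_VB)
SE_NVA = se_from_p(EFFECT_NVA, P_NVA)

# Show how the p-value of the 0.11 difference moves with the correlation.
for rho in (0.0, 0.3, 0.5, 0.7, 0.9):
    p = p_value_of_difference(EFFECT_VB, SE_VB, EFFECT_NVA, SE_NVA, rho)
    print(f"rho = {rho:.1f}  ->  p = {p:.3f}")
```

Under these assumptions, the difference of 0.11 should only drop below p = .05 once the correlation gets close to 0.9, which is what "extremely high" means here.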
Post image

Here's the figure with the main result. We see that inclusion/exclusion significantly altered PTV for VB (p = .045), but not for N-VA (p = .445). However, one effect being significant and the other being non-significant does not itself prove that those effects differ.

01.01.2025 13:02 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

This study has gained some traction and is being picked up by the Belgian press. Looking at the results, though, I'm skeptical of its main claim: that the mainstream party including or excluding a radical right party differentially affects support for the two. Let me explain:

01.01.2025 13:02 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

Opinion | The self-employed care worker (zorg-zzp’er) is the symbol of a system that does not value people. Self-employed workers: now that the government will soon really start enforcing the rules on false self-employment, Jeroen Mahieu warns of an exodus of freelancers from healthcare.

“The self-employed care worker is the symbol of a system that does not value people”, or why enforcing the law on false self-employment will lead to an exodus of valuable workers from healthcare. Opinion piece in NRC.

30.12.2024 12:16 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

@jmahieu is following 20 prominent accounts