Kai Zhu's Avatar

Kai Zhu

@kai-zzzzzz.bsky.social

Assistant Professor at Bocconi University https://kaizhu.me/

72 Followers  |  301 Following  |  25 Posts  |  Joined: 13.11.2024  |  2.2574

Latest posts by kai-zzzzzz.bsky.social on Bluesky

Centre for Competition Policy High quality independent research into competition policy and regulation

Call for papers, 4th UK Workshop on Digital Economics, London 28 November 2025: competitionpolicy.ac.uk/events/4th-w... This is always a great event

25.07.2025 09:43 β€” πŸ‘ 21    πŸ” 15    πŸ’¬ 0    πŸ“Œ 0
Preview
How language is hiding the real internet from you Most of the internet is out of your reach, but the barrier isn't just algorithms. In another language, the same platforms turn into whole other worlds.

I wrote an article about linguistic bias and the internet for the BBC, based on a paper @ze.vin, @ethanz.bsky.social, and I wrote comparing four language-specific samples of YouTube. www.bbc.com/future/artic...

13.08.2025 16:55 β€” πŸ‘ 27    πŸ” 10    πŸ’¬ 1    πŸ“Œ 3

Want a good starting point for learning good principles of Dataviz?

I'd highly recommend @andrew.heiss.phd course--Data Visualization with R.

Reading materials, slides, lecture videos, examples, code, etc. are all posted for free on his website.

12.08.2025 16:18 β€” πŸ‘ 47    πŸ” 11    πŸ’¬ 1    πŸ“Œ 1
Law and ethics
Post-API Age
XML and JSON
IP and HTTP
Static web pages
Archives web pages
Dynamic web pages
PDFs
Wikipedia
Government APIs
Social APIs
Automation
AI APIs

Law and ethics Post-API Age XML and JSON IP and HTTP Static web pages Archives web pages Dynamic web pages PDFs Wikipedia Government APIs Social APIs Automation AI APIs

Got around to pushing all my @cuboulder.info Web Data Science @jupyter.org notebooks to @github.com

Enjoy! github.com/cuinfoscienc...

27.03.2025 03:45 β€” πŸ‘ 57    πŸ” 14    πŸ’¬ 3    πŸ“Œ 0

There have been a number of recent articles on statistical power in quantitative political science. This is something that I think deserves more attention and discussion. A short thread of the articles I have read. 🧡

23.07.2025 06:58 β€” πŸ‘ 72    πŸ” 24    πŸ’¬ 3    πŸ“Œ 1
Preview
Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites.

Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites. blog.cloudflare.com/perplexity-i...

04.08.2025 13:30 β€” πŸ‘ 73    πŸ” 45    πŸ’¬ 3    πŸ“Œ 14
Monetizing Platforms: An Empirical Analysis of Supply and Demand Responses to Entry Costs in Two-Sided Markets | Management Science

Read the full (Open Access!) paper here: doi.org/10.1287/mnsc...

Thanks to my co-authors, Qiaoni Shi and Shrabastee Banerjee!

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Key takeaway: Introducing entry costs can reshapes the ecosystem. Platforms must weigh short-term revenue against the long-term risks of marginalizing small creators, reducing diversity, and harming consumer matching.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The Mismatch: Why lower ratings? We used a fine-tuned BERT model to analyze review text. The results suggest an increase in consumer-book mismatches.

With reduced diversity (a shrinking "long tail"), readers were more likely to receive books misaligned with their preferences.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The Demand Paradox: How did readers (demand side) respond? The promotional effects intensified, but with a paradox.

Books in the paid program received a HIGHER volume of reviews, but LOWER average ratings. Monetization amplified the "Groupon effect."

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The "Rich-Get-Richer" Dynamic: Diving deeper, we saw a "rich-get-richer" effect. Popular genres (like Mystery/Thriller) became more dominant, while niche genres (like Poetry/Science) lost market share. The entry cost narrowed the range of cultural products being promoted.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The Diversity Drop: This shift in suppliers directly affected product variety. We measured a significant decline in the diversity of book genres available in the program post-monetization. The marketplace became less varied.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Market Concentration & Author Profiles: This led to a massive 200% increase in market concentration (HHI). Furthermore, the authors who continued to participate post-monetization were generally more established, popular, and experienced with the platform.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Uneven Impact & Concentration: Importantly, the impact was uneven. The cost disproportionately pushed out indie publishers and self-published authors.

While overall participation dropped, the market share of the "Big 5" publishing houses more than doubled (12% to 30%).

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

The Supply Shock: The impact on the supply side was immediate and dramatic. Introducing the entry cost caused the average number of monthly promotional campaigns to plummet from ~3,000 to ~1,000.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We studied the Goodreads "Giveaways" program, a marketplace for book promotion. It was free for authors/publishers until Jan 2018, when Goodreads introduced a fixed $119 entry cost.

This provided a natural experiment to study monetization in a two-sided market.

04.08.2025 10:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

πŸ“£ Thrilled to announce our new paper, "Monetizing Platforms: An Empirical Analysis of Supply and Demand Responses to Entry Costs in Two-Sided Markets," is now published in Management Science!

When a digital platform starts charging for access, who wins and who loses? πŸ§΅πŸ‘‡

04.08.2025 10:10 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image Post image

Inspiring PDW on using sensitivity analysis in empirical management research. My contribution is to present the sensemakr package by Cinelli & Hazlett (2020) for observational designs. Thanks a lot to the organizers for putting this fantastic session together. #AOM2025

26.07.2025 07:52 β€” πŸ‘ 18    πŸ” 3    πŸ’¬ 2    πŸ“Œ 0
Preview
Where Congress’s Cuts Threaten Access to PBS and NPR The loss of federal funding threatens scores of public TV and radio stations across the United States.

Where the cuts are going to be felt the most.
www.nytimes.com/interactive/...

19.07.2025 15:44 β€” πŸ‘ 24    πŸ” 13    πŸ’¬ 0    πŸ“Œ 1
Post image

24/ An excellent recent survey revisits the theoretical literature on herds & cascades It notes that cascades cause poor information aggregation, lead to fragile mass behaviors, and remain central to understanding social learning. Those 1992 papers launched a vast literature 😎

06.07.2025 04:01 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Academia will form these little pockets -- people whose theorizing is outrageous & supported by methods outdated since the 90s -- but once it reaches a critical size those people just review each others papers & grants, form societies, hand out awards etc, like a self-contained parallel society.

03.06.2025 05:31 β€” πŸ‘ 467    πŸ” 90    πŸ’¬ 26    πŸ“Œ 30

its crazy how dominant germany was in science (especially chemistry) in the 19th century, it was basically the international language for scientists, people came from all over the world to train at heidelberg etc, and then....

31.05.2025 10:39 β€” πŸ‘ 4053    πŸ” 1279    πŸ’¬ 71    πŸ“Œ 39
Preview
How museums are using Wikipedia to archive marginalized art As public archival institutions fade and the state rewrites curricula, the next generation of cultural workers is stepping up

Wikipedia may seem like an unlikely site of resistance, but in an era of escalating censorship, disinformation, & erosion of public trust, Wikipedia is a model for collective governance led by next generation of cultural workers, @alexmar.bsky.social reports: prismreports.org/2025/05/12/w...

12.05.2025 23:18 β€” πŸ‘ 292    πŸ” 104    πŸ’¬ 4    πŸ“Œ 5
Preview
Wikipedia Contributions in the Wake of ChatGPT How has Wikipedia activity changed for articles with content similar to ChatGPT following its introduction? We estimate the impact using differences-in-differences models, with dissimilar Wikipedia ar...

Just out in WWW last week! πŸ“œOur work on substitution patterns between Wikipedia and ChatGPT. We find *heterogeneous* impacts, where Wiki articles that are similar to ChatGPT outputs see a greater drop in views than dissimilar articles:

arxiv.org/abs/2503.00757

06.05.2025 19:04 β€” πŸ‘ 31    πŸ” 3    πŸ’¬ 3    πŸ“Œ 0
Post image

Have you ever asked yourself about the overall extent of TikTok? Here some numbers from "Just Another Hour on TikTok" - Great compliment to @bendavidsteel.bsky.social for this data collection effort!
w/ @miriamschirmer.bsky.social & Derek Ruths
arxiv.org/abs/2504.13279

21.04.2025 17:09 β€” πŸ‘ 78    πŸ” 32    πŸ’¬ 4    πŸ“Œ 4
Preview
Climate Terminology Does Not Matter Our new paper finds that swapping out one climate term for another does not meaningfully change people’s stated commitment to fight climate change

Climate Terminology Does Not Matter

Across tens of thousands of participants in two large-scale experiments, we found that labeling climate change in different ways had no effect on their stated willingness to act.
jayvanbavellab.substack.com/p/climate-te...

via @dgoldwert.bsky.social

07.04.2025 17:46 β€” πŸ‘ 127    πŸ” 43    πŸ’¬ 8    πŸ“Œ 6
Post image Post image

Wow!

Three scholars at Columbia, Michigan, & Maryland just introduced a measure of the partisan leanings of employers in the U.S.

The data is constructed by linking voter registrations to online worker profiles.

VRscores capture the political affiliations of 21.8M workers across 2.6M employers.

06.04.2025 15:25 β€” πŸ‘ 268    πŸ” 74    πŸ’¬ 13    πŸ“Œ 11
Preview
β€œWait, not like that”: Free and open access in the age of generative AI The real threat isn't AI using open knowledge β€” it's AI companies killing the projects that make knowledge free

Awesome post by @molly.wiki on the tensions of free knowledge production and commons in the face of extractive technologies like AI. www.citationneeded.news/free-and-ope...

14.03.2025 18:35 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Machine-assisted Content Creation on Peer Production Platforms We examine how AI-powered translation technology shape multilingual content creation on peer production platforms. Drawing on Wikipedia's integration of Google

This project is a collaboration with @dylantwalker.bsky.social. We sincerely thank the reviewers and editors for their constructive feedbackβ€”it has been incredibly helpful in improving the paper!

You can find the updated paper here: papers.ssrn.com/sol3/papers....

24.02.2025 18:03 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Our work offers actionable insights for platform designers and policymakers aiming to democratize knowledge and bridge digital divides through AI.

24.02.2025 18:03 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@kai-zzzzzz is following 20 prominent accounts