Hayoung Jung's Avatar

Hayoung Jung

@hayoungjung.bsky.social

PhD student at @princetoncitp.bsky.social. Previously @uwcse.bsky.social website: hayoungjung.me

112 Followers  |  129 Following  |  32 Posts  |  Joined: 22.11.2024  |  2.4068

Latest posts by hayoungjung.bsky.social on Bluesky

Post image Post image

๐ŸŒฑโœจ Life update: I just started my PhD at Princeton University!

I will be supervised by @manoelhortaribeiro.bsky.social and affiliated with Princeton CITP.

It's only been a month, but the energy feels amazing โ€”very grateful for such a welcoming community. Excited for whatโ€™s ahead! ๐Ÿš€

03.10.2025 17:56 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Social media feeds today are optimized for engagement, often leading to misalignment between users' intentions and technology use.

In a new paper, we introduce Bonsai, a tool to create feeds based on stated preferences, rather than predicted engagement.

arxiv.org/abs/2509.10776

16.09.2025 13:24 โ€” ๐Ÿ‘ 153    ๐Ÿ” 46    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 7
Preview
MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform Understanding the prevalence of misinformation in health topics online can inform public health policies and interventions. However, measuring such misinformation at scale remains a challenge, particu...

Forgot the most important link! Paper here:

08.09.2025 18:50 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Lastly, I would like to thank my awesome collaborators @shravika-mittal.bsky.social, Ananya Aatreya (my first mentee!), @navreetkaur.bsky.social, and faculty mentors who taught me a lot during this project @tanumitra.bsky.social @munmun10.bsky.social!

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
SocialCompUW/youtube-opioid-myth-detect-M1 ยท Hugging Face Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.

๐Ÿ™Œ We hope public health, platforms, & researchers build on MythTriage to scale OUD myth detection on video platforms.
To support this, weโ€™re releasing everything:
๐Ÿง  Models: huggingface.co/SocialCompUW...
๐Ÿ’ป Code: github.com/hayoungjungg...
๐Ÿ“Š Data: github.com/hayoungjungg...

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿคฉ Lastly, weโ€™re excited because this work shows how a decade-old, but simple ideaโ€”model cascadesโ€”scales with LLM advancements to tackle real high-stakes health issues like OUD myths.

Past work tested model cascades on standard benchmarks (e.g., SQuAD). We validate them in the wild!

08.09.2025 18:13 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Our findings offer actionable insights in the context of the ongoing opioid crisisโ€”showing the value of MythTriage:

๐Ÿ‘ฉโ€โš•๏ธPublic health: Inform targeted interventions & debunk myths.
๐Ÿ›ก๏ธPlatforms: Provides a scalable auditing pipeline to flag high-risk content & improve moderation.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ“ŠFinding #3: YouTubeโ€™s recommendation continued surfacing myth-supporting content.

โžก๏ธ12.7% of recs from myth videos led to more myths initiallyโ€”rising to 22% at deeper levels.

โš ๏ธ Moderation should target these rec pathways that reinforce harmful myths.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ“Š Finding #2: How you filter your search results matters! Switching from โ€œRelevanceโ€ to โ€œUpload Dateโ€ or โ€œRatingโ€ increases exposure to mythsโ€”echoing the same patterns seen in my COVID-19 misinformation audit: ojs.aaai.org/index.php/IC...

๐Ÿ˜ฌA few clicks can change your exposure to myths!

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐ŸซถThanks to MythTriage, we present the first large-scale study of OUD-related myths on YouTube!

๐Ÿ“Š Finding #1: Nearly 20% of YouTube search results support OUD myths, while 30% oppose.

๐Ÿ˜ฐDespite more opposing, myth-supporting content is widespreadโ€”and risks shaping how people understand treatment.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

โš™๏ธSo how does MythTriage perform?
๐Ÿ“Š Achieves 0.68-0.86 macro F1 and defers only 5-67% of the examples to the costly LLM.

In practice, MythTriage:
๐Ÿ’ธ Cuts financial costs by 98% vs experts and by 94% vs LLM labeling
โฑ๏ธ Cuts time costs by 96% vs experts & by 76% vs LLM labeling

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿš€ Our solution: MythTriage
๐Ÿ‘‰ Uses lightweight DeBERTa for routine cases
๐Ÿ‘‰ Defers harder ones to GPT-4o (high-performing but costly)

The trick? We distilled DeBERTa on GPT-4oโ€™s synthetic labelsโ€”achieving strong performance without massive expert-labeled data.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ’กChallenge: Detecting OUD myths on video platforms at *scale* is toughโ€“clinical expertise and labeling are essential, but it is slow and costly.

๐Ÿค–LLMs show promise, but high compute & API costsโ€”especially for long-form videoโ€”limit their practicality for large-scale detection.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Post image

๐Ÿฉบ To rigorously detect OUD myths in our datasets, we collaborated closely with clinical experts to:

โœ…Validate eight pervasive myths on OUD (see examples below!)
โœ…Create and refine annotation guidelines
โœ…Build a gold-standard dataset: 310 videos labeled across 8 myths (~2.5K expert labels).

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

To measure the scale and prevalence of myths on YouTube, we curated opioid and OUD search queries based on real-world search interests. Using these queries, we built two datasets on YouTube:

1๏ธโƒฃ OUD Search Dataset: 2.9K search results
2๏ธโƒฃ OUD Recs Dataset: 343K video recommendations

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿ›œFacing offline stigma, many turn to online platforms (YouTube) for health info & recovery.

โ€ผ๏ธBut myths fuel treatment hesitancy, distrust in healthcare, & stigma.

๐Ÿค”Understanding the scale of myths is crucial for health officials & platforms to design effective interventions.

08.09.2025 18:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐ŸšจYouTube is a key source of health info, but itโ€™s also rife with dangerous myths on opioid use disorder (OUD), a leading cause of death in the U.S.

To understand the scale of such misinformation, our #EMNLP2025 paper introduces MythTriage, a scalable system to detect OUD myth๐Ÿงต

08.09.2025 18:13 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

๐ŸŽ‰ Our work on attribution in summarization is now accepted to #EMNLP2025 main! ๐ŸŽ‰

"Unstructured Evidence Attribution for Long Context Query Focused Summarization"

w/ @zainmujahid.me , Lu Wang, @iaugenstein.bsky.social , and @davidjurgens.bsky.social

25.08.2025 11:42 โ€” ๐Ÿ‘ 22    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Want $50 to create your own feed with natural language?

Weโ€™re a group of researchers at Princeton studying how people could more easily build their own feeds.

Join our study to try it out and tell us what you think of the experience!

๐Ÿ‘‰ Sign up at forms.gle/MkSGKzxDfBEc... (takes less than 1 min)

22.07.2025 13:57 โ€” ๐Ÿ‘ 12    ๐Ÿ” 7    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 2

I would also love to be added!!

24.06.2025 13:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

On my way to Copenhagen, where I will give an invited talk at a workshop and present this work at ICWSM! Super excited to meet everyone -- please DM me if you would like to chat!

22.06.2025 03:35 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Thank you for the shoutout, Joey! :)

16.01.2025 05:16 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

13/ Huge thanks to Prof. Tanu Mitra (@tanumitra.bsky.social) for this incredible opportunity and to my amazing PhD mentor, Prof. Prerna Juneja (@prerna6.bsky.social), for guiding me throughout. I have learned so much from you and your support. Thank you for introducing me to the world of research!๐Ÿ™Œ

16.01.2025 01:40 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
SocialCompUW/youtube-covid-misinfo-detect ยท Hugging Face Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.

12/ We also release:
๐Ÿค– Classifier: huggingface.co/SocialCompUW...
๐Ÿ“‚ Data: github.com/social-comp/...
๐Ÿ“„ Paper: arxiv.org/abs/2409.10168
Check them out!

16.01.2025 01:39 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

11/ Our study reveals contrasting algorithmic behavior in YouTube Search across countries, highlighting the need for YouTube to regulate its algorithmic behaviors consistently to ensure an equitable, responsible search engine that delivers accurate info regardless of location ๐ŸŒ

16.01.2025 01:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

10/ This work also emphasizes the need for more audits in Global South contexts.

However, access to affordable and widely available proxies, VPNs, and VMs is a barrier, highlighting the need for greater support to facilitate more extensive audits in the Global South.

16.01.2025 01:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

9/ These findings have profound implications. SA usersโ€™ heightened exposure to misinformation could influence beliefs ๐Ÿง, health practices ๐Ÿฉบ, and societal challenges ๐Ÿฅ like vaccine hesitancy.

16.01.2025 01:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

8/ SA bots encountered significantly more misinformation in the default Relevance filter.

Since most users will likely engage with SERPs in this default settings, users in SA ๐Ÿ‡ฟ๐Ÿ‡ฆ face a higher likelihood of misinformation exposure, raising public health risks.

16.01.2025 01:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

7/ Thus, this shows how YouTubeโ€™s algorithm behaves inconsistently:
โœ… Relevance & View Count filters surfaced SERPs *opposing* misinfo.
โŒ Upload Date & Rating filters surfaced SERPs *supporting* misinfo.
This highlights potential gaps in YouTubeโ€™s content moderation policies.

16.01.2025 01:37 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@hayoungjung is following 20 prominent accounts