Janet Liu's Avatar

Janet Liu

@janetlauyeung.bsky.social

https://janetlauyeung.github.io/ ๐Ÿ›Ž๏ธ postdoc @mainlp.bsky.social, LMU Munich ๐Ÿค  PhD in CompLing from Georgetown ๐Ÿ•บ๐Ÿป prev: x2 intern @Spotify @SpotifyResearch

722 Followers  |  149 Following  |  19 Posts  |  Joined: 15.10.2023  |  1.8958

Latest posts by janetlauyeung.bsky.social on Bluesky

Piper title ("A multi-dialectal dataset for German dialect ASR and dialect-to-standard speech translation") and a map of the German state Bavaria showing where the Franconian, Bavarian, and Alemannic dialect groups are spoken

Piper title ("A multi-dialectal dataset for German dialect ASR and dialect-to-standard speech translation") and a map of the German state Bavaria showing where the Franconian, Bavarian, and Alemannic dialect groups are spoken

At #Interspeech2025 I'm going to present Betthupferl, a dataset for German dialect ASR & dialect-to-standard speech translation! We analyze differences between dialectal & Standard German transcriptions, benchmark ASR models, and examine shortcomings of current ASR models & evaluation metrics.

07.08.2025 08:46 โ€” ๐Ÿ‘ 9    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Unsure which presentations to attend at #ACL2025? ๐Ÿ›Ž๏ธ๐Ÿ—ฃ๏ธ

27.07.2025 09:56 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ•บ๐Ÿผswing by our poster in Hall 4/5 on Wednesday, July 30 at 11:00 to chat with @florian-eichin.com and I to find out the answers to these questions

๐Ÿ›Ž๏ธ bonus: to see the full poster ๐Ÿซฃ๐Ÿงฉ

#ACL2025 #NLProc

23.07.2025 14:03 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Some recommendations for #ACL2025 ๐Ÿ‘‡
(join me and @janetlauyeung.bsky.social to talk about discourse generalization and probing!)

23.07.2025 12:38 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Headed to ACL? MaiNLP & our most recent work will be there too๐Ÿ‘ฅ๐Ÿ“„
Come see what weโ€™ve been working on!

23.07.2025 12:29 โ€” ๐Ÿ‘ 14    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

๐Ÿ’ก more findings, error analysis, and in-depth discussion are in our paper:

๐Ÿ“„ arxiv.org/abs/2503.10515
๐Ÿค– github.com/mainlp/disco...

meet and chat with us at our poster in Vienna ๐Ÿ‡ฆ๐Ÿ‡น at #ACL2025NLP

๐Ÿ•ฐ๏ธ 11:00-12:30, Wednesday, July 30
๐Ÿ“ Hall 4/5 Session 12: IP-Posters

10.07.2025 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Layer-wise probe performance by languages. Mean accuracy over five runs.

Layer-wise probe performance by languages. Mean accuracy over five runs.

๐Ÿ” finding 3: discourse representations are best aligned across languages in the intermediate layers

10.07.2025 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Mean accuracy over five runs of the Aya-23-35B-probe trained and tested on various partitions of DISRPT.

Mean accuracy over five runs of the Aya-23-35B-probe trained and tested on various partitions of DISRPT.

๐ŸŒ finding 2: our probes generalize across languages and language families

10.07.2025 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Mean accuracy over five runs of the probing classifiers trained on the entire DISRPT and full attention representations. The reference system DisCoDisCo achieved a mean accuracy of 47.9% (the red dashed line).

Mean accuracy over five runs of the probing classifiers trained on the entire DISRPT and full attention representations. The reference system DisCoDisCo achieved a mean accuracy of 47.9% (the red dashed line).

๐Ÿ“Œ finding 1: model size alone does not lead to discourse probing success; instead, multilingual training, dataset composition, and language-specific factors play significant roles

10.07.2025 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿงช for 23 SOTA LLMs, we use a probing approach to test whether their representations encode information relevant to discourse relation classification on DISRPT 2023, which covers 13 languages, four frameworks, 26 datasets, and various genres, domains, and modalities

10.07.2025 12:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Examples of the core discourse relation CONDITION (Bunt and Prasad, 2016) annotated in different frameworks and languages using different labels.

Examples of the core discourse relation CONDITION (Bunt and Prasad, 2016) annotated in different frameworks and languages using different labels.

the proposed unified label set (see definitions and examples in the appendix of the paper)

the proposed unified label set (see definitions and examples in the appendix of the paper)

โ“problem: discourse relations are central to NLU, but current work is primarily fragmented across frameworks & languages

๐Ÿ”ง solution: we proposed a unified label set of 17 relations across 4 discourse frameworks. This lets us compare model behavior across corpora, languages, and annotation schemes

10.07.2025 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
to appear at ACL2025

to appear at ACL2025

๐Ÿฆ™ how well do LLMs encode discourse knowledge? does that generalize across languages?

๐Ÿ›Ž๏ธ in our #ACL2025 paper, we uncover fascinating trends about multilingual discourse representations!

joint work w/ @florian-eichin.com @barbaraplank.bsky.social @mhedderich.bsky.social

๐Ÿ“„ arxiv.org/abs/2503.10515

10.07.2025 12:38 โ€” ๐Ÿ‘ 16    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

I'm looking for a reviewer for a paper on measuring syntactic productivity (lots of maths!) due a week from now. Please shoot me an email if you could review!

11.06.2025 20:36 โ€” ๐Ÿ‘ 0    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Bavarian dialect speakers needed! Our MSc student Miriam wants to find out 1. how good/bad LLM-generated "Bavarian" is, and 2. whether dialect speakers agree with each other on this. The survey takes <5 min: survey.ifkw.lmu.de/dialquali25/ Thank you for sharing/participating!

30.05.2025 14:17 โ€” ๐Ÿ‘ 3    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

@munichcenterml.bsky.social
@slds-lmu.bsky.social
@munichcenterml.bsky.social
@berd-nfdi.bsky.social

16.05.2025 13:23 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

my amazing co-organizers: @assenmacher.bsky.social Jacob Beck, @barbaraplank.bsky.social , @stephnie.bsky.social, Frauke Kreuter, Gina Walejko

16.05.2025 13:23 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Welcome to the First Workshop on Bridging NLP and Public Opinion Research, co-located with COLM 2025, October 10, 2025, Montreal, Canada.

Welcome to the First Workshop on Bridging NLP and Public Opinion Research, co-located with COLM 2025, October 10, 2025, Montreal, Canada.

๐Ÿ›Ž๏ธ Excited to announce the 1st Workshop on Bridging NLP and Public Opinion Research at COLM 2025, Oct 10th in Montreal ๐Ÿ‡จ๐Ÿ‡ฆ

As LLMs reshape public discourse and research, collaboration between NLP and Public Opinion Research (POR) is more vital than ever #NLPOR Submit by June 23๐Ÿ“„

๐Ÿ”— tinyurl.com/nlpor25

16.05.2025 13:23 โ€” ๐Ÿ‘ 18    ๐Ÿ” 10    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

BlackboxNLP, the leading workshop on interpretability and analysis of language models, will be co-located with EMNLP 2025 in Suzhou this November! ๐Ÿ“†

This edition will feature a new shared task on circuits/causal variable localization in LMs, details here: blackboxnlp.github.io/2025/task

15.05.2025 08:21 โ€” ๐Ÿ‘ 21    ๐Ÿ” 8    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 4

On my way to #NAACL2025 where I'll give a keynote at the noisy text workshop (WNUT), presenting some of the challenges & methods for dialect NLP + also discussing dialect speakers' perspectives!

๐Ÿ—จ๏ธ Beyond โ€œnoisyโ€ text: How (and why) to process dialect data
๐Ÿ—“๏ธ Saturday, May 3, 9:30โ€“10:30

29.04.2025 09:17 โ€” ๐Ÿ‘ 27    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Logo for MIB: A Mechanistic Interpretability Benchmark

Logo for MIB: A Mechanistic Interpretability Benchmark

Lots of progress in mech interp (MI) lately! But how can we measure when new mech interp methods yield real improvements over prior work?

We propose ๐Ÿ˜Ž ๐— ๐—œ๐—•: a ๐— echanistic ๐—œnterpretability ๐—•enchmark!

23.04.2025 18:15 โ€” ๐Ÿ‘ 49    ๐Ÿ” 15    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 6
The hand-drawn sign from three years ago.

The hand-drawn sign from three years ago.

๐ŸŽ‰MaiNLP is turning 3 today!๐ŸŽ‚๐Ÿฅณ Weโ€™ve grown a lot since @barbaraplank.bsky.social started this group with nothing but three aspiring researches and a hand-drawn sign on the door. Huge thanks to all the amazing people who have joined or visited us since. Hereโ€™s to many more years of exciting research!๐Ÿš€

01.04.2025 10:40 โ€” ๐Ÿ‘ 19    ๐Ÿ” 9    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

๐ŸŽฏ

28.03.2025 08:44 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

this ๐ŸŽฏ๐ŸŽฏ

26.03.2025 19:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Welcome back, astronauts! A LOT has changed since you left... #DailyShow #NASA #Trump #DesiLydic TikTok video by The Daily Show

www.tiktok.com/@thedailysho...

24.03.2025 16:57 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc

19.11.2024 03:48 โ€” ๐Ÿ‘ 107    ๐Ÿ” 37    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3
Preview
โ€œSeeing the Big through the Smallโ€: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations? Beiduo Chen, Xinpeng Wang, Siyao Peng, Robert Litschko, Anna Korhonen, Barbara Plank. Findings of the Association for Computational Linguistics: EMNLP 2024. 2024.

Beiduo Chen from @MaiNLPlab: โ‰๏ธCan LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

๐Ÿ“poster session 6, Nov 13 (Wed) 10:30-12:00
๐Ÿ“œ aclanthology.org/2024.finding...

11.11.2024 17:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models Philipp Mondorf, Barbara Plank. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024.

Philipp Mondorf from @MaiNLPlab: ๐Ÿง๐ŸปLiar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models

๐Ÿ“poster session 6, Nov 13 (Wed) 10:30-12:00
๐Ÿ“œ aclanthology.org/2024.emnlp-m...

11.11.2024 17:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

2โƒฃ eRST: A Signaled Graph Theory of Discourse Relations and Organization (CL journal)

๐Ÿ“oral session 6, Nov 13 (Wed) 10:30-12:00
๐Ÿ“œ direct.mit.edu/coli/article...

11.11.2024 17:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes. Proceedings of the 2024 Conference on Empirical Methods in...

1โƒฃ GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains

๐Ÿ“poster session 3, Nov 12 (Tue) 14:00-15:30
๐Ÿ“œ aclanthology.org/2024.emnlp-m...
๐Ÿ–ฅ๏ธgithub.com/gucorpling/gumโ€ฆ

11.11.2024 17:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐ŸŒด in Miami for #EMNLP2024 this week ๐ŸŒด

1st conference {of the year | with @MaiNLP | as a postdoc}:
> 2 papers to present, SPF 50 on standby
> with my new labmates (who are doing cool work๐Ÿคฉ)
> reunite with @GUCompLing folks ๐Ÿ™Œ
> come say hi and catch up ๐Ÿ•บ๐Ÿป

see ๐Ÿงต

11.11.2024 17:34 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@janetlauyeung is following 20 prominent accounts