💥Introducing new paper: arxiv.org/pdf/2510.17715, QueST — train specialized generators to create challenging coding problems.
From Qwen3-8B-Base
✅ 100K synthetic problems: better than Qwen3-8B
✅ Combining with human-written problems: matches DeepSeek-R1-671B
🧵(1/5)
21.10.2025 14:01 — 👍 4 🔁 3 💬 1 📌 0
Sina Ahmadi receiving award.
Congratulations to @sinaahmadi.bsky.social and co-authors for receiving an ACL 2025 Outstanding Paper Award for PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages!
aclanthology.org/2025.acl-lon...
30.07.2025 15:10 — 👍 14 🔁 6 💬 0 📌 0
Sina Ahmadi, Rico Sennrich, Michelle Wastl, Jannis Vamvas, Andrianos Michail from the Department of Computational Linguistics posing in front of the ACL 2025 Vienna step-and-repeat banner
We're proud to see our researchers sharing their latest work: @sinaahmadi.bsky.social, @ricosennrich.bsky.social, @michellewastl.bsky.social, @vamvas.bsky.social and Andrianos Michail (pictured) and many others.
If you’re at #ACL, stop by to learn more!
29.07.2025 13:31 — 👍 16 🔁 4 💬 0 📌 0
Arrived in Vienna ready for #ACL2025NLP!
Excited to present papers with @vamvas.bsky.social @ricosennrich.bsky.social on Unsupervised Translation Direction Detection and Multilingual Hallucination Detection!
Come say hi! 👋
#NLProc #NLP #NMT #LLMs
26.07.2025 07:33 — 👍 7 🔁 2 💬 1 📌 0
Make a guess: How does GPT-4o tokenize the Lithuanian word ‘nebeprisikiškiakopūsteliautum’?
A. ⟨nebepr, is, ik, iškiak, opūste, liautum⟩
B. ⟨neb, pre, ski, kayak, opus, tell, autumn⟩
C. ⟨ne, be, pris, iki, ški, ak, op, ū, stel, ia, utum⟩
Can you solve my exam question?
I've come to believe that multiple-choice exams are underrated. More in my blog post, “The Joy of Multiple-Choice.” vamvas.ch/the-joy-of-m...
23.06.2025 10:39 — 👍 4 🔁 0 💬 0 📌 0
I had the pleasure of presenting our paper “20min-XD: A Comparable Corpus of Swiss News Articles” (arxiv.org/abs/2504.21677), a joint work with @vamvas.bsky.social, Selena Calleri and @ricosennrich.bsky.social, at SwissText 2025 last week.
We are honored to receive the Best Paper Award for it! ✨
19.05.2025 10:32 — 👍 9 🔁 3 💬 0 📌 0
Michelle Wastl at the SwissNLP Expo
Our Michelle Wastl is demonstrating her invention, the Translation Direction Detector, at the SwissNLP Expo!
Michelle's paper: arxiv.org/abs/2401.06769
Demo: huggingface.co/spaces/Zuric...
If you're at the expo, make sure to stop by the Department of Computational Linguistics UZH!
13.05.2025 09:22 — 👍 6 🔁 3 💬 0 📌 0
Bar chart illustrating counts of different API responses for the same request.
I also found what others complained about previously: The “Predicted Outputs” feature is unstable even with temp=0.
It’s unclear to me why this happens – do they batch the verification steps of users? An artifact of temperature sampling?
So, any hints would be appreciated! 4/4
21.04.2025 16:52 — 👍 1 🔁 0 💬 0 📌 0
3D plot comparing API behavior to simulation results.
Systematic probing of the OpenAI API reveals that they use a lookahead parameter of K=16 and perform prefix matching with a prefix threshold of 32 tokens.
A simulation experiment confirms that the reverse-engineered algorithm matches the behavior of the API. 3/4
21.04.2025 16:52 — 👍 0 🔁 0 💬 1 📌 0
Schematic illustration of draft verification
Draft verification is a well-known idea in NLP, from Speculative Decoding to algorithms leveraging a static draft, such as Aggressive Decoding and Prompt Lookup Decoding.
Clearly, “Predicted Outputs” does something similar. But how does it work, specifically? 2/4
21.04.2025 16:52 — 👍 0 🔁 0 💬 1 📌 0
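The draft verification idea from the post above can be illustrated with a minimal Python sketch. It is not OpenAI's implementation: the function names, the toy model, and the realignment heuristic are all assumptions made for illustration, and the prefix-matching rule with a 32-token threshold reported in the thread is not reproduced here. Only the lookahead of 16 is taken from the thread.

```python
from typing import Callable, List, Sequence, Tuple

EOS = "<eos>"  # hypothetical end-of-sequence marker


def make_toy_model(target: Sequence[str]) -> Callable[[Sequence[str]], str]:
    """Toy deterministic 'model' that always continues a fixed target text."""
    def next_token(prefix: Sequence[str]) -> str:
        return target[len(prefix)] if len(prefix) < len(target) else EOS
    return next_token


def decode_with_static_draft(
    next_token: Callable[[Sequence[str]], str],
    draft: Sequence[str],
    max_len: int,
    lookahead: int = 16,
) -> Tuple[List[str], int]:
    """Greedy decoding that verifies chunks of a static draft.

    Returns the output tokens and the number of verification rounds.
    Each round stands in for one parallel forward pass of a real model.
    """
    output: List[str] = []
    pos = 0      # current position in the draft
    rounds = 0
    while len(output) < max_len:
        rounds += 1
        chunk = list(draft[pos:pos + lookahead])
        # Accept draft tokens for as long as the model agrees with them.
        accepted = 0
        for tok in chunk:
            predicted = next_token(output + chunk[:accepted])
            if predicted != tok or predicted == EOS:
                break
            accepted += 1
        output.extend(chunk[:accepted])
        pos += accepted
        # The same pass also yields the model's own token at the first
        # position that was not verified.
        correction = next_token(output)
        if correction == EOS:
            break
        output.append(correction)
        # Naive realignment (an assumption): look for the corrected token
        # in the next draft window, else treat the mismatch as a substitution.
        window = list(draft[pos:pos + lookahead])
        if correction in window:
            pos += window.index(correction) + 1
        else:
            pos += 1
    return output[:max_len], rounds


target = [f"t{i}" for i in range(40)]
draft = list(target)
draft[5] = "x"  # the draft differs from the model's output in one place
tokens, rounds = decode_with_static_draft(make_toy_model(target), draft, max_len=64)
print(len(tokens), "tokens in", rounds, "rounds")
```

The output is always the model's own greedy continuation; the draft only changes how many rounds are needed. With an empty draft, the loop falls back to one token per round.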
OpenAI Platform
Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.
OpenAI introduced the “Predicted Outputs” feature last November, but their documentation raises more questions than it answers. So let’s reverse-engineer the feature! 🧵1/4
platform.openai.com/docs/guides/...
21.04.2025 16:52 — 👍 0 🔁 0 💬 1 📌 0
OpenAI's Speculative Decoding, Reverse-Engineered
Why LLMs are faster if we give them a draft to complete.
Blogged: Reverse-engineering OpenAI's Speculative Decoding.
I was curious how GPT-4o can make use of predicted outputs to speed up text generation.
vamvas.ch/openai-predi...
21.04.2025 16:52 — 👍 6 🔁 0 💬 1 📌 0
Congratulations to Dr. @shaitarova.bsky.social, who just successfully defended her thesis on "Generated Text as the Locus of Language Contact: Exploring the Impact of Generative AI on English and German". Special thanks to the external examiner Natalia Resende!
11.04.2025 09:30 — 👍 12 🔁 3 💬 2 📌 0
I'm thrilled to share my first PhD project, a joint work with
@vamvas.bsky.social and @ricosennrich.bsky.social
Paper link:
arxiv.org/pdf/2503.10494
Long context LLMs have paved the way for document translation, but is simply inputting the whole content the optimal way?
Here's the thread 🧵 [1/n]
14.03.2025 14:58 — 👍 8 🔁 3 💬 1 📌 0
Tannon Kew presenting during his viva.
Congratulations to Dr. @tannonk.bsky.social, who just successfully defended his thesis on "Leveraging Data, Decoding, and Context for Controlling Text Generation from Pretrained Language Models". Special thanks to the external examiner @feralvam.bsky.social!
06.12.2024 10:34 — 👍 18 🔁 5 💬 0 📌 0
UZH: PhD Student in Natural Language Processing
You will be joining the Department of Computational Linguistics, which has 5 faculties and around 40 postdoctoral and student researchers in the areas of text technologies, phonetics and speech. You w...
Rico Sennrich (University of Zurich) is hiring a PhD student to work on representations of knowledge in multilingual language models, starting September 2024.
Rico is my former advisor and I can greatly recommend working with him. Apply by January 4: jobs.uzh.ch/offene-stell...
11.12.2023 13:50 — 👍 4 🔁 1 💬 0 📌 0
GitHub - ZurichNLP/mbr: Minimum Bayes Risk Decoding for Hugging Face Transformers
Minimum Bayes Risk decoding has recently gained momentum in #NLProc.
However, a generic implementation for @huggingface.bsky.social Transformers has been missing. Check out our new 𝗺𝗯𝗿🔥 repo, which is designed to work with any model and metric on the Hub: github.com/ZurichNLP/mbr
03.11.2023 12:50 — 👍 0 🔁 0 💬 0 📌 0
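For readers new to the idea, Minimum Bayes Risk decoding picks, from a set of sampled candidates, the one with the highest average utility against the other samples. The following is a minimal pure-Python sketch of that selection step. It does not use the API of the mbr repo; `unigram_f1` and `mbr_select` are hypothetical names, and the unigram F1 utility is a stand-in for a real metric.

```python
from collections import Counter
from typing import Callable, Optional, Sequence


def unigram_f1(hypothesis: str, reference: str) -> float:
    """Toy utility: F1 over whitespace-separated tokens."""
    hyp, ref = Counter(hypothesis.split()), Counter(reference.split())
    overlap = sum((hyp & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def mbr_select(
    candidates: Sequence[str],
    utility: Callable[[str, str], float] = unigram_f1,
    references: Optional[Sequence[str]] = None,
) -> str:
    """Return the candidate with the highest expected utility.

    If no separate pseudo-references are given, the candidates
    themselves serve as references.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    refs = candidates if references is None else references

    def expected_utility(candidate: str) -> float:
        return sum(utility(candidate, ref) for ref in refs) / len(refs)

    return max(candidates, key=expected_utility)


samples = [
    "the cat sat on the mat",
    "the cat sat on a mat",
    "dogs bark loudly",
    "the cat sat on the mat",
]
print(mbr_select(samples))
```

In practice the samples would come from a model and the utility would be a learned or string-based metric; the selection logic stays the same.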
Paper page - Towards Unsupervised Recognition of Semantic Differences in Related Documents
Can NLP be used for highlighting differences between two documents?
Our EMNLP paper investigates the task of Recognizing Semantic Differences (RSD) with simple, unsupervised approaches.
• Demo: huggingface.co/spaces/Zuric...
• Paper: huggingface.co/papers/2305....
#NLProc
19.10.2023 09:59 — 👍 10 🔁 1 💬 0 📌 0
Host of scientific papers for @aclmeeting.bsky.social and other venues in the field of natural language processing. https://aclanthology.org/
#NLP #NLProc
PhD student @mainlp.bsky.social (@cislmu.bsky.social, LMU Munich). Interested in language variation & change, currently working on NLP for dialects and low-resource languages.
verenablaschke.github.io
Assistant Professor of Computational Linguistics @ Georgetown; formerly postdoc @ ETH Zurich; PhD @ Harvard Linguistics, affiliated with MIT Brain & Cog Sci. Language, Computers, Cognition.
Computational Linguist and Professional Nerd at Georgetown University
he/him pronouns, ALL the prepositions. http://nathan.cl
Lecturer@Queen's Uni Belfast; postdoc&PhD@Edinburgh Uni. I work on LLM post-training, multilingualism, machine translation, and financial AI.
Postdoctoral researcher @University of Zurich
linguist
PhD student at the University of Zurich. Trying to get to know what LLMs know🤔
Swiss Investigative Tech Journalist.
Awards 2018, 2020, 2021, 2023, 2024
Investigative tech reporter.
Republik.ch & tech journalism blog DNIP.ch.
Own blog: Techjournalismus.ch
#Privacy, #DigitaleEthik, #Cybersecurity, #Surveillance, #Cloud
Slator.com is the leading source of analysis and research for the global translation, localization, and language technology industry.
DiLi lab at the Department of Computational Linguistics, University of Zurich. 👀🤖📖🧠💬 https://www.cl.uzh.ch/en/research-groups/digital-linguistics.html
« Ὅσον ζῇς φαίνου »
Researcher in computational linguistics at University of Zurich
I am a scientist specializing in phenomena around communication with speech and voice.
PhD candidate at the Department of Computational Linguistics, University of Zurich @cl-uzh.bsky.social
Impact of GenAI on Language | Language Change | Psycholinguistics | Digital Humanities | Negation Resolution
Senior Research Scientist at Google DeepMind. Equitable AI, language, gender, society. She/her.
🌐 jasmijn.bastings.me
Postdoc at @sardine-lab-it.bsky.social working on fair and safe language technologies. | gattanasio.cc | he/him | http://questovirgolettatoesiste.com
Researcher at University of Zurich
https://www.juriopitz.com/