Jannis Vamvas's Avatar

Jannis Vamvas

@vamvas.bsky.social

Natural Language Processing (#NLProc #NLP) Researcher @ University of Zurich @cl-uzh.bsky.social vamvas.ch

124 Followers  |  95 Following  |  9 Posts  |  Joined: 17.10.2023  |  1.6589

Latest posts by vamvas.bsky.social on Bluesky

Post image

💥Introducing new paper: arxiv.org/pdf/2510.17715, QueST — train specialized generators to create challenging coding problems.
From Qwen3-8B-Base
✅ 100K synthetic problems: better than Qwen3-8B
✅ Combining with human written problems: matches DeepSeek-R1-671B
🧵(1/5)

21.10.2025 14:01 — 👍 4    🔁 3    💬 1    📌 0
Sina Ahmadi receiving award.

Sina Ahmadi receiving award.

Congratulations to @sinaahmadi.bsky.social and co-authors for receiving an ACL 2025 Outstanding Paper Award for PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages!

aclanthology.org/2025.acl-lon...

30.07.2025 15:10 — 👍 14    🔁 6    💬 0    📌 0
Sina Ahmadi, Rico Sennrich, Michelle Wastl, Jannis Vamvas, Andrianos Michail from the Department of Computational Linguistics posing in front of the at ACL 2025 Vienna step-and-repeat banner

Sina Ahmadi, Rico Sennrich, Michelle Wastl, Jannis Vamvas, Andrianos Michail from the Department of Computational Linguistics posing in front of the at ACL 2025 Vienna step-and-repeat banner

We're proud to see our researchers sharing their latest work: @sinaahmadi.bsky.social, @ricosennrich.bsky.social, @michellewastl.bsky.social, @vamvas.bsky.social and Andrianos Michail (pictured) and many others.
If you’re at #ACL, stop by to learn more!

29.07.2025 13:31 — 👍 16    🔁 4    💬 0    📌 0
Post image

Arrived in Vienna ready for #ACL2025NLP!

Excited to present papers with @vamvas.bsky.social @ricosennrich.bsky.social on Unsupervised Translation Direction Detection and Multilingual Hallucination Detection!

Come say hi! 👋
#NLProc #NLP #NMT #LLMs

26.07.2025 07:33 — 👍 7    🔁 2    💬 1    📌 0
Make a guess: How does GPT-4o tokenize the Lithuanian word ‘nebeprisikiškiakopūsteliautum’?
A.	⟨nebepr, is, ik, iškiak, opūste, liautum⟩
B.	⟨neb, pre, ski, kayak, opus, tell, autumn⟩
C.	⟨ne, be, pris, iki, ški, ak, op, ū, stel, ia, utum⟩

Make a guess: How does GPT-4o tokenize the Lithuanian word ‘nebeprisikiškiakopūsteliautum’? A. ⟨nebepr, is, ik, iškiak, opūste, liautum⟩ B. ⟨neb, pre, ski, kayak, opus, tell, autumn⟩ C. ⟨ne, be, pris, iki, ški, ak, op, ū, stel, ia, utum⟩

Can you solve my exam question?
I've come to believe that multiple-choice exams are underrated. More in my blog post, “The Joy of Multiple-Choice.” vamvas.ch/the-joy-of-m...

23.06.2025 10:39 — 👍 4    🔁 0    💬 0    📌 0
Post image Post image

I had the pleasure of presenting our paper “20min-XD: A Comparable Corpus of Swiss News Articles (arxiv.org/abs/2504.21677) ”, a joint work with @vamvas.bsky.social , Selena Calleri and @ricosennrich.bsky.social , at SwissText 2025 last week.

We are honored to receive Best Paper Award for it! ✨

19.05.2025 10:32 — 👍 9    🔁 3    💬 0    📌 0
Michelle Wastl at the SwissNLP Expo

Michelle Wastl at the SwissNLP Expo

Our Michelle Wastl is demonstrating her invention, the Translation Direction Detector, at the SwissNLP Expo!
Michelle's paper: arxiv.org/abs/2401.06769
Demo: huggingface.co/spaces/Zuric...
If you're at the expo, make sure to stop by the Department of Computational Linguistics UZH!

13.05.2025 09:22 — 👍 6    🔁 3    💬 0    📌 0
Bar chart illustrating counts of different API responses for the same request.

Bar chart illustrating counts of different API responses for the same request.

I also found what others complained about previously: The “Predicted Outputs” feature is unstable even with temp=0.

It’s unclear to me why this happens – do they batch the verification steps of users? An artifact of temperature sampling?

So, any hints would be appreciated! 4/4

21.04.2025 16:52 — 👍 1    🔁 0    💬 0    📌 0
3D plot comparing API behavior to simulation results.

3D plot comparing API behavior to simulation results.

Systematic probing of the OpenAI API reveals that they are using a lookahead parameter of K=16, and perform prefix matching with a prefix threshold of 32 tokens.

A simulation experiment confirms that the reverse-engineered algorithm matches the behavior of the API. 3/4

21.04.2025 16:52 — 👍 0    🔁 0    💬 1    📌 0
Schematic illustration of draft verification

Schematic illustration of draft verification

Draft verification is a well-known idea in NLP, from Speculative Decoding to algorithms leveraging a static draft, such as Aggressive Decoding and Prompt Lookup Decoding.

Clearly, “Predicted Outputs” does something similar. But how does it work, specifically? 2/4

21.04.2025 16:52 — 👍 0    🔁 0    💬 1    📌 0
Preview
OpenAI Platform Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.

OpenAI introduced the “Predicted Outputs” feature last November, but their documentation raises more questions than it answers. So let’s reverse-engineer the feature! 🧵1/4

platform.openai.com/docs/guides/...

21.04.2025 16:52 — 👍 0    🔁 0    💬 1    📌 0
Preview
OpenAI's Speculative Decoding, Reverse-Engineered Why LLMs are faster if we give them a draft to complete.

Blogged: Reverse-engineering OpenAI's Speculative Decoding.

I was curious how GPT-4o can make use of predicted outputs to speed up text generation.
vamvas.ch/openai-predi...

21.04.2025 16:52 — 👍 6    🔁 0    💬 1    📌 0
Post image

Congratulations to Dr. @shaitarova.bsky.social , who just successfully defended her thesis on "Generated Text as the Locus of Language Contact: Exploring the Impact of Generative AI on English and German". Special thanks to the external examiner Natalia Resende!

11.04.2025 09:30 — 👍 12    🔁 3    💬 2    📌 0
Post image

I'm thrilled to share my first PhD project, a joint work with
@vamvas.bsky.social and @ricosennrich.bsky.social
Paper link:
arxiv.org/pdf/2503.10494
Long context LLMs have paved the way for document translation, but is simply inputting the whole content the optimal way?
Here's the thread 🧵 [1/n]

14.03.2025 14:58 — 👍 8    🔁 3    💬 1    📌 0
Tannon Kew presenting during his viva.

Tannon Kew presenting during his viva.

Congratulations to Dr. @tannonk.bsky.social, who just successfully defended his thesis on "Leveraging Data, Decoding, and Context for Controlling Text Generation from Pretrained Language Models". Special thanks to the external examiner @feralvam.bsky.social!

06.12.2024 10:34 — 👍 18    🔁 5    💬 0    📌 0
Preview
UZH: PhD Student in Natural Language Processing You will be joining the Department of Computational Linguistics, which has 5 faculties and around 40 postdoctoral and student researchers in the areas of text technologies, phonetics and speech. You w...

Rico Sennrich (University of Zurich) is hiring a PhD student to work on representations of knowledge in multilingual language models, starting September 2024.

Rico is my former advisor and I can greatly recommend working with him. Apply by January 4: jobs.uzh.ch/offene-stell...

11.12.2023 13:50 — 👍 4    🔁 1    💬 0    📌 0
Preview
GitHub - ZurichNLP/mbr: Minimum Bayes Risk Decoding for Hugging Face Transformers Minimum Bayes Risk Decoding for Hugging Face Transformers - GitHub - ZurichNLP/mbr: Minimum Bayes Risk Decoding for Hugging Face Transformers

Minimum Bayes Risk decoding has recently gained momentum in #NLProc.

However, a generic implementation for @huggingface.bsky.social Transformers has been missing. Check out our new 𝗺𝗯𝗿🔥 repo, which is designed to work with any model and metric on the Hub: github.com/ZurichNLP/mbr

03.11.2023 12:50 — 👍 0    🔁 0    💬 0    📌 0
Preview
Paper page - Towards Unsupervised Recognition of Semantic Differences in Related Documents Join the discussion on this paper page

Can NLP be used for highlighting differences between two documents?
Our EMNLP paper investigates the task of Recognizing Semantic Differences (RSD) with simple, unsupervised approaches.

• Demo: huggingface.co/spaces/Zuric...
• Paper: huggingface.co/papers/2305....

#NLProc

19.10.2023 09:59 — 👍 10    🔁 1    💬 0    📌 0

@vamvas is following 20 prominent accounts