Antonios Dimakis @antoniosdimakis

@antoniosdimakis.bsky.social

PhD fellow at Archimedes Unit, Athena Research Center | PhD student at the National and Kapodistrian University of Athens. Interested in NLP for low-resource languages/terms, tokenization, and linguistics.

8 Followers | 15 Following | 4 Posts | Joined: 25.07.2025

Latest posts by antoniosdimakis.bsky.social on Bluesky

GitHub - andhmak/rule_dialnorm: Code and datasets associated with the paper titled "Dialect Normalization using Large Language Models and Morphological Rules"

Proud to work with John Pavlopoulos and @antonisa.bsky.social on this publication!

Check out the data and code here: github.com/andhmak/rule...

4/4

25.07.2025 17:52 — 👍 2 🔁 1 💬 0 📌 0

Regions clustered based on the embeddings of their proverbs. Normalized proverbs produce much more meaningful groupings.

We implement our method for Greek and experiment on a proverb dataset. This lets us very cheaply extend the NLU coverage of models pre-trained only on the standard variety to almost every Greek dialect.

After normalizing we even find cultural insights which were previously obscured!

3/4

25.07.2025 17:50 — 👍 1 🔁 0 💬 1 📌 0

Table showing normalization quality for different setups, with the full setup obtaining good scores.

"Dialect Normalization using Large Language Models and Morphological Rules"

By applying rule-based, linguistically informed transformations to the input before passing it to an LLM, with targeted few-shot prompting, we can obtain high-quality normalized outputs.

2/4

25.07.2025 17:48 — 👍 0 🔁 0 💬 1 📌 0
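
The pipeline described in this post (rule-based rewriting first, then a few-shot prompt for the LLM) can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the rewrite rules, the few-shot pair, and the names `apply_rules` and `build_prompt` are hypothetical placeholders, and the LLM call itself is left out.

```python
import re

# Hypothetical placeholder rules as (pattern, replacement) pairs.
# These are NOT real Greek dialect rules; they only show the mechanism.
RULES = [
    (re.compile(r"tz"), "ts"),
    (re.compile(r"ou\b"), "o"),
]

# Hypothetical few-shot pairs: (dialectal input, standard output).
FEW_SHOT = [
    ("dialect example one", "standard example one"),
]


def apply_rules(text, rules=RULES):
    """Apply each rewrite rule in order to the dialectal input."""
    for pattern, replacement in rules:
        text = pattern.sub(replacement, text)
    return text


def build_prompt(sentence, examples=FEW_SHOT):
    """Pre-normalize with the rules, then wrap the result in a few-shot prompt."""
    pre_normalized = apply_rules(sentence)
    parts = ["Normalize the dialectal sentence into the standard variety."]
    for source, target in examples:
        parts.append(f"Dialect: {source}\nStandard: {target}")
    parts.append(f"Dialect: {pre_normalized}\nStandard:")
    return "\n\n".join(parts)
```

The string returned by `build_prompt` would then be sent to whichever LLM is in use; the rules do the cheap, predictable part of the normalization so the model only has to handle what remains.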

Example of a dialectal sentence being normalized incorrectly when using a base LLM, and the same sentence normalized correctly using our method.

How can we make models understand dialectal input, even in dialects with very little data available?

Our work indicates that Rule-Based Normalization can significantly help.

If you're at #ACL2025, check out our poster on Monday at 6pm! aclanthology.org/2025.finding...

1/4

25.07.2025 17:46 — 👍 2 🔁 0 💬 1 📌 0

@antoniosdimakis is following 15 prominent accounts

@mariebexte

Zain Muhammad Mujahid
@zainmujahid.me

PhD Fellow | University of Copenhagen (CopeNLU) | Pioneer Centre for AI

@esquirar

Nikitas Theodoropoulos
@nikitas-theo

You can learn more about me here: https://nikitas-theo.github.io/

Layla Bouzoubaa
@laylab

PhD Candidate @ Drexel University, Info. Sci. 🇲🇦 Studying lived experiences of people who use #drugs, #stigma, #harmreduction through mixed methods, #NLProc. Public health background. Aspiring wine connoisseur & proud cat lady 🐱🐱🐱

Pavel
@p4m8

Athens, Greece

Verena Blaschke
@verenablaschke

PhD student @mainlp.bsky.social (@cislmu.bsky.social, LMU Munich). Interested in language variation & change, currently working on NLP for dialects and low-resource languages. verenablaschke.github.io

Prabin Bhandari
@iamprabin

I do research related to LLMs, their interaction with geospatial data, and leveraging them for information extraction. PhD in Computer Science at George Mason University.

Joshua Otten
@joshuaotten

CS PhD student @GeorgeMasonUniversity. NLP for ancient languages.

@fahimfaisal

Poorvi Acharya
@pooorvi

PhD student in NLP at GMU w/ Antonios Anastasopoulos. Focus: L2 acquisition, low-resource NLP, psycholinguistics. Passionate about empowering heritage speakers. Berkeley '19

Belu Ticona
@beluticona

CS PhD student @GeorgeMasonU @ComputacionUBA NLP, Speech &🤎 Language Technologies for Crisis Response, AI + Indigenous People 🌱 http://beluticona.github.io

Nate Krasner
@natekrasner

CS (NLP) PhD @ GMU. I work on multilinguality and multilingual encoder alignment, among other things.

Antonis Anastasopoulos
@antonisa

Assistant Prof at GMU. NLP, CompLing, ML, and other things language+humans

Sam Blouir
@samblouir

Thanks for coming to “Foundation Models for Biological Discoveries” (FMs4Bio) @ AAAI 2025!