GitHub - Kiryukhasemenov/InFlags: Python package for dictionary-based inline tokenization preprocessing
Python package for dictionary-based inline tokenization preprocessing - Kiryukhasemenov/InFlags
Our paper at TokShop
InCa and InDia: more stable and interpretable tokenizer preprocessing that handles casing and diacritization!
Check out our:
💻package: github.com/Kiryukhaseme...
🎥video: www.youtube.com/watch?v=XgDP...
📝paper: openreview.net/pdf?id=9GwVW...
25.07.2025 12:54 — 👍 1 🔁 0 💬 0 📌 0
Terminology Translation Task
📣Take part in 3rd Terminology shared task @WMT!📣
This year:
👉5 language pairs: EN->{ES, RU, DE, ZH},
👉2 tracks - sentence-level and doc-level translation,
👉authentic data from 2 domains: finance and IT!
www2.statmt.org/wmt25/termin...
Don't miss an opportunity - we only do it once in two years😏
06.06.2025 15:54 — 👍 2 🔁 2 💬 0 📌 2