Excited to be joining #ACL2025NLP in Vienna ๐ฆ๐น!
DM me if you would like to meet up and chat ๐
@ayukh.bsky.social
Statistics MSc @ ETH Zurich Multilingual LLM training/eval/safety @ SRI lab ayukh.com
Excited to be joining #ACL2025NLP in Vienna ๐ฆ๐น!
DM me if you would like to meet up and chat ๐
A powerful step for linguistic tech: ETH Zurich student Hanna Yukhymenko developed MamayLM โ a Ukrainian #LLM fluent in ๐บ๐ฆ & ๐ฌ๐ง, capturing language, culture & history. Supervised by Prof. Vechev & alumnus A. Alexandrov, in collab with INSAIT. bit.ly/3ED4o5k
@ayukh.bsky.social @ethzurich.bsky.social
It's just a game to these people
08.03.2025 02:02 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0ะจะพ?
08.03.2025 01:59 โ ๐ 0 ๐ 0 ๐ฌ 2 ๐ 01/1 complete
13.12.2024 06:08 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0๐ข Our benchmark on self-supervised learning for single-cell data๐งฌ is accepted at the #NeurIPS2024 SSL workshop. We take a first step towards establishing best practices for SSL methods for single-cell data, and benchmark 8 SSL methods on 3 downstream tasks across 8 datasets.
12.12.2024 00:30 โ ๐ 6 ๐ 1 ๐ฌ 1 ๐ 2Watch me stalk Kaggle in Vancouver to get stickers
07.12.2024 18:04 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Is MMLU Western-centric? ๐ค
As part of a massive cross-institutional collaboration:
๐ฝFind MMLU is heavily overfit to western culture
๐ Professional annotation of cultural sensitivity data
๐ Release improved Global-MMLU 42 languages
๐ Paper: arxiv.org/pdf/2412.03304
๐ Data: hf.co/datasets/Coh...
Going to #neurips2024 next week
05.12.2024 01:44 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 0Yes, I should have specified it before maybe :)
04.12.2024 02:13 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0This means basically more publicly available materials, yes
For example, more ready-to-use data (e.g. web scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals etc. This screenshot is from INCLUDE paper by Cohere which has Ukrainian exams in it, thus a new resource for eval๐
Before I have seen many papers claiming Ukrainian language to be low-resource, even though there are ~40 mil UA speakers worldwide, so there should be a lot of proof to that
Since 2022 Ukrainian NLP effort has dramatically increased and the number of Ukrainian texts available online has increased
ะฆะตะน ะดะตะฝั ะฝะฐััะฐะฒ - Ukrainian is finally recognized as a mid-resource language ๐บ๐ฆ๐ฆ ๐ฆ ๐ฆ
04.12.2024 00:43 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Exciting end of the year!
- Won the GraySwanAI jailbreaking challenge for harmful code generation
- Proud Ukrainian ambassador for Cohere4AI new Aya Expanse models
- Started my master thesis๐ฉโ๐ณ๐บ๐ฆ๐
- Going to Vancouver๐จ๐ฆ for @neuripsconf.bsky.social to chat about LLM privacy and SynthPAI
#neurips
Seems to be a hot take here: why are people getting mad about stuff they post *themselves* online? Your posts online are getting scraped all the time and you chose an open-source movement leader as a scapegoat.
Just initiate a discussion with HF about the "right to be forgotten" from GDPR
๐
25.11.2024 13:26 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Zaporizhzhia, a big Ukrainian city of almost one million, is under a massive drone attack. terrorism is russian culture.
18.11.2024 19:48 โ ๐ 267 ๐ 118 ๐ฌ 11 ๐ 3One year of ChatGPT has shown incredible capabilities of LLMs. However, they still have lots of problems! The LVE project aims at addressing this - with LVEs we track LLM vulnerabilities and exposures in an open-source community-first approach.
Contribute and more info: lve-project.org
#NLP #LLM
ะขัะตะด ะท ะบะพัะธัะฝะธะผะธ ะฟะพัะฐะดะฐะผะธ ะดะปั ัะธั , ั ัะพ ัะพะนะฝะพ ะดะพะปััะธะฒัั. ๐งต
26.05.2023 19:20 โ ๐ 1249 ๐ 360 ๐ฌ 62 ๐ 136ะัะฟะธะปะฐ ัะบัะฐัะฝััะบั ะฒะฐัะตะฝะธะบะธ ะฒ ะจะฒะตะนัะฐััั
09.07.2023 20:37 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0๐ค๐ซก
03.07.2023 17:57 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0ะฏะบ ะบะฐะถััั
03.07.2023 12:56 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0ะัะบัั ะทะฐ ัะฝะฒะฐะนั๐
03.07.2023 12:43 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0