
Hanna Yukhymenko

@ayukh.bsky.social

Statistics MSc @ ETH Zurich  |  Multilingual LLM training/eval/safety @ SRI lab  |  ayukh.com

129 Followers  |  472 Following  |  17 Posts  |  Joined: 03.07.2023

Posts by Hanna Yukhymenko (@ayukh.bsky.social)

Excited to be joining #ACL2025NLP in Vienna 🇦🇹!
DM me if you would like to meet up and chat 👋

25.07.2025 10:01 | 👍 3    🔁 1    💬 0    📌 0
Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM. A blog post by Institute for Computer Science, Artificial Intelligence and Technology on Hugging Face

A powerful step for linguistic tech: ETH Zurich student Hanna Yukhymenko developed MamayLM – a Ukrainian #LLM fluent in 🇺🇦 & 🇬🇧, capturing language, culture & history. Supervised by Prof. Vechev & alumnus A. Alexandrov, in collab with INSAIT. bit.ly/3ED4o5k
@ayukh.bsky.social @ethzurich.bsky.social

24.04.2025 07:05 | 👍 3    🔁 1    💬 0    📌 0

It's just a game to these people

08.03.2025 02:02 | 👍 1    🔁 0    💬 0    📌 0

What?

08.03.2025 01:59 | 👍 0    🔁 0    💬 2    📌 0

1/1 complete

13.12.2024 06:08 | 👍 1    🔁 0    💬 0    📌 0

📢 Our benchmark on self-supervised learning for single-cell data 🧬 is accepted at the #NeurIPS2024 SSL workshop. We take a first step towards establishing best practices for SSL methods for single-cell data, and benchmark 8 SSL methods on 3 downstream tasks across 8 datasets.

12.12.2024 00:30 | 👍 6    🔁 1    💬 1    📌 2

Watch me stalk Kaggle in Vancouver to get stickers

07.12.2024 18:04 | 👍 0    🔁 0    💬 1    📌 0

Is MMLU Western-centric? 🤔

As part of a massive cross-institutional collaboration:
🗽 Find MMLU is heavily overfit to western culture
🔍 Professional annotation of cultural sensitivity data
🌍 Release improved Global-MMLU 42 languages

📜 Paper: arxiv.org/pdf/2412.03304
📂 Data: hf.co/datasets/Coh...

05.12.2024 16:31 | 👍 59    🔁 12    💬 7    📌 7

Going to #neurips2024 next week

05.12.2024 01:44 | 👍 3    🔁 0    💬 0    📌 0

Yes, I should have specified it before maybe :)

04.12.2024 02:13 | 👍 0    🔁 0    💬 0    📌 0

This means basically more publicly available materials, yes
For example, more ready-to-use data (e.g. web scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals etc. This screenshot is from INCLUDE paper by Cohere which has Ukrainian exams in it, thus a new resource for eval 🙂

04.12.2024 02:11 | 👍 1    🔁 0    💬 1    📌 0

Before I have seen many papers claiming Ukrainian language to be low-resource, even though there are ~40 mil UA speakers worldwide, so there should be a lot of proof to that
Since 2022 Ukrainian NLP effort has dramatically increased and the number of Ukrainian texts available online has increased

04.12.2024 02:09 | 👍 1    🔁 0    💬 1    📌 0

This day has come - Ukrainian is finally recognized as a mid-resource language 🇺🇦🦅🦅🦅

04.12.2024 00:43 | 👍 0    🔁 0    💬 1    📌 0

Exciting end of the year!
- Won the GraySwanAI jailbreaking challenge for harmful code generation
- Proud Ukrainian ambassador for Cohere4AI new Aya Expanse models
- Started my master thesis 👩‍🍳🇺🇦👀
- Going to Vancouver 🇨🇦 for @neuripsconf.bsky.social to chat about LLM privacy and SynthPAI

#neurips

29.11.2024 00:47 | 👍 2    🔁 0    💬 0    📌 0

Seems to be a hot take here: why are people getting mad about stuff they post *themselves* online? Your posts online are getting scraped all the time and you chose an open-source movement leader as a scapegoat.
Just initiate a discussion with HF about the "right to be forgotten" from GDPR

28.11.2024 12:38 | 👍 1    🔁 0    💬 0    📌 0

👋

25.11.2024 13:26 | 👍 0    🔁 0    💬 0    📌 0

Zaporizhzhia, a big Ukrainian city of almost one million, is under a massive drone attack. terrorism is russian culture.

18.11.2024 19:48 | 👍 265    🔁 118    💬 11    📌 3
LVE Repository: We document and track vulnerabilities and exposures of large language models (LVEs).

One year of ChatGPT has shown incredible capabilities of LLMs. However, they still have lots of problems! The LVE project aims at addressing this - with LVEs we track LLM vulnerabilities and exposures in an open-source community-first approach.

Contribute and more info: lve-project.org
#NLP #LLM

12.12.2023 23:57 | 👍 5    🔁 0    💬 0    📌 0
Post image 02.11.2023 16:02 | 👍 43    🔁 7    💬 1    📌 0

A thread with useful tips for those who have just joined. 🧵

26.05.2023 19:20 | 👍 1244    🔁 357    💬 61    📌 135

Bought Ukrainian varenyky in Switzerland

09.07.2023 20:37 | 👍 2    🔁 0    💬 1    📌 0

🤙🫡

03.07.2023 17:57 | 👍 1    🔁 0    💬 0    📌 0

As they say

03.07.2023 12:56 | 👍 1    🔁 0    💬 1    📌 0

Π”ΡΠΊΡƒΡŽ Π·Π° Ρ–Π½Π²Π°ΠΉΡ‚πŸ’…

03.07.2023 12:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0