Hanna Yukhymenko's Avatar

Hanna Yukhymenko

@ayukh.bsky.social

Statistics MSc @ ETH Zurich Multilingual LLM training/eval/safety @ SRI lab ayukh.com

124 Followers  |  471 Following  |  17 Posts  |  Joined: 03.07.2023  |  1.649

Latest posts by ayukh.bsky.social on Bluesky

Excited to be joining #ACL2025NLP in Vienna ๐Ÿ‡ฆ๐Ÿ‡น!
DM me if you would like to meet up and chat ๐Ÿ‘‹

25.07.2025 10:01 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM A Blog post by Institute for Computer Science, Artificial intelligence and Technology on Hugging Face

A powerful step for linguistic tech: ETH Zurich student Hanna Yukhymenko developed MamayLM โ€“ a Ukrainian #LLM fluent in ๐Ÿ‡บ๐Ÿ‡ฆ & ๐Ÿ‡ฌ๐Ÿ‡ง, capturing language, culture & history. Supervised by Prof. Vechev & alumnus A. Alexandrov, in collab with INSAIT. bit.ly/3ED4o5k
@ayukh.bsky.social @ethzurich.bsky.social

24.04.2025 07:05 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

It's just a game to these people

08.03.2025 02:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

ะจะพ?

08.03.2025 01:59 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Post image

1/1 complete

13.12.2024 06:08 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐Ÿ“ข Our benchmark on self-supervised learning for single-cell data๐Ÿงฌ is accepted at the #NeurIPS2024 SSL workshop. We take a first step towards establishing best practices for SSL methods for single-cell data, and benchmark 8 SSL methods on 3 downstream tasks across 8 datasets.

12.12.2024 00:30 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

Watch me stalk Kaggle in Vancouver to get stickers

07.12.2024 18:04 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Is MMLU Western-centric? ๐Ÿค”

As part of a massive cross-institutional collaboration:
๐Ÿ—ฝFind MMLU is heavily overfit to western culture
๐Ÿ” Professional annotation of cultural sensitivity data
๐ŸŒ Release improved Global-MMLU 42 languages

๐Ÿ“œ Paper: arxiv.org/pdf/2412.03304
๐Ÿ“‚ Data: hf.co/datasets/Coh...

05.12.2024 16:31 โ€” ๐Ÿ‘ 59    ๐Ÿ” 12    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 6
Post image

Going to #neurips2024 next week

05.12.2024 01:44 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Yes, I should have specified it before maybe :)

04.12.2024 02:13 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This means basically more publicly available materials, yes
For example, more ready-to-use data (e.g. web scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals etc. This screenshot is from INCLUDE paper by Cohere which has Ukrainian exams in it, thus a new resource for eval๐Ÿ™‚

04.12.2024 02:11 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Before I have seen many papers claiming Ukrainian language to be low-resource, even though there are ~40 mil UA speakers worldwide, so there should be a lot of proof to that
Since 2022 Ukrainian NLP effort has dramatically increased and the number of Ukrainian texts available online has increased

04.12.2024 02:09 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

ะฆะตะน ะดะตะฝัŒ ะฝะฐัั‚ะฐะฒ - Ukrainian is finally recognized as a mid-resource language ๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿฆ…๐Ÿฆ…๐Ÿฆ…

04.12.2024 00:43 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Exciting end of the year!
- Won the GraySwanAI jailbreaking challenge for harmful code generation
- Proud Ukrainian ambassador for Cohere4AI new Aya Expanse models
- Started my master thesis๐Ÿ‘ฉโ€๐Ÿณ๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿ‘€
- Going to Vancouver๐Ÿ‡จ๐Ÿ‡ฆ for @neuripsconf.bsky.social to chat about LLM privacy and SynthPAI

#neurips

29.11.2024 00:47 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Seems to be a hot take here: why are people getting mad about stuff they post *themselves* online? Your posts online are getting scraped all the time and you chose an open-source movement leader as a scapegoat.
Just initiate a discussion with HF about the "right to be forgotten" from GDPR

28.11.2024 12:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿ‘‹

25.11.2024 13:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Zaporizhzhia, a big Ukrainian city of almost one million, is under a massive drone attack. terrorism is russian culture.

18.11.2024 19:48 โ€” ๐Ÿ‘ 267    ๐Ÿ” 118    ๐Ÿ’ฌ 11    ๐Ÿ“Œ 3
Preview
LVE Repository We document and track vulnerabilities and exposures of large language models (LVEs).

One year of ChatGPT has shown incredible capabilities of LLMs. However, they still have lots of problems! The LVE project aims at addressing this - with LVEs we track LLM vulnerabilities and exposures in an open-source community-first approach.

Contribute and more info: lve-project.org
#NLP #LLM

12.12.2023 23:57 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image 02.11.2023 16:02 โ€” ๐Ÿ‘ 43    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

ะขั€ะตะด ะท ะบะพั€ะธัะฝะธะผะธ ะฟะพั€ะฐะดะฐะผะธ ะดะปั ั‚ะธั…, ั…ั‚ะพ ั‰ะพะนะฝะพ ะดะพะปัƒั‡ะธะฒัั. ๐Ÿงต

26.05.2023 19:20 โ€” ๐Ÿ‘ 1249    ๐Ÿ” 360    ๐Ÿ’ฌ 62    ๐Ÿ“Œ 136
Post image

ะšัƒะฟะธะปะฐ ัƒะบั€ะฐั—ะฝััŒะบั– ะฒะฐั€ะตะฝะธะบะธ ะฒ ะจะฒะตะนั†ะฐั€ั–ั—

09.07.2023 20:37 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

๐Ÿค™๐Ÿซก

03.07.2023 17:57 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

ะฏะบ ะบะฐะถัƒั‚ัŒ

03.07.2023 12:56 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

ะ”ัะบัƒัŽ ะทะฐ ั–ะฝะฒะฐะนั‚๐Ÿ’…

03.07.2023 12:43 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@ayukh is following 20 prominent accounts