Excited to be joining #ACL2025NLP in Vienna 🇦🇹!
DM me if you would like to meet up and chat
A powerful step for linguistic tech: ETH Zurich student Hanna Yukhymenko developed MamayLM, a Ukrainian #LLM fluent in 🇺🇦 & 🇬🇧, capturing language, culture & history. Supervised by Prof. Vechev & alumnus A. Alexandrov, in collab with INSAIT. bit.ly/3ED4o5k
@ayukh.bsky.social @ethzurich.bsky.social
It's just a game to these people
08.03.2025 02:02
What?
08.03.2025 01:59
1/1 complete
13.12.2024 06:08
Our benchmark on self-supervised learning for single-cell data 🧬 is accepted at the #NeurIPS2024 SSL workshop. We take a first step towards establishing best practices for SSL methods for single-cell data, and benchmark 8 SSL methods on 3 downstream tasks across 8 datasets.
12.12.2024 00:30
Watch me stalk Kaggle in Vancouver to get stickers
07.12.2024 18:04
Is MMLU Western-centric? 🤔
As part of a massive cross-institutional collaboration:
- Find that MMLU is heavily overfit to Western culture
- Professional annotation of cultural sensitivity data
- Release improved Global-MMLU in 42 languages
Paper: arxiv.org/pdf/2412.03304
Data: hf.co/datasets/Coh...
Going to #neurips2024 next week
05.12.2024 01:44
Yes, I should have specified it before maybe :)
04.12.2024 02:13
This means basically more publicly available materials, yes
For example, more ready-to-use data (e.g. web-scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals, etc. This screenshot is from the INCLUDE paper by Cohere, which contains Ukrainian exams and is thus a new resource for evals.
I have seen many papers before claiming that Ukrainian is a low-resource language, even though there are ~40 million Ukrainian speakers worldwide, so there should be a lot of evidence of that.
Since 2022, Ukrainian NLP efforts have increased dramatically, and so has the number of Ukrainian texts available online.
This day has come - Ukrainian is finally recognized as a mid-resource language 🇺🇦
04.12.2024 00:43
Exciting end of the year!
- Won the GraySwanAI jailbreaking challenge for harmful code generation
- Proud Ukrainian ambassador for Cohere4AI's new Aya Expanse models
- Started my master's thesis 👩‍🍳🇺🇦
- Going to Vancouver 🇨🇦 for @neuripsconf.bsky.social to chat about LLM privacy and SynthPAI
#neurips
Seems to be a hot take here: why are people getting mad about stuff they post *themselves* online? Your posts online are getting scraped all the time and you chose an open-source movement leader as a scapegoat.
Just initiate a discussion with HF about the "right to be forgotten" from GDPR
25.11.2024 13:26
Zaporizhzhia, a big Ukrainian city of almost one million, is under a massive drone attack. terrorism is russian culture.
18.11.2024 19:48
One year of ChatGPT has shown incredible capabilities of LLMs. However, they still have lots of problems! The LVE project aims at addressing this - with LVEs we track LLM vulnerabilities and exposures in an open-source community-first approach.
Contribute and more info: lve-project.org
#NLP #LLM
Thread with useful tips for those who have just joined. 🧵
26.05.2023 19:20
Bought Ukrainian varenyky in Switzerland
09.07.2023 20:37
🫡
03.07.2023 17:57
As they say
03.07.2023 12:56
Thanks for the invite
03.07.2023 12:43