Alham Fikri Aji's Avatar

Alham Fikri Aji

@afaji.bsky.social

Faculty @MBZUAI, visiting scientist @Google

40 Followers  |  25 Following  |  5 Posts  |  Joined: 09.12.2024  |  1.2082

Latest posts by afaji.bsky.social on Bluesky

We also explored other benchmark datasets and different models.

If you're interested in learning more, check out our paper, Data Laundering: arxiv.org/pdf/2412.15255

27.12.2024 10:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We discovered that the (illegal) knowledge of GPQA was leaked through the distillation loss, even though it was never explicitly trained on during the distillation stage.

We also repeated the distillation process multiple times and found that the performance was maintained

27.12.2024 10:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Data Laundering

We first train a model on the GPQA test data, which obviously made this model achieve 100% performance. But hey, don’t many LLMs train on test data anyway?πŸ™ˆ

Then, we train a new model on another (fair) data, but with a distillation loss from the cheating model

27.12.2024 10:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Final work promotion in 2024, by my student Jonibek Mansurov

We managed to achieve ~75% on a challenging GPQA with only 2 layers of transformers(~ 40M params) that were trained on different data; in our case, MedMCQA.

Introducing...

27.12.2024 10:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Grassroots Science A global initiative focused on developing state-of-the-art multilingual language models through grassroots efforts.

⭐️ We're going to launch Grassroots Science, a year-long ambitious, massive-scale, fully open-source initiative aimed at developing multilingual LLMs aligned to diverse and inclusive human preferences in Feb 2025.

🌐 Check our website: grassroots.science.

#NLProc #GrassrootsScience

09.12.2024 05:02 β€” πŸ‘ 7    πŸ” 5    πŸ’¬ 1    πŸ“Œ 3
Post image

Hello, world! 🌍

I’ll be using this platform, mainly cross-posting from X and other places

Kicking things off by promoting (to my nonexistent audience πŸ˜‚) CVQA at NeurIPS!

Oral:
πŸ“ East Meeting Room 1-3
πŸ—“οΈ Thu, 12 Dec 3:30 pm PST

Poster:
πŸ“ West Ballroom A-D #5110
πŸ—“οΈ Thu, 12 Dec 4:30 pm PST

09.12.2024 14:42 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

@afaji is following 20 prominent accounts