Alham Fikri Aji @afaji - Bluesky Profile

Latest posts by afaji.bsky.social on Bluesky

We also explored other benchmark datasets and different models.

If you're interested in learning more, check out our paper, Data Laundering: arxiv.org/pdf/2412.15255

27.12.2024 10:42 — 👍 0 🔁 0 💬 0 📌 0

We discovered that the (illegal) knowledge of GPQA leaked through the distillation loss, even though the new model was never explicitly trained on GPQA during the distillation stage.

We also repeated the distillation process multiple times and found that the performance was maintained.

27.12.2024 10:42 — 👍 1 🔁 0 💬 1 📌 0

Data Laundering

We first train a model on the GPQA test data, which obviously lets it achieve 100% performance. But hey, don’t many LLMs train on test data anyway?🙈

Then, we train a new model on other (fair) data, but with a distillation loss from the cheating model.

27.12.2024 10:42 — 👍 0 🔁 0 💬 1 📌 0
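The two-stage recipe in the post above (a "cheating" teacher, then a student trained on fair data with a distillation loss) can be illustrated with a generic soft-label distillation objective. This is only a minimal sketch under stated assumptions: the function names, the `alpha` mixing weight, and the `temperature` value are hypothetical and not taken from the paper, which may define its loss differently.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      alpha=0.5, temperature=2.0):
    """Generic distillation objective (an assumption, not the paper's exact loss).

    alpha weights the hard-label cross-entropy on the fair training example;
    (1 - alpha) weights the KL divergence from the teacher's softened
    distribution to the student's.
    """
    # Hard-label term: cross-entropy against the fair dataset's gold label.
    hard = -log_softmax(student_logits)[label]

    # Soft-label term: KL(teacher || student) at the given temperature.
    log_t = log_softmax(teacher_logits, temperature)
    log_s = log_softmax(student_logits, temperature)
    soft = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_t, log_s))

    # The temperature**2 factor is the usual rescaling of the soft term.
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft


# Hypothetical logits for one 4-option multiple-choice question.
loss = distillation_loss(
    student_logits=[0.2, 0.1, 0.4, 0.3],
    teacher_logits=[0.1, 0.0, 3.0, 0.2],
    label=2,
)
```

In this sketch the student only ever sees examples from the fair dataset. Any benchmark knowledge can reach it solely through `teacher_logits`, which is the leakage path the thread describes.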

Final work promotion in 2024, by my student Jonibek Mansurov

We managed to achieve ~75% on the challenging GPQA benchmark with only a 2-layer transformer (~40M params) that was trained on different data; in our case, MedMCQA.

Introducing...

27.12.2024 10:42 — 👍 1 🔁 0 💬 1 📌 0

Grassroots Science: A global initiative focused on developing state-of-the-art multilingual language models through grassroots efforts.

⭐️ In Feb 2025, we're going to launch Grassroots Science, an ambitious, year-long, massive-scale, fully open-source initiative aimed at developing multilingual LLMs aligned to diverse and inclusive human preferences.

🌐 Check our website: grassroots.science.

#NLProc #GrassrootsScience

09.12.2024 05:02 — 👍 7 🔁 5 💬 1 📌 3

Hello, world! 🌍

I’ll be using this platform mainly for cross-posting from X and other places.

Kicking things off by promoting (to my nonexistent audience 😂) CVQA at NeurIPS!

Oral:
📍 East Meeting Room 1-3
🗓️ Thu, 12 Dec 3:30 pm PST

Poster:
📍 West Ballroom A-D #5110
🗓️ Thu, 12 Dec 4:30 pm PST

09.12.2024 14:42 — 👍 4 🔁 1 💬 0 📌 0

@afaji is following 20 prominent accounts

Irvi Aini
@irvifa

Sebastian Ruder
@sebruder

Research Scientist at Meta • ex Cohere, Google DeepMind • https://www.ruder.io/

karpathy
@karpathy

AI @ OpenAI, Tesla, Stanford

Sasha Rush
@srushnlp

Professor, Programmer in NYC. Cornell, Hugging Face 🤗

Sara Hooker
@sarahooker

I lead Cohere For AI. Formerly Research Google Brain. ML Efficiency, LLMs, @trustworthy_ml.

Najoung Kim
@najoung

https://najoung.kim language

Graham Neubig
@gneubig

Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI

hardmaru
@hardmaru

Co-Founder & CEO, Sakana AI 🎏 → @sakanaai.bsky.social https://sakana.ai/careers

Lindia Tjuatja
@lindiatjuatja

a natural language processor and “sensible linguist”. PhD-ing LTI@CMU, previously BS-ing Ling+ECE@UTAustin 🤠🤖📖 she/her lindiatjuatja.github.io

Language Technologies Institute | CMU
@ltiatcmu

The Language Technologies Institute in Carnegie Mellon University's @scsatcmu.bsky.social lti.cmu.edu

Mark Dredze
@mdredze

John C Malone Professor, Johns Hopkins Computer Science Director, Data Science and AI (DSAI) Institute Center for Language and Speech Processing, Malone Center for Engineering in Healthcare. Part-time: Bloomberg LP #nlproc

@eleutherai

Simran Khanuja
@simi97k

NLP ❤️ | PhD @ CMU, LTI | Prev. Google Research, Microsoft Research | https://simran-khanuja.github.io/

Clem Delangue 🤗
@clem.hf.co

Co-founder and CEO at Hugging Face

Lj Miranda
@ljvmiranda

PhD student at the University of Cambridge https://ljvmiranda921.github.io

@gentaiscool

Research Scientist at Capital One. Organizing https://grassroots.science

Grassroots Science
@grassroots-science

A year-long global initiative focused on collecting global data and developing multilingual LLMs via grassroots efforts. Coming Soon in early 2025! #NLProc

Nedjma Ousidhoum
@nedjmaou-nlp

#NLP #NLProc Lecturer (Assistant Professor) at Cardiff University. http://nedjmaou.github.io

Lintang Sutawika
@sutawika.com

PhD @ltiatcmu.bsky.social previously @eleutherai.bsky.social 🌐 lintang.sutawika.com

Ruochen
@ruochenzhang

PhD@browncs doing multilingual things <= Undergrad@SUTD