Michael Hu @michahu - Bluesky Profile

Latest posts by michahu.bsky.social on Bluesky

Nothing Like You Stephan Bodzin, Luna Semara · Boavista · Song · 2021

Boavista album by Stephan Bodzin:
open.spotify.com/track/7ujvbI...

26.11.2024 01:29 — 👍 2 🔁 0 💬 0 📌 0

Is this #1 in your Spotify wrapped 😆

26.11.2024 01:14 — 👍 1 🔁 0 💬 0 📌 0

thanks for featuring this work!

19.11.2024 02:04 — 👍 1 🔁 0 💬 0 📌 0

Aioli: A Unified Optimization Framework for Language Model Data Mixing Language model performance depends on identifying the optimal mixture of data groups to train on (e.g., law, code, math). Prior work has proposed a diverse set of methods to efficiently learn mixture ...

In joint work with @MayeeChen @NickLourie @kchonyc @HazyResearch, we use our optimization framework to analyze failures of existing methods. We then turn these insights into:

Aioli 🧄, a fully-online data mixing algorithm!

paper: arxiv.org/abs/2411.05735
code: github.com/HazyResearch...

12.11.2024 17:04 — 👍 0 🔁 0 💬 0 📌 0

So you want a good pretraining data mix🧑‍🍳, but which data mixing algorithm do you pick? DoGE, DoReMi, Skill-it, grid searching proportions… 😵‍💫

It turns out that these algorithms are all special cases of Linear Mixing Optimization, our new data mixing framework! 🧵

12.11.2024 17:04 — 👍 0 🔁 0 💬 1 📌 0

metropolis-hastings:
1️⃣ sample from your proposal function
2️⃣ run the sample through your filter, proportional to the desired pdf
3️⃣ use the kept samples to initialize the next round

i wonder if we can connect iterative approaches to synthetic data as making specific choices in an MCMC framework...

10.11.2024 02:24 — 👍 0 🔁 0 💬 0 📌 0

@michahu is following 20 prominent accounts

@hungtingchen

PhD student at NYU, working on NLP. https://timchen0618.github.io

@momergul

CS PhD Student @Cornell

Kim Stachenfeld, PhD
@neurokim

Neuro + AI Research Scientist at DeepMind; Affiliate Professor at Columbia Center for Theoretical Neuroscience. Likes studying learning+memory, hippocampi, and other things brains have and do, too. she/her.

Samuel Ainsworth
@skainswo

prev: @BrownUniversity, @uwcse/@uw_wail phd, ex-@cruise, RS @waymo. 0.1x engineer, 10x friend. spondyloarthritis, cars ruin cities, open source

@tiwa-eisape

NYU Center for Data Science
@nyudatascience

Official account of the NYU Center for Data Science, the home of the Undergraduate, Master’s, and Ph.D. programs in data science. cds.nyu.edu

Mor Geva
@megamor2

https://mega002.github.io

Sadhika Malladi
@sadhika

CS PhD student at Princeton. https://www.cs.princeton.edu/~smalladi/index.html

Noah A. Smith
@nlpnoah

Researcher in NLP, ML, computer music. Prof @uwcse @uwnlp & helper @allen_ai @ai2_allennlp & familiar to two cats. Single reeds, tango, swim, run, cocktails, מאַמע־לשון, GenX. Opinions not your business.

Deniz Oktay
@denizzokt

👀 || ESM3 || Princeton PhD || MIT BS/MEng || former ai resident @google, intern @nvidia || Bay Area native

Vicente Ordonez
@vicenteor

Rice University, Associate Professor of Computer Science. Computer Vision, Multimodal AI, Deep Learning. Houston, Texas. Check our work at https://vislang.ai/

Saurabh Shah
@saurabhshah2

training olmos at Ai2, prev at Apple, Penn …. 🎤 dabbler of things🎸 🐈‍⬛enjoyer of cats 🐈 and mountains🏔️he/him

@byungdoh

@wellecks

Candace Ross
@candaceross

Research Scientist at FAIR (Meta), PhD from MIT

Adina Williams
@adinawilliams

NLP, Linguistics, Cognitive Science, AI, ML, etc. Job currently: Research Scientist (NYC) Job formerly: NYU Linguistics, MSU Linguistics

Simran Khanuja
@simi97k

NLP ❤️ | PhD @ CMU, LTI | Prev. Google Research, Microsoft Research | https://simran-khanuja.github.io/

Ali Behrouz
@alibehrouz

Intern @Google, Ph.D. Student @Cornell_CS. Interested in machine learning, LLM, brain, and healthcare. abehrouz.github.io

@kyunghyuncho

a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor. see more at https://kyunghyuncho.me/

Kaiser Sun
@kaiserwholearns

Ph.D. student at @jhuclsp, human LM that hallucinates. Formerly @MetaAI, @uwnlp, and @AWS they/them🏳️‍🌈 #NLProc #NLP Crossposting on X.