Introducing π₯EGGROLL π₯(Evolution Guided General Optimization via Low-rank Learning)! π Scaling backprop-free Evolution Strategies (ES) for billion-parameter models at large population sizes
β‘100x Training Throughput
π―Fast Convergence
π’Pure Int8 Pretraining of RNN LLMs
21.11.2025 17:56 β
π 26
π 8
π¬ 1
π 4
hello π¦ ! I'll be at NeurIPS next week presenting our work on using learnability to select levels for RL autocurricula. If you're there, I would love to chat about curricula and RL generalisation more broadly. Please DM if you'd like to grab a coffee :)
06.12.2024 10:24 β
π 5
π 1
π¬ 0
π 0
The good reviewer partyβ’, a party at conferences for reviewers whose ACs said they did a great job. ML operates on FOMO so I think it'd work
24.11.2024 20:09 β
π 97
π 8
π¬ 5
π 1
JoΓ£o F. Henriques
Research of Joao F. Henriques
Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!
23.11.2024 14:35 β
π 51
π 18
π¬ 1
π 4