Introducing π₯EGGROLL π₯(Evolution Guided General Optimization via Low-rank Learning)! π Scaling backprop-free Evolution Strategies (ES) for billion-parameter models at large population sizes
β‘100x Training Throughput
π―Fast Convergence
π’Pure Int8 Pretraining of RNN LLMs
21.11.2025 17:56 β
π 26
π 8
π¬ 1
π 4
Late to the party (since I just took some time to spend with our two little ones) but luckily good science is timeless ;)
02.05.2025 20:13 β
π 4
π 0
π¬ 0
π 1
PQN puts Q-learning back on the map and now comes with a blog post + Colab demo! Also, congrats to the team for the spotlight at #ICLR2025
20.03.2025 11:51 β
π 16
π 4
π¬ 0
π 0
Job Details
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
12.03.2025 15:17 β
π 19
π 13
π¬ 0
π 0
That's the first time that I see a video by chess.com cited in an accepted ICLR paper, in particular on handshakes vs. fist bumps during a chess competition ...
By Oxford, @jfoerst.bsky.social
Paper: openreview.net/forum?id=wFg...
Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com
09.02.2025 13:40 β
π 9
π 1
π¬ 1
π 0
YouTube video by Machine Learning Street Talk
ImageNet Moment for Reinforcement Learning?
@jfoerst.bsky.social take on how the community sees the ARC Challenge and how we evaluate models and use benchmarks nowadays is π.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.
18.02.2025 19:26 β
π 3
π 1
π¬ 0
π 0
Second #runconference @neuripsconf.bsky.social #NeurIPS2024 !
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io
Same deal for tomorrow: 7am at
goo.gl/maps/8Z8eMrd...
Join us!
11.12.2024 18:25 β
π 20
π 1
π¬ 0
π 1
DPhil in Engineering Science | University of Oxford
About the courseThe DPhil in Engineering Science will offer you the opportunity to develop in-depth knowledge, understanding and expertise in your chosen field of engineering research. To support
π¨ PSA π¨ Deadline to apply for your dream Phd in ML
@FLAIR_Ox
is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
29.11.2024 19:45 β
π 16
π 3
π¬ 1
π 0
correct -- this runs on top of an open-source protocol and the UI is a Twitter clone. How hard can this be?
23.11.2024 16:08 β
π 0
π 0
π¬ 1
π 0
wth did we not go to an open-source and non-for profit alternative? en.wikipedia.org/wiki/Bluesky
23.11.2024 15:00 β
π 9
π 0
π¬ 3
π 0
sad times. Joking aside, have you tried pufferlib? I am really curious how it compares and contrasts to JAX RL line of work and haven't seen much direct comparison.
23.11.2024 14:57 β
π 2
π 0
π¬ 3
π 0
Let's try @josephsuarez.bsky.social
23.11.2024 14:37 β
π 1
π 0
π¬ 1
π 0
DPhil in Engineering Science | University of Oxford
About the courseThe DPhil in Engineering Science will offer you the opportunity to develop in-depth knowledge, understanding and expertise in your chosen field of engineering research. To support
Candidates also need to apply for an Engineering DPhil by 2nd of Dec AOE (if they havenβt already) listing me as the supervisor, www.ox.ac.uk/admissions/g... The student should have an outstanding track record of academic excellence and relevant research experience.
23.11.2024 14:35 β
π 3
π 0
π¬ 0
π 0
Apply by emailing a CV, personal statement, and research proposal to βfair-flair-2024-applications@googlegroups.comβ by 2nd of Dec AOE. Joint interviews will be held in January. Shortlisted candidates will also be invited to apply to FAIR.
23.11.2024 14:35 β
π 2
π 0
π¬ 1
π 0
Home
FLAIR is a research group in the Department of Engineering Science at the University of Oxford, specialising in Reinforcement Learning.
The goal is to improve the generalisation abilities and data efficiency of GenAI, e.g. using RL and curriculum learning to train LLMs at the frontier of learnability.
For more details about our work, check out foersterlab.com and joao.science
23.11.2024 14:35 β
π 0
π 0
π¬ 1
π 0
JoΓ£o F. Henriques
Research of Joao F. Henriques
Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!
23.11.2024 14:35 β
π 51
π 18
π¬ 1
π 4
I got summoned π«‘
15.11.2024 09:43 β
π 9
π 0
π¬ 2
π 0
Hello world!
14.11.2024 09:15 β
π 33
π 5
π¬ 6
π 0