A dream come true! I presented "No Representation, No Trust" on my favorite RL podcast, TalkRL!
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!
@imtd.bsky.social
π https://www.trdavidson.com π¬research: deep generative learning; agentic systems; synthetic data PhD @EPFL on reliable magic Intern @MSR, Prev. @Google machine learning & company building π@NYU @UvA alumn
A dream come true! I presented "No Representation, No Trust" on my favorite RL podcast, TalkRL!
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!
π₯ Want to train large neural networks WITHOUT Adam while using less memory and getting better results? β‘
Check out SCION: a new optimizer that adapts to the geometry of your problem using norm-constrained linear minimization oracles (LMOs): π§΅π
hey max - highly recommend the book βChip Warsβ - en.m.wikipedia.org/wiki/Chip_Wa...
07.01.2025 07:25 β π 2 π 0 π¬ 0 π 0love the format/stack you settled on β hyped for 2025 entries π¦Ύππ
26.12.2024 15:44 β π 1 π 0 π¬ 0 π 0any chance at a sweet blogpost at some point ?! O_o
20.12.2024 20:16 β π 2 π 0 π¬ 1 π 1π Introducing PICLe: a framework for in-context named-entity detection (NED) using pseudo-annotated demonstrations.
π― No human labeling neededβyet it outperforms few-shot learning with human annotations!
#AI #NLProc #LLMs #ICL #NER
Here's Veo 2, the latest version of our video generation model, as well as a substantial upgrade for Imagen 3 π§βπ³π’
(Did I mention we are hiring on the Generative Media team, btw π)
blog.google/technology/g...
lol. yes, very true and important
15.12.2024 16:32 β π 0 π 0 π¬ 0 π 0Also, check out our ML project templateβitβs a game-changer!ππ
@caglarai.bsky.social
π§βπ» github.com/CLAIRE-Labo/...
I am in Vancouver for NeurIPS 2024 until December 16th if you want to meet, DM or email me.
We have two accepted papers from my lab:
1. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers, on Wednesday, East Exhibit Hall A-C #2010 (1/3)
favorite conference experience for me :)
03.12.2024 16:59 β π 6 π 1 π¬ 0 π 1Better VQ-VAEs with this one weird rotation trick!
I missed this when it came out, but I love papers like this: a simple change to an already powerful technique, that significantly improves results without introducing complexity or hyperparameters.
I've put together a starter pack of EPFL researchers across all labs and domains! π¨π Would love to expand this list and showcase more amazing work happening at EPFL. Drop a reply to be added!
#EPFL #academicsky
go.bsky.app/73zdbtp
π¦Ύ βοΈ- nice pack :)
29.11.2024 15:04 β π 1 π 0 π¬ 1 π 0~π£~ -> π
29.11.2024 11:48 β π 0 π 0 π¬ 0 π 0