Excited to share our latest work on EvoTune, a novel method integrating LLM-guided evolutionary search and reinforcement learning to accelerate the discovery of algorithms! 1/12 🧵
26.04.2025 16:56
A dream come true! I presented "No Representation, No Trust" on my favorite RL podcast, TalkRL!
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!
I am in Vancouver for NeurIPS 2024 until December 16th if you want to meet, DM or email me.
We have two accepted papers from my lab:
1. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers, on Wednesday, East Exhibit Hall A-C #2010 (1/3)
Ah, I see, you got me here
03.12.2024 08:33
Where is this paper? I cannot find it
03.12.2024 08:28
Is the paper on ArXiv?
28.11.2024 21:48
@haeggee.bsky.social Is missing from the list 😢
22.11.2024 15:32
Maybe we can set our bluesky handle, in addition to twitter, in our huggingface profile?
21.11.2024 21:55
They have released many of their models, maybe that's why you thought that? I also thought that the weights were released
21.11.2024 21:33
Amazing repo 🤩 Thank you for including our work ☺️
21.11.2024 21:02
For people coming from an engineering background (i.e., without a background in SDEs), I feel like the variational diffusion model perspective (arxiv.org/pdf/2107.00630) is relatively intuitive, as diffusion is then kind of a generalized VAE
19.11.2024 15:23
A paper a day keeps the FOMO away, episode 7.
Among "oldies but goldies", this tutorial by Rabiner on Hidden Markov Models (HMMs) is dear to my heart. HMMs are one of the simplest statistical models where some variables are not observed, and we love them for it. 🧵
www.cs.ubc.ca/~murphyk/Bay...
Thanks for sharing those high-quality resources!
19.11.2024 12:11
In a gratuitous attempt to acquire more followers myself, I've made a start on a "starter pack". Hopefully as more people from 🐦 make it over to 🦋, we can extend this a bit. Suggestions welcome!
I've noticed not all accounts seem to be eligible to be added, anyone know what's up with that? 🤔