Justin Deschenaux's Avatar

Justin Deschenaux

@jdeschena.bsky.social

PhD student @EPFL, supervised by Caglar Gulcehre. Casting the forces of gradient descent πŸ§™β€β™‚οΈ Website: https://jdeschena.github.io

128 Followers  |  280 Following  |  9 Posts  |  Joined: 15.11.2024
Posts Following

Posts by Justin Deschenaux (@jdeschena.bsky.social)

Video thumbnail

Excited to share our latest work on EvoTune, a novel method integrating LLM-guided evolutionary search and reinforcement learning to accelerate the discovery of algorithms! 1/12🧡

26.04.2025 16:56 β€” πŸ‘ 21    πŸ” 10    πŸ’¬ 1    πŸ“Œ 2

A dream come true! I presented "No Representation, No Trust" on my favorite RL podcast, TalkRL!
Make sure to check it out to learn why training with PPO for too long makes your agent collapse!

03.03.2025 21:36 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

I am in Vancouver for NeurIPS 2024 until December 16th if you want to meet, DM or email me.
We have two accepted papers from my lab:
1. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers, on Wednesday, East Exhibit Hall A-C #2010 (1/3)

09.12.2024 23:03 β€” πŸ‘ 11    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

Ah, I see, you got me here πŸ˜†

03.12.2024 08:33 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Where is this paper? I cannot find it

03.12.2024 08:28 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Is the paper on ArXiv?

28.11.2024 21:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@haeggee.bsky.social Is missing from the list 😒

22.11.2024 15:32 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Maybe we can set our bluesky handle, in addition to twitter, in our huggingface profile? πŸ‘€

21.11.2024 21:55 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

They have released many of their models, maybe that’s why you thought that? I also thought that the weights were released

21.11.2024 21:33 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Amazing repo 🀩 Thank you for including our work ☺️

21.11.2024 21:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

For people coming from an engineering background (ie without a background in SDEs, I feel like the variational diffusion model perspective (arxiv.org/pdf/2107.00630) is relatively intuitive, as then diffusion is kind of a generalized VAE

19.11.2024 15:23 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

A paper a day keeps the FOMO away, episode 7.

Among "oldies but goldies", this tutorial by Rabiner on Hidden Markov Models (HMMs) is dear to my heart. HMMs are one of the simplest statistical models where some variables are not observed, and we love them for it. 🧡

www.cs.ubc.ca/~murphyk/Bay...

19.11.2024 10:40 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 2    πŸ“Œ 1

Thanks for sharing those high quality resources! πŸ™Œ

19.11.2024 12:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In a gratuitous attempt to acquire more followers myself 😁, I've made a start on a "starter pack". Hopefully as more people from 🐦 make it over to πŸ¦‹, we can extend this a bit. Suggestions welcome!

I've noticed not all accounts seem to be eligible to be added, anyone know what's up with that? πŸ€”

15.11.2024 20:04 β€” πŸ‘ 125    πŸ” 37    πŸ’¬ 34    πŸ“Œ 10