Congrats!
28.02.2025 00:52
GitHub - ddidacus/mol-moe: Repository for: "Training Preference-Guided Routers for Molecule Generation"
Special thanks to Biogen and CIFAR for the support, to @proceduralia.bsky.social and @pierrelucbacon.bsky.social for their valuable supervision, and to the entire Mila community for their feedback, discussions, and support. Code, paper, and models are public: github.com/ddidacus/mol...
20.02.2025 19:43
Exactly this. So much private innovation is really just repackaged public innovation. So by scrapping public funding, we risk scrapping future economic gains as well.
08.02.2025 10:15
Self-Verification, The Key to AI
Again, Sutton said it first: "Verification, The Key to AI" incompleteideas.net/IncIdeas/Key...
The bitter lesson became clear with LLMs, and now verification is all the rage in reasoning models. Sutton's insight (2001!) was, as always, prescient.
27.01.2025 15:13
Introduction - Practical Reinforcement Learning: From Algorithms to Applications
I am pretty happy with jupyter-book (jupyterbook.org), which I used for pierrelucbacon.com/rlbook/
14.01.2025 22:20
Working on RL training of LLMs @Mila_Quebec.
Professor, author of book on Simulation-Based Optimization, #ReinforcementLearning #MDPs #ORMS www.simoptim.com
Visiting Researcher at Meta; PhD student @mila.quebec. Ex: Intern @GoogleDeepMind, Intern @ EPFL, MSc@MIPT;
artemzholus.github.io
PhD student | Interested in all things decision-making and learning
Scientist, #MachineLearning and #AI for Molecular Sciences. Scuba Diver. Loves @cecclementi.bsky.social
Ask me about Reinforcement Learning
Research @ Sony AI
AI should learn from its experiences, not copy your data.
My website for answering RL questions: https://www.decisionsanddragons.com/
Views and posts are my own.
Research Director @GoogleDeepMind. Co-lead of Veo, working on generative models of video and their fun applications.
International Conference on Learning Representations https://iclr.cc/
Founder & executive & community builder & organizer & researcher
ML Collective (mlcollective.org)
Google DeepMind
rosanneliu.com
Internet pedestrian. ✨Content creator✨ Machine learning mercenary. (he/him/his)
https://laurent-dinh.github.io/
AI safety at Anthropic, on leave from a faculty job at NYU.
Views not employers'.
I think you should join Giving What We Can.
cims.nyu.edu/~sbowman
Professor, Computer Science, University of British Columbia. CIFAR AI Chair, Vector Institute. Senior Advisor, DeepMind. ML, AI, deep RL, deep learning, AI-Generating Algorithms (AI-GAs), open-endedness.
Musician, math lover, cook, dancer, 🏳️‍🌈, and an ass prof of Computer Science at New York University
Assoc. Prof. @Seoul National University. A CV/ML researcher.
Official account of the NYU Center for Data Science, the home of the Undergraduate, Master's, and Ph.D. programs in data science. cds.nyu.edu
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Professor at University of Toronto. Research on machine learning, optimization, and statistics.
@agentic-ai-lab.bsky.social
mengyeren.com
AI research lab at New York University led by Mengye Ren
@mengyer.bsky.social
agenticlearning.ai