Ooh nice! I'll check it out!
               
            
            
                21.03.2025 17:23 β π 0    π 0    π¬ 0    π 0                      
            
         
            
        
            
            
            
            
            
    
    
            
                            
                        
                Reinforcement Learning: An Overview
                This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based met...
            
        
    
    
            An updated intro to reinforcement learning by Kevin Murphy: arxiv.org/abs/2412.05265! Like their books, it covers a lot and is quite up to date with modern approaches. It also is pretty unique in coverage, I don't think a lot of this is synthesized anywhere else yet
               
            
            
                09.12.2024 14:27 β π 270    π 73    π¬ 9    π 5                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            I collected some folk knowledge for RL and stuck them in my lecture slides a couple weeks back: web.mit.edu/6.7920/www/l... See Appendix B... sorry, I know, appendix of a lecture slide deck is not the best for discovery. Suggestions very welcome.
               
            
            
                27.11.2024 13:36 β π 113    π 17    π¬ 3    π 3                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            It's 2040. ICLR rebuttal now lasts two years. Reviewer 2 still hasn't read your paper but has strong opinions about it
               
            
            
                25.11.2024 20:42 β π 102    π 6    π¬ 4    π 0                      
            
         
            
        
            
            
            
            
            
    
    
    
    
            Learning can also mean functional adaptation. So, adapting to contextβwhether through embeddings or reasoningβcan still count.
               
            
            
                20.11.2024 23:27 β π 1    π 0    π¬ 0    π 0                      
            
         
            
        
            
        
            
            
            
            
            
    
    
    
    
            Pretty cool initiative @eugenevinitsky.bsky.social !
               
            
            
                19.11.2024 15:55 β π 2    π 0    π¬ 0    π 0                      
            
         
    
         
        
            
        
                            
                    
                    
                                            Applied Scientist at AWS working on auto.gluon.ai
Machine learning, time series forecasting
                                     
                            
                    
                    
                                            RL & Agents Reading Group @ University of Edinburgh 
We regularly discuss recent papers in RL, MARL & related
https://edinburgh-rl.github.io/reading-group
                                     
                            
                    
                    
                                            Researcher on MDPs and RL. Retired prof. #orms #rl
                                     
                            
                    
                    
                                            VP of Research, GenAI @ Meta (Multimodal LLMs, AI Agents), UPMC Professor of Computer Science at CMU, ex-Director of AI research at @Apple, co-founder Perceptual Machines (acquired by Apple)
                                     
                            
                    
                    
                                            Chief Models Officer @ Stealth Startup; Inria & MVA - Ex: Llama @AIatMeta & Gemini and BYOL @GoogleDeepMind
                                     
                            
                    
                    
                                            machine learning researcher @ Apple machine learning research 
                                     
                            
                    
                    
                                            Working on machine learning for science and symmetry/equivariance in RL. She/her. Senior Researcher at Microsoft Research Amsterdam. https://www.elisevanderpol.nl/ & https://sigmoid.social/@elisevanderpol
                                     
                            
                    
                    
                                            RL & Meta-Learning @ DeepMind.
                                     
                            
                    
                    
                                            This is the official account of EWRL18 - European Workshop on Reinforcement Learning 
Official website: https://euro-workshop-on-reinforcement-learning.github.io/ewrl18/
                                     
                            
                    
                    
                                    
                            
                    
                    
                                            PhD graduate in reinforcement learning from the University of Alberta. Now, RL research in space! ππ°οΈ
                                     
                            
                            
                    
                    
                                            Group Leader in TΓΌbingen, Germany 
Iβm π«π· and I work on RL and lifelong learning. Mostly posting on ML related topics.
                                     
                            
                    
                    
                                            Principal Scientist at Naver Labs Europe, Lead of Spatial AI team. AI for Robotics, Computer Vision, Machine Learning. Austrian in France. https://chriswolfvision.github.io/www/
                                     
                            
                    
                    
                                            PhD student @Berkeley_AI
reinforcement learning, AI, robotics
                                     
                            
                    
                    
                                            Sakana AI is an AI R&D company based in Tokyo, Japan. πΌπ§ 
https://sakana.ai/careers
                                     
                            
                    
                    
                                            IJCAI is the longest-running premier international AI research conference since 1969. π Connect across domains and feel the pulse of AI. ποΈ #IJCAI2025 β16-22 August 2025βMontreal π¨π¦
                                     
                            
                    
                    
                                            Research director @Inria, Head of @flowersInria
 lab, prev. @MSFTResearch @SonyCSLParis
Artificial intelligence, cognitive sciences, sciences of curiosity, language, self-organization, autotelic agents, education, AI and society
http://www.pyoudeyer.com
                                     
                            
                    
                    
                                            PhD student at @cmurobotics.bsky.social working on efficient algorithms for interactive learning (e.g. imitation / RL / RLHF). no model is an island. prefers email. https://gokul.dev/. on the job market!
                                     
                            
                    
                    
                                            Research Scientist at Google DeepMind. Addressing multi-agent interactions, reasoning and evaluation with reinforcement learning, search/planning, decentralized markets, and game theory.