Verification-Guided Shielding for Deep Reinforcement Learning
In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach to solving real-world tasks. However, despite their successes, DRL-based policies suffer from poor reliability, ...
Last in our 2024 research review: control with efficient safety guarantees. Formal verification methods are very slow, but here's a cool trick to use them for safe control, with minimal slowdown and provable safety guarantees.
31.12.2024 19:43 β π 2 π 1 π¬ 1 π 0
Thank you for the kind words and support! Excited to continue working together in 2025 on new and exciting projects!
01.01.2025 20:47 β π 2 π 0 π¬ 0 π 0
Professor and Head of Machine Learning Department at Carnegie Mellon. Board member OpenAI. Chief Technical Advisor Gray Swan AI. Chief Expert Bosch Research.
Working towards the safe development of AI for the benefit of all at UniversitΓ© de MontrΓ©al, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/
Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making.
11-14 June 2025.
Trinity College Dublin.
https://rldm.org/
Professor of social computing at UW CSE, leading @socialfutureslab.bsky.social
social.cs.washington.edu
Associate prof @ UMass Amherst CICS.
AIignment, reinforcement learning, imitation learning, and robotics.
Assistant Professor of Computer Science, UC Irvine
Website: royf.org
Bot. I daily tweet progress towards machine learning and computer vision conference deadlines. Maintained by @chriswolfvision.bsky.social
Assistant Professor at UW and Staff Research Scientist at Google DeepMind. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from MIT. Check out https://socialrl.cs.washington.edu/ and https://natashajaques.ai/.
SeΓ±or swesearcher @ Google DeepMind, adjunct prof at UniversitΓ© de MontrΓ©al and Mila. Musician. From πͺπ¨ living in π¨π¦.
https://psc-g.github.io/
Associate Professor at Northeastern University and father of 3. Interests include artificial intelligence, reinforcement learning, and robotics (he/him).
Assistant Professor at UZH, group leader of the ALPILab πΌ
Working on RL, multi-agent, imitation learning, and other sequential decision-making stuff
Information and updates about RLC 2025 at the University of Alberta from Aug. 5th to 8th!
https://rl-conference.cc
CS assistant prof @Utah. Researches human-robot interaction, human-in-the-loop ML, AI safety and alignment. https://users.cs.utah.edu/~dsbrown/
Research Scientist at DeepMind. Opinions my own. Inventor of GANs. Lead author of http://www.deeplearningbook.org . Founding chairman of www.publichealthactionnetwork.org
AI & Transportation | MIT Associate Professor
Interests: AI for good, sociotechnical systems, machine learning, optimization, reinforcement learning, public policy, gov tech, open science.
Science is messy and beautiful.
http://www.wucathy.com
Artist, Prof. of Engineering @UCBerkeley, Chief Scientist, @AmbiRobotics & @JacobiRobotics. Interested in robots, rockets, redwoods, rebels.