*New Paper*
π¨ Goal misgeneralization occurs when AI agents learn the wrong reward function, instead of the human's intended goal.
π We show that training with a minimax regret objective provably mitigates it, promoting safer and better-aligned RL policies!
08.07.2025 17:16 β π 9 π 2 π¬ 1 π 0
YouTube video by Summer School on AI and Games TV
Oskar StΓ₯lberg: Landscapes of Hex and Square - Summer School on AI and Games 2023
a very interesting multi-criteria optimisation problem for procedural level generation in 3D games
could one be inspiration for the environment design as a general sum game or a coalition game, going beyond zero-sum stuff?
www.youtube.com/watch?v=Npfo...
cc @michaelddennis.bsky.social
26.02.2025 18:09 β π 2 π 1 π¬ 1 π 0
In our recent paper, we outline 15 fully risk-agnostic, process-based, evidence-seeking policy objectives. None of them limit *what* developers can do -- they just affect *reporting* and *visibility.*
arxiv.org/abs/2502.096...
17.02.2025 18:26 β π 6 π 1 π¬ 0 π 0
Not at NeurIPS but would love to chat about it sometime! sounds right, and I think a lot of weird stuff happens when you take it to the extreme
14.12.2024 14:52 β π 0 π 0 π¬ 0 π 0
Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! π§
04.12.2024 16:13 β π 94 π 18 π¬ 3 π 3
Thrilled to share Genie 2! Endless environments created by text or images, a key to open-ended/AI-Generating Algorithms.
Genie 1 showed it's possible. 9 months later, Genie 2 shows jaw-dropping progress.π€― Witness the magic of scale, again. ππ Thx to all team members @deep-mind.bsky.social!
04.12.2024 16:07 β π 43 π 4 π¬ 1 π 0
Itβs been a crazy 2 years seeing so many amazingly talented researchers bring GENerative Interactive Environments alive in Genie 1 and 2.
the future is agents in generative environments
04.12.2024 17:53 β π 30 π 3 π¬ 0 π 0
The secret to doing good research is always to be a little underemployed. You waste years by not being able to waste hours. - Amos Tversky
19.11.2024 18:57 β π 38 π 3 π¬ 0 π 1
Welcome! @natashajaques.bsky.social. π
I think you may find one or two people who share your sentiment here... π
19.11.2024 01:07 β π 6 π 1 π¬ 0 π 0
π€£ really appreciate the support β€οΈ
17.11.2024 01:38 β π 0 π 0 π¬ 0 π 0
Thanks Eugene! I just have to get around to starting a blog already to post my weirdest takes π
17.11.2024 01:36 β π 3 π 0 π¬ 1 π 0
just joined bluesky too! π
13.11.2024 19:55 β π 7 π 0 π¬ 0 π 0
Always love a classic βmore environmentsβ when they admit the current envs prove the method works
13.11.2024 15:03 β π 0 π 0 π¬ 0 π 0
I have seen the silliest. strengths, weaknesses, and questions all a variant on the same sentence
12.11.2024 22:27 β π 4 π 0 π¬ 1 π 0
Incoming PhD, UC Berkeley
Interested in RL, AI Safety, Cooperative AI, TCS
https://karim-abdel.github.io
AGI safety researcher at Google DeepMind, leading causalincentives.com
Personal website: tomeveritt.se
AI technical governance & risk management research. PhD Candidate at MIT CSAIL. Also at https://x.com/StephenLCasper.
https://stephencasper.com/
Chief AI Scientist at Databricks. Founding team at MosaicML. MIT/Princeton alum. Lottery ticket enthusiast. Working on data intelligence.
AI, creativity and procedural generation researcher. No, not that kind of AI. I write and make games and take photographs of cities. Senior Lecturer at King's College London.
A prototype for a much larger system
http://www.possibilityspace.org | he/they
Research @ OpenAI, Prev PhD at Oxford University
AI & Transportation | MIT Associate Professor
Interests: AI for good, sociotechnical systems, machine learning, optimization, reinforcement learning, public policy, gov tech, open science.
Science is messy and beautiful.
http://www.wucathy.com
AI researcher at XBOW, Associate Professor @ NYU Tandon (on leave). Security, RE, ML. PGP http://keybase.io/moyix/
Founder of the MESS Lab: http://messlab.moyix.net
Computer Science professor at CMU. Doing research on automated software testing and bug finding. https://rohan.padhye.org
now: Assistant Professingβ’ in Software Practices Lab at UBC. was: postdoc MSR NYC, phd UC Berkeley. also at https://mastodon.acm.org/@cestlemieux. she/her.
Senior Lecturer #USydCompSci at the University of Sydney. Postdocs IBM Research and Stanford; PhD at Columbia. Converts β into puns: sometimes theorems. He/him.
RL & Meta-Learning @ DeepMind.
SeΓ±or swesearcher @ Google DeepMind, adjunct prof at UniversitΓ© de MontrΓ©al and Mila. Musician. From πͺπ¨ living in π¨π¦.
https://psc-g.github.io/
Research Scientist at DeepMind working on Gemini Thinking
Researcher @ Google DeepMind and Honorary Fellow @ U of Edinburgh.
RL, philosophy, foundations, AI.
https://david-abel.github.io
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making.
11-14 June 2025.
Trinity College Dublin.
https://rldm.org/
PhD Candidate at the University of Maryland researching reinforcement learning and autocurricula in complex, open-ended environments.
Previously RL intern @ SonyAI, RLHF intern @ Google Research, and RL intern @ Amazon Science
dumbest overseer at @anthropic
https://www.akbir.dev