how not to convince someone to not be paranoid
21.02.2026 17:57 β π 0 π 0 π¬ 0 π 0@conitzer.bsky.social
AI professor. Director, Foundations of Cooperative AI Lab at Carnegie Mellon. Head of Technical AI Engagement, Institute for Ethics in AI (Oxford). Author, "Moral AI - And How We Get There." https://www.cs.cmu.edu/~conitzer/
how not to convince someone to not be paranoid
21.02.2026 17:57 β π 0 π 0 π¬ 0 π 0"what if we trained an LLM only on data from after 1900?"
20.02.2026 12:52 β π 2 π 0 π¬ 0 π 0looking forward to seeing some good friends tomorrow
events.seas.harvard.edu/event/cs-col...
(2/2) (I am actually interested in whether and how we can say anything about AI consciousness -- www.cs.cmu.edu/~conitzer/LL... -- but I don't think it does much to just ask an LLM for a probability that it is conscious.)
17.02.2026 17:03 β π 2 π 0 π¬ 1 π 0(1/2) From the article: "Suppose you have a model that assigns itself a 72 percent chance of being conscious [...] Would you believe it?"
Hmm. If you walk into a hospital and someone tells you "I think there's a 72% chance that I am a doctor," what would you think?
futurism.com/artificial-i...
Very sad to hear of Joe Halpern's passing. He influenced so many people, myself included, for the better. Even when his body was getting weak, his mind was still sharp, and he was still helping me, kindly pointing me to important work.
16.02.2026 02:43 β π 12 π 0 π¬ 0 π 0AIES 2026 will be in MalmΓΆ, Sweden, October 12-14. Abstract due May 14, paper due May 21.
www.aies-conference.com/2026/
"What if Pythagoras had failed to install a security update? Be specific."
09.02.2026 15:23 β π 1 π 0 π¬ 0 π 1the sciences constantly need to adapt to the elements changing weight
07.02.2026 22:08 β π 2 π 0 π¬ 0 π 0air resistance
05.02.2026 13:57 β π 0 π 0 π¬ 0 π 0an underappreciated feature
04.02.2026 12:52 β π 4 π 0 π¬ 0 π 0(2/2) One concerning development is that models are starting to be able to recognize when they are being tested for safety. This upends traditional approaches to safety testing, turning it instead into a strategic game between the tester and the model. arxiv.org/abs/2508.14927
03.02.2026 13:55 β π 1 π 0 π¬ 0 π 0(1/2) proud to have played a small role in the second International AI Safety Report! internationalaisafetyreport.org/publication/...
03.02.2026 13:55 β π 1 π 0 π¬ 1 π 0gave brief closing talk "Cooperative AI: How to set up a contest for environments that arenβt zero-sum" @Algorithms with a Purpose '26 @cmu.edu, where teams competed programming bots to run restaurants, with the option to sabotage your competitor by throwing away their ingredients.
awap.acmatcmu.com
Also thanks to the amazing committee: Fei Fang, Tuomas Sandholm, Ben Levinstein, and Stuart Russell! (There's gotta be a better way to do these zoom pictures though...)
31.01.2026 14:27 β π 0 π 0 π¬ 0 π 0(2/3) pic left to right: Vince, Caspar, Emin (zoom), Emanuel (zoom), Jiayuan, Carlos, Tuomas
31.01.2026 14:27 β π 0 π 0 π¬ 1 π 0(1/3) Congratulations to my student Caspar Oesterheld (second from left) on a successful defense and an important dissertation "New foundational ideas in cooperative AI"! www.andrew.cmu.edu/user/coesterh/
31.01.2026 14:27 β π 3 π 0 π¬ 1 π 0Today is Holocaust remembrance day. How it got to that:
www.auschwitz.org/en/history/b...
Our paper in the alignment track of AAAI'26: what if human feedback is unstable over time?
arxiv.org/abs/2511.10032
Tomorrow (F) morning at 11am SG time, Emin Berker and Emanuel Tewolde will give a talk at AAAI on our paper below!
arxiv.org/abs/2512.16895
not a great look for Google
20.01.2026 13:52 β π 2 π 0 π¬ 0 π 0"Oh look AI is figuring out human preferences from our behavior so things will be fine."
Suno (music-generating AI):
suno.com/s/pmQIY73ulk...
simple
18.01.2026 13:38 β π 6 π 2 π¬ 0 π 2opening for new department head for CMU Philosophy!
www.cmu.edu/dietrich/phi...
Soccer goalkeepers have to make some tough decisions during the game. This should clear things up.
14.01.2026 02:35 β π 6 π 2 π¬ 0 π 0Congratulations to Markus Brill (and to Oxford) for starting a faculty position at Oxford CS! sites.google.com/site/brillma...
11.01.2026 18:32 β π 4 π 0 π¬ 0 π 0"could my teacher's husband and my wife's teacher be the same person?"
09.01.2026 03:09 β π 3 π 0 π¬ 0 π 0more on having your son as your elementary school teacher
07.01.2026 13:31 β π 4 π 1 π¬ 0 π 0Apparently in the 1960s/70s anything was possible.
06.01.2026 14:15 β π 1 π 0 π¬ 1 π 0It is now ridiculously easy to make little games like this that you can just play in your web browser. (This one made with Claude.)
www.cs.cmu.edu/~conitzer/ke...