Vincent Conitzer's Avatar

Vincent Conitzer

@conitzer.bsky.social

AI professor. Director, Foundations of Cooperative AI Lab at Carnegie Mellon. Head of Technical AI Engagement, Institute for Ethics in AI (Oxford). Author, "Moral AI - And How We Get There." https://www.cs.cmu.edu/~conitzer/

1,454 Followers  |  544 Following  |  388 Posts  |  Joined: 05.05.2024  |  1.6795

Latest posts by conitzer.bsky.social on Bluesky


Post image

how not to convince someone to not be paranoid

21.02.2026 17:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

"what if we trained an LLM only on data from after 1900?"

20.02.2026 12:52 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GameΒ TheoryΒ forΒ AIΒ Agents

looking forward to seeing some good friends tomorrow
events.seas.harvard.edu/event/cs-col...

19.02.2026 00:54 β€” πŸ‘ 10    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

(2/2) (I am actually interested in whether and how we can say anything about AI consciousness -- www.cs.cmu.edu/~conitzer/LL... -- but I don't think it does much to just ask an LLM for a probability that it is conscious.)

17.02.2026 17:03 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Anthropic CEO Says Company No Longer Sure Whether Claude Is Conscious Anthropic CEO Dario Amodei said he didn't know whether his Claude AI was conscious, but was strikingly open to the possibility.

(1/2) From the article: "Suppose you have a model that assigns itself a 72 percent chance of being conscious [...] Would you believe it?"

Hmm. If you walk into a hospital and someone tells you "I think there's a 72% chance that I am a doctor," what would you think?

futurism.com/artificial-i...

17.02.2026 17:03 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

Very sad to hear of Joe Halpern's passing. He influenced so many people, myself included, for the better. Even when his body was getting weak, his mind was still sharp, and he was still helping me, kindly pointing me to important work.

16.02.2026 02:43 β€” πŸ‘ 12    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
AI, Ethics, and Society β€” Home

AIES 2026 will be in MalmΓΆ, Sweden, October 12-14. Abstract due May 14, paper due May 21.
www.aies-conference.com/2026/

11.02.2026 13:48 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

"What if Pythagoras had failed to install a security update? Be specific."

09.02.2026 15:23 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Post image

the sciences constantly need to adapt to the elements changing weight

07.02.2026 22:08 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

air resistance

05.02.2026 13:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

an underappreciated feature

04.02.2026 12:52 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
AI Testing Should Account for Sophisticated Strategic Behaviour This position paper argues for two claims regarding AI testing and evaluation. First, to remain informative about deployment behaviour, evaluations need account for the possibility that AI systems und...

(2/2) One concerning development is that models are starting to be able to recognize when they are being tested for safety. This upends traditional approaches to safety testing, turning it instead into a strategic game between the tester and the model. arxiv.org/abs/2508.14927

03.02.2026 13:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

(1/2) proud to have played a small role in the second International AI Safety Report! internationalaisafetyreport.org/publication/...

03.02.2026 13:55 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
AWAP 2026 β€” Algorithms With A Purpose ACM@CMU’s flagship algorithmic competition returns with a fast-paced cooking arena where bots plan, cook, and compete.

gave brief closing talk "Cooperative AI: How to set up a contest for environments that aren’t zero-sum" @Algorithms with a Purpose '26 @cmu.edu, where teams competed programming bots to run restaurants, with the option to sabotage your competitor by throwing away their ingredients.
awap.acmatcmu.com

02.02.2026 02:07 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Also thanks to the amazing committee: Fei Fang, Tuomas Sandholm, Ben Levinstein, and Stuart Russell! (There's gotta be a better way to do these zoom pictures though...)

31.01.2026 14:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

(2/3) pic left to right: Vince, Caspar, Emin (zoom), Emanuel (zoom), Jiayuan, Carlos, Tuomas

31.01.2026 14:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

(1/3) Congratulations to my student Caspar Oesterheld (second from left) on a successful defense and an important dissertation "New foundational ideas in cooperative AI"! www.andrew.cmu.edu/user/coesterh/

31.01.2026 14:27 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Before the extermination / History / Auschwitz-Birkenau

Today is Holocaust remembrance day. How it got to that:
www.auschwitz.org/en/history/b...

27.01.2026 19:16 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Moral Change or Noise? On Problems of Aligning AI With Temporally Unstable Human Feedback Alignment methods in moral domains seek to elicit moral preferences of human stakeholders and incorporate them into AI. This presupposes moral preferences as static targets, but such preferences often...

Our paper in the alignment track of AAAI'26: what if human feedback is unstable over time?
arxiv.org/abs/2511.10032

23.01.2026 14:07 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
On the Edge of Core (Non-)Emptiness: An Automated Reasoning Approach to Approval-Based Multi-Winner Voting Core stability is a natural and well-studied notion for group fairness in multi-winner voting, where the task is to select a committee from a pool of candidates. We study the setting where voters eith...

Tomorrow (F) morning at 11am SG time, Emin Berker and Emanuel Tewolde will give a talk at AAAI on our paper below!
arxiv.org/abs/2512.16895

22.01.2026 14:04 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

not a great look for Google

20.01.2026 13:52 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Fortress Protocol Listen and make your own on Suno.

"Oh look AI is figuring out human preferences from our behavior so things will be fine."
Suno (music-generating AI):
suno.com/s/pmQIY73ulk...

19.01.2026 13:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

simple

18.01.2026 13:38 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 2
Applications open for Philosophy Department Head Position - Department of Philosophy - Dietrich College of Humanities and Social Sciences - Carnegie Mellon University Carnegie Mellon’s Philosophy Department seeks a Department Head to provide academic leadership and shape the department’s strategic vision.

opening for new department head for CMU Philosophy!
www.cmu.edu/dietrich/phi...

17.01.2026 16:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Soccer goalkeepers have to make some tough decisions during the game. This should clear things up.

14.01.2026 02:35 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
Markus Brill Markus Brill Associate Professor, Department of Computer Science Tutorial Fellow, Oriel College University of Oxford

Congratulations to Markus Brill (and to Oxford) for starting a faculty position at Oxford CS! sites.google.com/site/brillma...

11.01.2026 18:32 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

"could my teacher's husband and my wife's teacher be the same person?"

09.01.2026 03:09 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

more on having your son as your elementary school teacher

07.01.2026 13:31 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Apparently in the 1960s/70s anything was possible.

06.01.2026 14:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Kessler Syndrome: Satellite Collision Simulator

It is now ridiculously easy to make little games like this that you can just play in your web browser. (This one made with Claude.)
www.cs.cmu.edu/~conitzer/ke...

04.01.2026 14:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@conitzer is following 20 prominent accounts