AI Safety for Who? | Kairos.fm
AI safety is making you less safe: chatbot anthropomorphization, mental health harms, dark patterns
🚨 New muckrAIkers: "AI Safety For Who?"
@jacobhaimes.bsky.social & @thegermanpole.bsky.social break down how instruction tuning/RLHF create anthropomorphic chatbots that exploit human empathy, leading to mental health harms. kairos.fm/muckraikers/...
Find us wherever you listen! (links in thread)
13.10.2025 20:42
Dangers of being an autodidact: I somehow missed the identity $a^b = e^{b \ln a}$ for the last 5 years.
I'm technically semi-autodidact because I did get formal training, but somewhere that very useful and basic thing got lost and I never had enough pain to derive it myself.
09.02.2025 07:57
Probably our best episode yet, Chris was an awesome guest and is a true Mensch
27.01.2025 16:50
🚨 New Paper Alert: Open Problem in Machine Unlearning for AI Safety 🚨
Can AI truly "forget"? While unlearning promises data removal, controlling emergent capabilities is an inherent challenge. Here's why it matters:
Paper: arxiv.org/pdf/2501.04952
1/8
10.01.2025 16:58
Scientist and Group Leader of the Simons Machine Learning Center
@SEMC_NYSBC. Co-founder and CEO of http://OpenProtein.AI. Opinions are my own.
Official Bluesky account for the Into AI Safety podcast - available on all podcast listening platforms!
https://kairos.fm/intoaisafety
founder of Kairos.fm, host of the Into AI Safety and muckrAIkers podcasts, working with Apart Research and the Odyssean Institute.
All views my own. He/him.
PhD candidate, Cornell University, Government & IR.
Grad Fellow, Cornell Tech Policy Institute
Born & raised in Ottawa, interested in the politics of surveillance, smart cities, AI, emerging tech, and int'l security. She/her
www.ameliacarsenault.com
Academic jack-of-all-trades.
AI Governance Researcher, interested in this century's greatest challenges & host for On What Matters (https://bsky.app/profile/on-what-matters.bsky.social)
Senior Research Associate @cser.bsky.social | Co-founder @gvra.bsky.social | Risk communication | Global Catastrophic Risk | Systemic Risk | Volcanic risk | First Gen | European
On this podcast hosted by Coleman Snell, we talk about the biggest risks/challenges facing our species, solutions, and how we can find meaning in this strange century.
audio engineer/GM/well known insect
Wear your cringe like armor and it can never be used to hurt you.
Official Bluesky account for the Kairos.fm media network - making complex problems meaningfully accessible.
Visit our website, or find our podcasts wherever you listen!
Official Bluesky account for the muckrAIkers podcast - available on all podcast listening platforms!
https://kairos.fm/muckraikers
SFF author, triple Hugo award winner (and three times Locus award too), over a million books sold.
Mostly on Mastodon: @cstross@wandering.shop
Blog: https://www.antipope.org/charlie/blog-static/
I do SciML + open source!
ML+proteins @ http://Cradle.bio
Neural ODEs: http://arxiv.org/abs/2202.02435
JAX ecosystem: http://github.com/patrick-kidger
Prev. Google, Oxford
Zürich, Switzerland
Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com
PhD student University College London: machine learning for protein design and structure-based drug discovery
Computer Science -- Computation and Language
source: export.arxiv.org/rss/cs.CL
maintainer: @tmaehara.bsky.social
Recently a principal scientist at Google DeepMind. Joining Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamical systems.