Andreas Madsen @andreasmadsen

Andreas Madsen

@andreasmadsen.bsky.social

Ph.D. in NLP Interpretability from Mila. Previously: independent researcher, freelancer in ML, and Node.js core developer.

321 Followers | 172 Following | 10 Posts | Joined: 07.10.2024 | 1.6345

Latest posts by andreasmadsen.bsky.social on Bluesky

Also thanks to @sarath-chandar.bsky.social and @sivareddyg.bsky.social for supporting me during my Ph.D., which helped me get this far! I would highly recommend them if you are looking for a Ph.D. supervisor.

07.02.2025 17:01 — 👍 0 🔁 0 💬 0 📌 0

Positions:
* Full-stack
* Research Engineer
* Research Scientist
* Systems Infrastructure Engineer
* Research intern
Feel free to reach out but chances are I will see your application if you apply online. I will post details on my internship later, but there are more openings.

07.02.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0

Excited to finally announce that I have joined @guidelabs.bsky.social. We are building LLMs from scratch designed to be interpretable. Many have asked what I'm doing after my Ph.D., so great to finally get it out. We have a lot of open positions, from engineering to scientist to intern.

07.02.2025 17:01 — 👍 4 🔁 0 💬 1 📌 0

All investigations of faithfulness show that explanations' faithfulness is by default model and task-dependent. However, this is not the case when using FMMs. Thus, presenting a new paradigm for how to provide and ensure faithful explanations.

28.11.2024 14:02 — 👍 1 🔁 0 💬 0 📌 0

Diagram of faithfulness measurable models. Showing the model is designed to measure the faithfulness of an explanation, and that this can be used to optimize an explanation.

FMMs are when models are designed such that measuring faithfulness is cheap and precise, which makes it possible to optimize explanations toward maximum faithfulness.

28.11.2024 14:02 — 👍 2 🔁 0 💬 1 📌 0

Diagram of self-explanations. Showing input going in, then the regular output and explanation going out.

Self-explanations are when LLMs explain themselves. Current models are not capable of this, but we suggest how that could be changed.Diagram of self-explanations. Showing input going in, then the regular output and explanation going out.

28.11.2024 14:02 — 👍 1 🔁 0 💬 1 📌 0

We ask the question: How to provide and ensure faithful explanations for general-purpose NLP models? The main thesis is that we should develop new paradigms in interpretability. The two new paradigms explored are faithfulness measurable models (FMMs) and self-explanations.

28.11.2024 14:02 — 👍 1 🔁 0 💬 1 📌 0

New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing As machine learning becomes more widespread and is used in more critical applications, it's important to provide explanations for these models, to prevent unintended behavior. Unfortunately, many curr...

The full thesis is available at arxiv.org/abs/2411.17992. Thanks to @sivareddyg.bsky.social and @sarath-chandar.bsky.social for supervising me throughout all these years. It's been a great journey and I'm very grateful for their support.

28.11.2024 14:02 — 👍 3 🔁 0 💬 1 📌 0

Interpretability Needs a New Paradigm Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only model...

I’m thrilled to share that I’ve finished my Ph.D. at Mila and Polytechnique Montreal. For the last 4.5 years, I have worked on creating new faithfulness-centric paradigms for NLP Interpretability. Read my vision for the future of interpretability in our new position paper: arxiv.org/abs/2405.05386

28.11.2024 13:39 — 👍 36 🔁 4 💬 3 📌 1

Hi, can you add me thanks 🙂

27.11.2024 16:13 — 👍 0 🔁 0 💬 0 📌 0

@andreasmadsen is following 19 prominent accounts

Julian Skirzynski
@jskirzynski

PhD student in Computer Science @UCSD. Studying interpretable AI and RL to improve people's decision-making.

Kabir Kumar, aiplans.org
@kabirkumar

I run AI Plans, an AI Safety lab focused on solving AI Alignment before 2029. For several weeks I used a stone for a pillow. I once spent a quarter of my paycheck on cheese. Ping me! DMs not working atm due to totalitarian UK law :( SurpassAI

Simon Eiriksson
@simoneiriksson

Anthropologist from UniCPH and MS student in Machine Learning at Techincal University of Denmark. Apart from uncertainty in modelling, I care a lot about regenerative farming and food. You can also find me here: https://www.linkedin.com/in/simoneiriksson

Oliver Eberle
@eberleoliver

Clément Dumas
@butanium

Master student at ENS Paris-Saclay / aspiring AI safety researcher / improviser Prev research intern @ EPFL w/ wendlerc.bsky.social and Robert West MATS Winter 7.0 Scholar w/ neelnanda.bsky.social https://butanium.github.io

Alessio Devoto
@alessiodevoto

PhD in ML/AI | Researching Efficient ML/AI (vision & language) 🍀 & Interpretability | @SapienzaRoma @EdinburghNLP | https://alessiodevoto.github.io/ | ex @NVIDIA

Leonie Weissweiler
@weissweiler

Postdoc at Uppsala University Computational Linguistics with Joakim Nivre PhD from LMU Munich, prev. UT Austin, Princeton, @ltiatcmu.bsky.social, Cambridge computational linguistics, construction grammar, morphosyntax leonieweissweiler.github.io

Federico Adolfi
@fedeadolfi

Computation & Complexity | AI Interpretability | Meta-theory | Computational Cognitive Science https://fedeadolfi.github.io

Jasmijn Bastings
@jasmijn.bastings.me

Senior Research Scientist at Google DeepMind. Equitable AI, language, gender, society. She/her. 🌐 jasmijn.bastings.me

@paulirish

wrong acct. head to https://bsky.app/profile/paul.irish

Nathan Rajlich
@n8.io

Programmer / Nerd. Engineer @vercel. Former @nodejs core committer. Before you ask, I'm 6’6”

Tim Caswell
@creationix.blue

Lover of all things good in life including Family, Friends, Food, and Functional Programs. ⚒️ Creator of http://luvit.io and http://nvm.sh 👨‍🔬 Protocol Wizard Making the web faster @vercel.com

Domenic Denicola
@domenic.me

Working on Google Chrome to make the web better, as a way to pass the time until the singularity hits.

Dominic 🔵
@dominictarr

Amateur Anthropologist, Independent Degrowth Researcher, Special Interest: Traditional Craft of the Pacific. 100% pure meat brain. Youtube Sailor: https://youtube.com/@dominictarrsailing (Btw 🔵 is the view of this planet from the angle the land isnt)

Node.js
@nodejs.org

The Node.js JavaScript Runtime. 🐢🚀 Need help with Node.js? We've got a repo for that: https://github.com/nodejs/help

John Resig
@johnresig.com

Creator of jQuery, Chief Software Architect at Khan Academy, Japanese print nerd. https://johnresig.com/ https://ukiyo-e.org/ (bot: @ukiyo-e.org)

Mathias Bynens
@mths.be

♥ JavaScript, HTML, CSS, HTTP, performance, security, Bash, Unicode, i18n, macOS. https://mths.be/

Frank Bendon
@fb55

Engineer, pilot et al likes: stuff that works dislikes: stuff that does not work, people who sell stuff that does not work.

Guillermo Rauch
@rauchg.blue

@vercel.com