San Diego 🇺🇸 or Mexico City 🇲🇽 for #NeurIPS2025? We’ve got you covered either way.
On Dec 3rd:
🇲🇽 @dilya.bsky.social presents our work on the fragility of Mech Interp in Mexico
🇺🇸 @lkopf.bsky.social presents our work on polysemanticity in San Diego
I am not there this year, so I’ll be cheering from afar!
03.12.2025 00:19
🇲🇽 Next Wednesday (Dec 3), 1–4 p.m. CST, I’ll be presenting Manipulating Feature Visualizations with Gradient Slingshots at NeurIPS 2025 in Mexico City!
Feature Visualization has long been a staple interpretability tool. Our work shows it’s far from reliable! 🚨
29.11.2025 16:38
I’m at #NeurIPS in San Diego this week! Come see our poster on feature interpretability. Find @eberleoliver.bsky.social and me at:
🪧 Poster Session 1 @ Exhibit Hall C,D,E #1015
Wed 3 Dec, 11 am - 2 pm
🪧 Poster @ Mech Interp Workshop
Upper Level Room 30A-E
Sun 7 Dec, 8 am - 5 pm
02.12.2025 18:56
Manipulating Feature Visualizations with Gradient Slingshots
@dilya.bsky.social, Marina MC Höhne, Alexander Warnecke, @lpirch.bsky.social, Klaus-Robert Müller, @rieck.mlsec.org, @slapuschkin.bsky.social, @kirillbykov.bsky.social
28.11.2025 15:11
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework
@lkopf.bsky.social, @nfel.bsky.social, @kirillbykov.bsky.social, @philinelb.bsky.social, Anna Hedström, Marina Höhne, @eberleoliver.bsky.social
28.11.2025 15:11
Happy to share that our PRISM paper has been accepted at #NeurIPS2025!
In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features.
Paper: arxiv.org/abs/2506.15538
#NeurIPS #MechInterp #XAI
19.09.2025 12:01
🚨 New paper 🚨
We are happy to announce that our paper “Deep Learning meets Teleconnections: Improving S2S Predictions for European Winter Weather” has been published in Machine Learning: Earth @ioppublishing.bsky.social
Paper: iopscience.iop.org/article/10.1...
💻 github.com/philine-bomm...
05.08.2025 07:19
Thank you!!
24.07.2025 09:57
Personal news: I have defended my PhD thesis “Explaining Representations in Deep Neural Networks” at @tuberlin.bsky.social with summa cum laude (with distinction).
From August, I’ll start a Postdoc at @tumunich.bsky.social in @eml-munich.bsky.social, focusing on Mechanistic Interpretability ✨
24.07.2025 08:38
Check out our new work! Proud to share what we’ve been up to.
19.06.2025 15:20
I’ll be presenting our work at @neuripsconf.bsky.social in Vancouver!
Join me this Thursday, December 12th, in East Exhibit Hall A-C, Poster #3107, from 11 a.m. to 2 p.m. PST. I’ll be discussing our paper “CoSy: Evaluating Textual Explanations of Neurons.”
11.12.2024 06:43
Link to the paper: arxiv.org/abs/2405.20331
09.12.2024 15:26
I am not attending #NeurIPS2024, but I encourage everyone interested in #XAI and #MechInterp to check out our paper on evaluating textual descriptions of neurons!
Join @lkopf.bsky.social, Anna Hedström, and Marina Marie-Claire Höhne on Thu 12.12, 1 p.m. to 4 p.m. CST at East Exhibit Hall A-C #3107!
09.12.2024 15:25
Great! ☺️
Thank you for curating the list!
01.12.2024 19:10
About me
Machine Learning PhD Student
Julian, hi! Could you please add me? Here is my bio; I work on Explainable AI and Concept-based Explainability.
kirill-bykov.com
30.11.2024 18:42
i exclusively consent to my tweets being used for training neural networks. if you are not a neural network, stop reading this immediately
28.11.2024 02:59
Thank you ☺️
26.11.2024 15:22
24.11.2024 18:13
Oliver, hey!
Could you add me, please?
26.11.2024 11:59