Kirill Bykov

@kirillbykov.bsky.social

PhD student in Interpretable ML @UMI_Lab_AI, @bifoldberlin, @TUBerlin

347 Followers  |  198 Following  |  10 Posts  |  Joined: 20.11.2023

Posts by Kirill Bykov (@kirillbykov.bsky.social)

San Diego 🇺🇸 or Mexico City 🇲🇽 for #NeurIPS2025? We got you covered either way 😎

On Dec 3rd:
🇲🇽 @dilya.bsky.social presents our work on the fragility of Mech Interp in Mexico
🇺🇸 @lkopf.bsky.social presents our work on polysemanticity in San Diego

I am not there this year, so I'll be cheering from afar!

03.12.2025 00:19 - 👍 6    🔁 1    💬 0    📌 0

✈️🇲🇽 Next Wednesday (Dec 3), 1–4 p.m. CST, I’ll be presenting Manipulating Feature Visualizations with Gradient Slingshots at NeurIPS 2025 in Mexico City!

Feature Visualization has long been a staple interpretability tool. Our work shows it’s far from reliable! 🚨

29.11.2025 16:38 - 👍 9    🔁 4    💬 1    📌 0

I’m at #NeurIPS in San Diego this week! Come see our poster on feature interpretability. Find @eberleoliver.bsky.social and me at:

🪧Poster Session 1 @ Exhibit Hall C,D,E #1015
Wed 3 Dec, 11 am - 2 pm
🪧Poster @ Mech Interp Workshop
Upper Level Room 30A-E
Sun 7 Dec, 8 am - 5 pm

02.12.2025 18:56 - 👍 11    🔁 3    💬 1    📌 0

Manipulating Feature Visualizations with Gradient Slingshots
@dilya.bsky.social, Marina MC Höhne, Alexander Warnecke, @lpirch.bsky.social, Klaus-Robert Müller, @rieck.mlsec.org, @slapuschkin.bsky.social, @kirillbykov.bsky.social
👇

28.11.2025 15:11 - 👍 4    🔁 3    💬 1    📌 0

Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework
@lkopf.bsky.social, @nfel.bsky.social, @kirillbykov.bsky.social, @philinelb.bsky.social, Anna Hedström, Marina Höhne, @eberleoliver.bsky.social
👇

28.11.2025 15:11 - 👍 5    🔁 2    💬 1    📌 0

Happy to share that our PRISM paper has been accepted at #NeurIPS2025 🎉

In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features.

📄 Paper: arxiv.org/abs/2506.15538

#NeurIPS #MechInterp #XAI

19.09.2025 12:01 - 👍 30    🔁 4    💬 1    📌 3

🚨New paper 🚨

We are happy to announce that our paper “Deep Learning meets Teleconnections: Improving S2S Predictions for European Winter Weather” has been published in Machine Learning: Earth @ioppublishing.bsky.social

📄 iopscience.iop.org/article/10.1...

💻 github.com/philine-bomm...

05.08.2025 07:19 - 👍 4    🔁 2    💬 1    📌 0

Thank you!! 😊

24.07.2025 09:57 - 👍 1    🔁 0    💬 0    📌 0

Personal news: I have defended my PhD thesis “Explaining Representations in Deep Neural Networks” at @tuberlin.bsky.social with summa cum laude (with distinction).

From August, I’ll start a Postdoc at @tumunich.bsky.social in @eml-munich.bsky.social, focusing on Mechanistic Interpretability ✨

24.07.2025 08:38 - 👍 10    🔁 0    💬 1    📌 0

Check out our new work! Proud to share what we’ve been up to 👉

19.06.2025 15:20 - 👍 3    🔁 0    💬 0    📌 0

I’ll be presenting our work at @neuripsconf.bsky.social in Vancouver! 🎉
Join me this Thursday, December 12th, in East Exhibit Hall A-C, Poster #3107, from 11 a.m. PST to 2 p.m. PST. I'll be discussing our paper “CoSy: Evaluating Textual Explanations of Neurons.”

11.12.2024 06:43 - 👍 10    🔁 1    💬 1    📌 0

🔗 Link to the paper: arxiv.org/abs/2405.20331

09.12.2024 15:26 - 👍 1    🔁 0    💬 1    📌 0

I am not attending #NeurIPS2024, but I encourage everyone interested in #XAI and #MechInterp to check out our paper on evaluating textual descriptions of neurons!

Join @lkopf.bsky.social, Anna Hedström, and Marina Marie-Claire Höhne on Thu 12.12, 1 p.m. to 4 p.m. CST at East Exhibit Hall A-C #3107!

09.12.2024 15:25 - 👍 12    🔁 0    💬 1    📌 0

Great! ☺️

Thank you for curating the list!

01.12.2024 19:10 - 👍 1    🔁 0    💬 0    📌 0
About me Machine Learning PhD Student

Julian, hi 👋! Could you please add me? Here is my bio: working in Explainable AI and Concept-based Explainability

kirill-bykov.com

🙌

30.11.2024 18:42 - 👍 1    🔁 0    💬 1    📌 0

i exclusively consent to my tweets being used for training neural networks. if you are not a neural network, stop reading this immediately

28.11.2024 02:59 - 👍 309    🔁 39    💬 17    📌 6

Thank you ☺️

26.11.2024 15:22 - 👍 0    🔁 0    💬 0    📌 0
24.11.2024 18:13 - 👍 25    🔁 5    💬 17    📌 0

Oliver, hey! 👋

Could you add me, please?

26.11.2024 11:59 - 👍 0    🔁 0    💬 1    📌 0