BlackboxNLP's Avatar

BlackboxNLP

@blackboxnlp.bsky.social

The largest workshop on analysing and interpreting neural networks for NLP. BlackboxNLP will be held at EMNLP 2025 in Suzhou, China blackboxnlp.github.io

165 Followers  |  297 Following  |  28 Posts  |  Joined: 13.05.2025  |  1.9113

Latest posts by blackboxnlp.bsky.social on Bluesky

Writing your technical report for the MIB shared task?
Take a look at the task page for guidelines and tips!

06.08.2025 09:51 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The report deadline was also extended to August 10th!
Note that this is a final extension. We look forward to reading your reports! ✍️

06.08.2025 09:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Just 5 days left to submit your method to the MIB Shared Task at #BlackboxNLP!

Have last-minute questions or need help finalizing your submission?
Join the Discord server: discord.gg/n5uwjQcxPR

03.08.2025 06:40 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

Results + technical report deadline: August 8, 2025
Full task details: blackboxnlp.github.io/2025/task/

30.07.2025 05:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

With the new extended deadline, there's still plenty of time to submit your method to the MIB Shared Task!

We welcome submissions of existing methods, experimental POCs, or any approach addressing circuit discovery or causal variable localization πŸ’‘

30.07.2025 05:57 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Results deadline extended by one week!
Following requests from participants, we’re extending the MIB Shared Task submission deadline by one week.

πŸ—“οΈ New deadline: August 8, 2025
Submit your method via the MIB leaderboard!

29.07.2025 09:35 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 2
Post image

πŸ“ Technical report guidelines are out!

If you're submitting to the MIB Shared Task at #BlackboxNLP, feel free to take a look to help you prepare your report: blackboxnlp.github.io/2025/task/

28.07.2025 12:34 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Post image

Just 10 days to go until the results submission deadline for the MIB Shared Task at #BlackboxNLP!

If you're working on:
🧠 Circuit discovery
πŸ” Feature attribution
πŸ§ͺ Causal variable localization
now’s the time to polish and submit!

Join us on Discord: discord.gg/n5uwjQcxPR

23.07.2025 07:42 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1

Are you attending ICML? πŸ‘€

I'm sadly not, but if you are, you should check out the MIB πŸ•ΆοΈposter at 11AM: icml.cc/virtual/2025...

The benchmark is used as the shared task at this year's
@blackboxnlp.bsky.social (blackboxnlp.github.io/2025/task/) - there's still time to participate πŸ†

17.07.2025 15:56 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

⏳ Three weeks left! Submit your work to the MIB Shared Task at #BlackboxNLP, co-located with @emnlpmeeting.bsky.social

Whether you're working on circuit discovery or causal variable localization, this is your chance to benchmark your method in a rigorous setup!

13.07.2025 05:56 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 2

Have you started working on your submission for the MIB shared task yet? Tell us what you’re exploring!

New featurization methods?
Circuit pruning?
Better feature attribution?

We'd love to hear about it πŸ‘‡

09.07.2025 07:15 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

πŸ—“οΈ Deadline: August 1
πŸ“œ Full task details: blackboxnlp.github.io/2025/task/
πŸ’¬ Join the discussion: discord.gg/n5uwjQcxPR

08.07.2025 09:35 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Working on feature attribution, circuit discovery, feature alignment, or sparse coding?
Consider submitting your work to the MIB Shared Task, part of this year’s #BlackboxNLP

We welcome submissions of both existing methods and new or experimental POCs!

08.07.2025 09:35 β€” πŸ‘ 5    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

New to mechanistic interpretability?
The MIB shared task is a great opportunity to experiment:
βœ… Clean setup
βœ… Open baseline code
βœ… Standard evaluation

Join the discord server for ideas and discussions: discord.gg/n5uwjQcxPR

07.07.2025 08:42 β€” πŸ‘ 9    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

The wait is over! πŸŽ‰ Our speakers for #BlackboxNLP 2025 are finally out!

04.07.2025 09:37 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

🚨 Excited to announce two invited speakers at #BlackboxNLP 2025!

Join us to hear from two leading voices in interpretability:
πŸŽ™οΈ Quanshi Zhang (Shanghai Jiao Tong University)
πŸŽ™οΈ Verna Dankers (McGill University)

β€ͺ@vernadankers.bsky.social‬

04.07.2025 08:14 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

That’s a great idea, thank you for the suggestion! We’ll make sure to keep the channel going beyond the deadline

03.07.2025 06:43 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Join the BlackboxNLP Shared Task Discord Server! Check out the BlackboxNLP Shared Task community on Discord - hang out with 54 other members and enjoy free voice and text chat.

πŸ“œ Check out the full task description: blackboxnlp.github.io/2025/task/

πŸ“… Submission deadline: August 1

πŸ“’ Join the discord server for ideas, brainstorming, and q&a: discord.gg/n5uwjQcxPR

01.07.2025 16:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

A typical pipeline:
β€’ Build contrastive input pairs differing only in the target variable.
β€’ (If supervised) train the featurizer on these pairs.
β€’ To evaluate: Transform activation, intervene in feature space, transform back out, and check if behavior shifts as expected.

01.07.2025 16:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

One month to go! ⏰
Working on featurization methods - ways to transform LM activations to better isolate causal variables?
Submit your work to the Causal Variable Localization Track of the MIB Shared Task!

01.07.2025 16:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Working on the MIB shared task?
Join the discord server: discord.gg/n5uwjQcxPR

πŸ” Check out submission ideas
πŸ” Brainstorm possible directions
πŸ” Ask questions and get help with setup issues

Full task description: blackboxnlp.github.io/2025/task/

30.06.2025 08:32 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

Check out the full task description: blackboxnlp.github.io/2025/task/

Important dates:
πŸ—“οΈ Deadline for results: August 1
πŸ“„ Technical report: August 8

Join the discord server: discord.gg/n5uwjQcxPR

24.06.2025 14:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

The Circuit Localization Track benchmarks methods for discovering causal circuits, subgraphs of a model responsible for specific behavior.

These methods typically:
β€’ Score model components or edges
β€’ Ablate all but the top-ranked ones
β€’ Evaluate the performance of the resulting subgraph

24.06.2025 14:24 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Working on circuit discovery in LMs?
Consider submitting your work to the MIB Shared Task, part of #BlackboxNLP at @emnlpmeeting.bsky.social 2025!

The goal: benchmark existing MI methods and identify promising directions to precisely and concisely recover causal pathways in LMs >>

24.06.2025 14:24 β€” πŸ‘ 5    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

If you're working on feature attribution, circuit discovery, feature alignment, sparse coding, or related methods - this is for you!

πŸ—“οΈ Aug 1 - Deadline for results
πŸ—“οΈ Aug 8 - Deadline for technical report

More details: blackboxnlp.github.io/2025/task/
Join the discord server: discord.gg/n5uwjQcxPR

23.06.2025 14:45 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

The task builds on the new Mechanistic Interpretability Benchmark (MIB) by Mueller* & Geiger* et al. (2025), with two tracks:
* Circuit Localization – identify subgraphs that carry out specific computations
* Causal Variable Localization – align internal representations with known causal factors

23.06.2025 14:45 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Have you heard about this year's shared task? πŸ“’

Mechanistic Interpretability (MI) is quickly advancing, but comparing methods remains a challenge. This year at #BlackboxNLP, we're introducing a shared task to rigorously evaluate MI methods in language models 🧡

23.06.2025 14:45 β€” πŸ‘ 16    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1

Hi Marco, if you refer to the regular CfP, it is already available on our website, and OpenReview links will be published in the upcoming weeks. The call for submissions for the shared task is also out, and submissions should be performed through the MIB leaderboard system for later evaluation.

18.05.2025 18:18 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Interested in mechanistic interpretability and care about evaluation? Please consider submitting to our shared task at #blackboxNLP this year!

15.05.2025 09:57 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

Have a look at our website for more information! blackboxnlp.github.io/2025

15.05.2025 08:21 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@blackboxnlp is following 20 prominent accounts