BlackboxNLP's Avatar

BlackboxNLP

@blackboxnlp.bsky.social

The largest workshop on analysing and interpreting neural networks for NLP. BlackboxNLP will be held at EMNLP 2025 in Suzhou, China blackboxnlp.github.io

184 Followers  |  297 Following  |  44 Posts  |  Joined: 13.05.2025  |  1.9721

Latest posts by blackboxnlp.bsky.social on Bluesky

NicolΓ² & Mingyang: Can we understand which circuits emerge in small models and reasoning-tuned systems, and how do they compare with default systems? Are there methods that generalize better across all tasks?

09.11.2025 07:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Q: What's next for interpretability benchmarks? Michal: People sitting together and planning how to extend tests to multimodal, diverse contexts. @michaelwhanna.bsky.social: For circuit finding, integrating sparse features circuits could help us better understand our models.

09.11.2025 07:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

NicolΓ² & Mingyang: Starting to explore notebooks and public libraries can be very helpful in gaining early intuitions about what's promising.

09.11.2025 07:16 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@michaelwhanna.bsky.social: Don't try to read everything. Find Qs you really care about, and go a level deeper to answer meaningful questions.

09.11.2025 07:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Q: How would one go about approaching interpretability research these days? Michal: "When things don't work out of the box, it's a sign to double down and find out why. Negative results are important!"

09.11.2025 07:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@danaarad.bsky.social: As deep learning research converges on similar architectures for different modalities, it will be interesting to determine which interpretability method will remain useful across various models and tasks.

09.11.2025 07:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@michaelwhanna.bsky.social, NicolΓ² & Mingyang: Counterfactuals in minimal settings can be helpful, but they do not capture the whole story. Extending current methods to long contexts, and finding practical applications in safety-related areas are exciting challenges ahead.

09.11.2025 07:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Michal: Mechanistic interpretability has heavily focused on toy tasks and text-only models. The next step is scaling to more complex tasks that involve real-world reasoning.

09.11.2025 07:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Our panel moderated by @danaarad.bsky.social
"Evaluating Interpretability Methods: Challenges and Future Directions" just started! πŸŽ‰ Come to learn more about the MIB benchmark and hear the takes of @michaelwhanna.bsky.social, Michal Golovanevsky, NicolΓ² Brunello and Mingyang Wang!

09.11.2025 06:54 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1
Post image

Next up: Kentaro Ozeki presenting "Normative Reasoning in Large Language Models: A Comparative Benchmark from Logical and Modal Perspectives" aclanthology.org/2025.blackbo...

09.11.2025 06:32 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

After a productive poster session, BlackboxNLP returns with the second keynote "Memorization: Myth or Mystery?" by @vernadankers.bsky.social!

09.11.2025 05:48 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Nadav Shani is giving the first oral presentation of the day: Language Dominance in Multilingual Large Language Models. Find the paper here: aclanthology.org/2025.blackbo...

09.11.2025 02:19 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Next up: Circuit-Tracer: A New Library for Finding Feature Circuits presented by @michaelwhanna.bsky.social! Paper: aclanthology.org/2025.blackbo...

09.11.2025 02:17 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'll be presenting this work at @blackboxnlp.bsky.social in Suzhou, happy to chat there or here if you are interested !

22.10.2025 08:16 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Nov 9, @blackboxnlp.bsky.social , 11:00-12:00 @ Hall C – Interpreting Language Models Through Concept Descriptions: A Survey (Feldhus & Kopf) @lkopf.bsky.social

πŸ—žοΈ aclanthology.org/2025.blackbo...

bsky.app/profile/nfel...

06.11.2025 07:00 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Post image

Quanshi Zhang is giving the first keynote of the day: Can Neural Network Interpretability Be the Key to Breaking Through Scaling Law Limitations in Deep Learning?

09.11.2025 01:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

BlackboxNLP is up and running! Here's the topics covered by this year's edition at a glance. Excited to see so many interesting topics, and the growing interest in reasoning!

09.11.2025 01:38 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Post image

πŸ“’ Call for Papers! πŸ“’
#BlackboxNLP 2025 invites the submission of archival and non-archival papers on interpreting and explaining NLP models.

πŸ“… Deadlines: Aug 15 (direct submissions), Sept 5 (ARR commitment)
πŸ”— More details: blackboxnlp.github.io/2025/call/

12.08.2025 19:10 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 0    πŸ“Œ 3

Writing your technical report for the MIB shared task?
Take a look at the task page for guidelines and tips!

06.08.2025 09:51 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

The report deadline was also extended to August 10th!
Note that this is a final extension. We look forward to reading your reports! ✍️

06.08.2025 09:49 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

Just 5 days left to submit your method to the MIB Shared Task at #BlackboxNLP!

Have last-minute questions or need help finalizing your submission?
Join the Discord server: discord.gg/n5uwjQcxPR

03.08.2025 06:40 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

Results + technical report deadline: August 8, 2025
Full task details: blackboxnlp.github.io/2025/task/

30.07.2025 05:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

With the new extended deadline, there's still plenty of time to submit your method to the MIB Shared Task!

We welcome submissions of existing methods, experimental POCs, or any approach addressing circuit discovery or causal variable localization πŸ’‘

30.07.2025 05:57 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Results deadline extended by one week!
Following requests from participants, we’re extending the MIB Shared Task submission deadline by one week.

πŸ—“οΈ New deadline: August 8, 2025
Submit your method via the MIB leaderboard!

29.07.2025 09:35 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 2
Post image

πŸ“ Technical report guidelines are out!

If you're submitting to the MIB Shared Task at #BlackboxNLP, feel free to take a look to help you prepare your report: blackboxnlp.github.io/2025/task/

28.07.2025 12:34 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Post image

Just 10 days to go until the results submission deadline for the MIB Shared Task at #BlackboxNLP!

If you're working on:
🧠 Circuit discovery
πŸ” Feature attribution
πŸ§ͺ Causal variable localization
now’s the time to polish and submit!

Join us on Discord: discord.gg/n5uwjQcxPR

23.07.2025 07:42 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1

Are you attending ICML? πŸ‘€

I'm sadly not, but if you are, you should check out the MIB πŸ•ΆοΈposter at 11AM: icml.cc/virtual/2025...

The benchmark is used as the shared task at this year's
@blackboxnlp.bsky.social (blackboxnlp.github.io/2025/task/) - there's still time to participate πŸ†

17.07.2025 15:56 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

⏳ Three weeks left! Submit your work to the MIB Shared Task at #BlackboxNLP, co-located with @emnlpmeeting.bsky.social

Whether you're working on circuit discovery or causal variable localization, this is your chance to benchmark your method in a rigorous setup!

13.07.2025 05:56 β€” πŸ‘ 4    πŸ” 2    πŸ’¬ 0    πŸ“Œ 2

Have you started working on your submission for the MIB shared task yet? Tell us what you’re exploring!

New featurization methods?
Circuit pruning?
Better feature attribution?

We'd love to hear about it πŸ‘‡

09.07.2025 07:15 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
BlackboxNLP 2025 The Eight Workshop on Analyzing and Interpreting Neural Networks for NLP

πŸ—“οΈ Deadline: August 1
πŸ“œ Full task details: blackboxnlp.github.io/2025/task/
πŸ’¬ Join the discussion: discord.gg/n5uwjQcxPR

08.07.2025 09:35 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@blackboxnlp is following 20 prominent accounts