11/11
This is joint work with Adam WrΓ³bel (project lead), Jacek Tabor, Bernt Schiele, Bartosz ZieliΕski, and Dawid Rymarczyk.
π°Preprint: arxiv.org/abs/2602.06613
π»Code: github.com/a-vrobell/DAVE [to be released soon]
@sidgairo18.bsky.social
ποΈππ©πͺ πͺπΊ PhD student at the Max Planck Institute for Informatics, and Institute of Science & Technology - Austria. π»ππ»ββοΈπ΄π»ππ»πβ·οΈπΈπΉπ Webpage: https://sidgairo18.github.io/
11/11
This is joint work with Adam WrΓ³bel (project lead), Jacek Tabor, Bernt Schiele, Bartosz ZieliΕski, and Dawid Rymarczyk.
π°Preprint: arxiv.org/abs/2602.06613
π»Code: github.com/a-vrobell/DAVE [to be released soon]
10/11 πΌοΈ Qualitative comparison
Vs prior methods, DAVE gives sharper, more object-centric attributions with less background + fewer patch-grid artifacts.
9/11 π Beyond standard ViTs
On inherently interpretable B-cos ViTs, DAVE yields sharper, more object-aligned maps and improves localization vs built-in B-cos explanations.
B-cos ViTs produce sharp attributions only when trained with a conv-stem, DAVE fixes this reliance.
8/11 π Main results
Consistent improvements on localization metrics and perturbation evaluations.
7/11 π§ The pipeline
Sample small spatial transforms + noise, compute effective transformation (conditioned forward blocks gradients through conditioning), inverse-transform & average, then apply element-wise to the input.
6/11 π«οΈ Stabilize (low-pass)
DAVE adds low-pass stabilization by averaging the equivariant effective transformation under small input perturbations (Gaussian smoothing in expectation).
This removes components unstable to tiny input changes. (See above fig., last column.)
5/11 π§Ό Remove grid artifacts
Even the effective transformation can carry architecture-induced grid patterns.
DAVE filters them by enforcing local equivariance: under small spatial transforms, the attribution must transform consistently.
4/11 β οΈ Why gradients break
Operator variation can amplify tiny perturbations β high-frequency junk in attributions.
DAVE drops this term and keeps the effective transformation as a cleaner attribution operator.
3/11 π‘ Key idea
Model each ViT layer as an input-dependent linear operator L(X) applied to X.
Then the input-gradient decomposes into:
(1) effective transformation L(X)
(2) operator variation (how L changes w.r.t. X)
2/11 π§© The problem
ViT components (patch embedding, attention routing, etc.) inject structured artifacts into gradients β explanations become noisy/unstable, or methods fall back to coarse patch-level maps.
πNew preprint: DAVE β Distribution-aware Attribution via ViT Gradient DEcomposition.
1/11 π Whatβs new:
We fix a persistent issue in ViT explainability: unstable, artifact-heavy pixel attributions. DAVE yields fine-grained pixel-level maps without patch-grid saliency.
ICCV 2025 πΊ Aloha from Hawaii! MPI-INF (D2) is presenting 4 papers this year (one Highlight). Thread π
19.10.2025 07:48 β π 13 π 6 π¬ 1 π 0
Heading to #ICCV2025? Donβt miss #CV4DC β Oct 19, Honolulu πΊ An in-person workshop spotlighting research for/with developing regions.
See exciting keynote talks from leading researches!
See accepted papers & posters here: cv4dc.github.io/2025/
#AI4Good
Hope these help you as much as theyβve helped me.
πFeel free to share, bookmark, and contribute here (github.com/sidgairo18/s...)!
P.S.: Please feel free to share relevant resources in the comments / thread and I'll add them as well π (n/n)
3οΈβ£ How to Write Academic Papers
A practical checklist-driven guide on writing with clarity, rigor, and reproducibility. Inspired by ICML's best practices and more.
πhttps://sidgairo18.github.io/how_to_write_academic_papers.html (4/n)
2οΈβ£ How to Review Scientific Papers
What makes a good review? This guide compiles best practices from ICML, ICLR, CVPR, and other leading conferences.
πhttps://sidgairo18.github.io/how_to_review_scientific_papers.html (3/n)
1οΈβ£ How to Do Research
Mindset, habits, tools, writing, productivity, and advice.
πhttps://sidgairo18.github.io/notes_and_resources_on_how_to_do_research.html (2/n)
πHere are some resources and notes about doing research that Iβve been compiling since I started my PhD.π
I rely on these often, and after sharing them with a few folks in my group who found them useful, I thought it could be of use to the broader community π§΅π (1/n)
β³Still need to wait for your last experiment results?
π£ We're pleased to announce that the deadline for non-proceeding track #CV4DC at @iccv.bsky.social has been extended to August 15, 2025
Looking forward to your submissions! cv4dc.github.io/2025/
#ICML2025 Stats
15.07.2025 02:04 β π 11 π 2 π¬ 1 π 1Papers accepted at ICML 2025 from the Computer Vision and Machine Learning Department at the Max Planck Institute for Informatics.
Papers being presented from our group at #ICML2025!
Congratulations to all the authors! To know more, visit us in the poster sessions!
A π§΅with more details:
@icmlconf.bsky.social @mpi-inf.mpg.de
Submit your papers to our non-proceeding track here: cv4dc.github.io/2025/call-fo...
12.07.2025 09:11 β π 0 π 0 π¬ 0 π 0
π£ Proceeding track's results are out.
π Congratulations to all the authors whose papers were accepted. We can't wait to meet you at @iccv.bsky.social in Hawaii on Oct 19th.
β° Our non-proceeding track is still accepting submissions until July 20th! Details in the comments
Cc: @mpi-inf.mpg.de
06.07.2025 08:19 β π 0 π 0 π¬ 0 π 0
Porto, Portugal π΅πΉ June-July 2025
One of the prettiest and liveliest cities Iβve traveled to β¨ π€
Congratulations Matt! π₯
26.06.2025 10:01 β π 0 π 0 π¬ 0 π 0πTry for yourself π: github.com/sidgairo18/p...
23.06.2025 16:27 β π 0 π 0 π¬ 0 π 0