@simonschrodi.bsky.social
PhD student @cvisionfreiburg.bsky.social @UniFreiburg, interested in mechanistic interpretability, robustness, AutoML & ML for climate science. https://simonschrodi.github.io/
Big thanks to our amazing co-authors: Max Argus, Volker Fischer, and @thomasbrox.bsky.social!
20.04.2025 14:24
Even better, if you're at #ICLR2025 next week:
- Poster: April 24, 10 a.m.-12:30 p.m., Hall 3 + Hall 2B (#481)
- Oral: April 24, 4:30 p.m.-4:42 p.m., Garnet 213-215 (oral session 2B)
- Or just catch us over coffee!
Curious to dive deeper?
- Paper: openreview.net/forum?id=uAF...
- Code: github.com/lmb-freiburg...
- DM me or David (he's not on Bluesky, but you can DM him on other platforms)!
But what is the modality gap good for? Interestingly, we find that it affects the model's entropy, suggesting it might not be a bug, but a feature.
20.04.2025 14:24
Our paper is packed with surprising and insightful findings about both phenomena. Most notably, we show that both effects stem from the information imbalance between the image and text modalities, and that both are reduced when that imbalance decreases.
20.04.2025 14:24
In this work, we investigate two undesired properties of CLIP-like models:
- Modality gap: a complete separation of image and text embeddings in the shared embedding space.
- Object bias: a tendency to focus on objects over other semantic aspects like attributes.
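To make the first definition concrete, here is a minimal sketch (not the paper's code) of one common way the modality gap is quantified: the distance between the centroids of the L2-normalized image and text embeddings. The data below is synthetic and purely illustrative, standing in for real CLIP embeddings.

```python
import numpy as np

def modality_gap(image_emb: np.ndarray, text_emb: np.ndarray) -> float:
    """Euclidean distance between the centroids of L2-normalized
    image and text embeddings; larger means a wider gap."""
    img = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    txt = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    return float(np.linalg.norm(img.mean(axis=0) - txt.mean(axis=0)))

# Synthetic stand-ins: two Gaussian clouds offset in opposite
# directions along one axis, mimicking the separation of the two
# modalities in the shared embedding space.
rng = np.random.default_rng(0)
d = 512
image_emb = rng.normal(size=(100, d))
image_emb[:, 0] += 5.0
text_emb = rng.normal(size=(100, d))
text_emb[:, 0] -= 5.0

print(f"modality gap: {modality_gap(image_emb, text_emb):.3f}")
```

With fully overlapping clouds the gap shrinks toward zero; the offset above produces a clearly nonzero gap, mirroring the "complete separation" described in the post.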
David Hoffmann and I will present our joint work, "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models," this Thursday as an Oral (top 1.8%) at #ICLR2025!
🧵
Hello world!
26.11.2024 09:39
Hi Julian, could you please add me? I work on interpretability & data-centric ML for multi-modal models
24.11.2024 18:27
A starter pack of people working on interpretability / explainability of all kinds, using theoretical and/or empirical approaches.
Reply or DM if you want to be added, and help me reach others!
go.bsky.app/DZv6TSS