Computer Vision and Machine Learning at MPI Informatics

@cvml.mpi-inf.mpg.de

Computer Vision and Machine Learning Department at the Max Planck Institute for Informatics | https://www.mpi-inf.mpg.de/departments/computer-vision-and-machine-learning/

72 Followers  |  39 Following  |  40 Posts  |  Joined: 12.03.2025

Latest posts by cvml.mpi-inf.mpg.de on Bluesky

People often use synthetic corruptions to test model robustness, but do these reflect real-world challenges?

We explore this in detail in our CVPR 2025 Workshop paper:

Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?

arxiv.org/abs/2505.04835

by: @margretkeuper.bsky.social

24.07.2025 09:16 | 👍 1 | 🔁 1 | 💬 0 | 📌 0

🚨Deadline Extension Alert!

Our Non-proceedings track is open till August 15th for the eXCV workshop at ICCV.

Our nectar track accepts published papers, as is.

More info at: excv-workshop.github.io

@iccv.bsky.social #ICCV2025

18.07.2025 09:31 | 👍 5 | 🔁 5 | 💬 1 | 📌 0

⏳ Still need to wait for your last experiment results?
📣 We're pleased to announce that the deadline for the non-proceedings track of #CV4DC at @iccv.bsky.social has been extended to August 15, 2025.

Looking forward to your submissions! cv4dc.github.io/2025/

24.07.2025 06:20 | 👍 2 | 🔁 2 | 💬 0 | 📌 0

🌀Spatial Reasoners

📄 spatialreasoners.github.io
🔗 github.com/spatialreaso...

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

4/ "🌀Spatial Reasoners for Continuous Variables in Any Domains" by @bartpog.bsky.social, @chriswewer.bsky.social, Bernt Schiele, and @janericlenssen.bsky.social (CODEML Workshop)

🔍 Software framework for training Spatial Reasoning Models in any domain

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

🔍 Can you really trust the explanations your classifier gives you? We show which pixels in the input are provably important to the classifier's prediction within a radius around the input.

📄 openreview.net/pdf?id=NngoE...
🔗 github.com/AlaaAnani/ce...

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

3/ "Pixel-level Certified Explanations via Randomized Smoothing" by @aanani.bsky.social, Tobias Lorenz, Mario Fritz, and Bernt Schiele

13.07.2025 08:00 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

Spatial Reasoning with Denoising Models
We introduce Spatial Reasoning Models (SRMs), a framework to perform reasoning over sets of continuous variables via denoising generative models. SRMs infer continuous representations on a set of unob...

📄 arxiv.org/abs/2502.21075
🔗 geometric-rl.mpi-inf.mpg.de/srm/
🔗 github.com/Chrixtar/SRM

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

2/ "Spatial Reasoning with Denoising Models" by @chriswewer.bsky.social, @bartpog.bsky.social, Bernt Schiele, and @janericlenssen.bsky.social

🔍 Can image generators solve visual Sudoku? Naively, no. But with sequentialization and the correct order, they can!

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

DCBM: Data-Efficient Visual Concept Bottleneck Models
Concept Bottleneck Models (CBMs) enhance the interpretability of neural networks by basing predictions on human-understandable concepts. However, current CBMs typically rely on concept sets extracted ...

📄 arxiv.org/abs/2412.11576
🔗 github.com/KathPra/DCBM

13.07.2025 08:00 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

1/ "DCBM: Data-Efficient Visual Concept Bottleneck Models" by @katharinaprasse.bsky.social*, @patrickknab.bsky.social*, Sascha Marton, Christian Bartelt, and @margretkeuper.bsky.social

🔍 Data-efficient CBMs (DCBMs) generate concepts from image regions detected by segmentation or detection models

13.07.2025 08:00 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

Papers accepted at ICML 2025 from the Computer Vision and Machine Learning Department at the Max Planck Institute for Informatics.

Papers being presented from our group at #ICML2025!

Congratulations to all the authors! To learn more, visit us at the poster sessions!

A 🧵 with more details:

@icmlconf.bsky.social @mpi-inf.mpg.de

13.07.2025 08:00 | 👍 21 | 🔁 5 | 💬 2 | 📌 0

📣 Proceedings track results are out.
🎉 Congratulations to all the authors whose papers were accepted. We can't wait to meet you at @iccv.bsky.social in Hawaii on Oct 19th.

⏰ Our non-proceedings track is still accepting submissions until July 20th! Details in the comments.

12.07.2025 09:11 | 👍 7 | 🔁 2 | 💬 1 | 📌 1

🎉 Congrats to Yue Fan on defending his PhD: "Improving Representation Learning from Data and Model Perspectives: Semi-Supervised Learning and Foundation Models" 🧑‍🎓

He is now at Genmo.ai as a Research Engineer working on video generation! 🚀

More: yue-fan.github.io

All the best!

05.07.2025 20:47 | 👍 8 | 🔁 0 | 💬 1 | 📌 0

Congratulations to our PhD alumna @annakukleva.bsky.social for being awarded the prestigious Otto Hahn Medal by @maxplanck.de! 🎉

01.07.2025 21:44 | 👍 8 | 🔁 1 | 💬 0 | 📌 0

🚀 Just accepted to ICCV 2025!

In DIY-SC, we improve foundational features using a lightweight adapter trained with carefully filtered and refined pseudo-labels.

🔧 Drop-in alternative to plain DINOv2 features!
📦 Code + pre-trained weights available now.
🔥 Try it in your next vision project!

26.06.2025 14:28 | 👍 9 | 🔁 2 | 💬 1 | 📌 0

Submission deadline is today!

26.06.2025 11:03 | 👍 4 | 🔁 1 | 💬 0 | 📌 0

Submission Deadline is extended by 6 days.

#ICCV2025 @iccv.bsky.social

23.06.2025 12:14 | 👍 8 | 🔁 5 | 💬 0 | 📌 0

@mattiasegu.bsky.social

26.06.2025 09:55 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Heartfelt congratulations to the freshly minted Dr. Mattia Segù on successfully defending his PhD. Congratulazioni!!! 🎉 🎓

His thesis is titled: Learning to Track: From Limited Supervision to Long-range Sequence Modeling

Check out his webpage to learn more about his work: mattiasegu.github.io

26.06.2025 09:55 | 👍 5 | 🔁 0 | 💬 2 | 📌 0

Call for papers at the eXCV workshop at ICCV 2025.

Join us in taking stock of the state of the field of explainability in computer vision, at our Workshop on Explainable Computer Vision: Quo Vadis? at #ICCV2025!

@iccv.bsky.social

14.06.2025 15:47 | 👍 13 | 🔁 5 | 💬 1 | 📌 1

At #CVPR2025 and working on consistency in video and multi-view generative models?

Come and visit our poster on Friday afternoon, where I present MEt3R: Measuring Multi-View Consistency in Generated Images

@mohammadasim98.bsky.social @wimmerthomas.bsky.social @mpi-inf.mpg.de @cvml.mpi-inf.mpg.de

12.06.2025 22:38 | 👍 17 | 🔁 1 | 💬 2 | 📌 0

7/ 🧵 ...
Workshop: Women in Computer Vision (WiCV)
📱 @sukrutrao.bsky.social @SwetaMahajan @MoritzBoehle

11.06.2025 20:44 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

7/ 🧵 Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery
Authors: S. Rao, S. Mahajan, M. Böhle, B. Schiele
🔍 Explore sparse autoencoders to automatically extract and name concepts, enabling performance improvements on downstream tasks.
📚 arxiv.org/abs/2407.14499

11.06.2025 20:44 | 👍 2 | 🔁 1 | 💬 1 | 📌 0

6/ 🧵 ...
Workshop: Women in Computer Vision (WiCV)
📱 @tejaswinimedi.bsky.social @margretkeuper.bsky.social

11.06.2025 20:44 | 👍 3 | 🔁 1 | 💬 1 | 📌 0

6/ 🧵 3D-WAG: Wavelet-Guided Autoregressive Generation for 3D Shapes
Authors: T. Medi*, A. Rampini, P. Reddy, P. K. Jayaraman, M. Keuper
🔍 3D-WAG introduces wavelet-guided autoregressive generation for 3D shapes, aiming for better geometry modeling.
📚 arxiv.org/abs/2411.19037

11.06.2025 20:44 | 👍 3 | 🔁 1 | 💬 1 | 📌 0

5/ 🧵 ...
Workshop: Explainable AI for Computer Vision (XAI4CV)
📱 @katharinaprasse.bsky.social @smarton.bsky.social @margretkeuper.bsky.social

11.06.2025 20:44 | 👍 2 | 🔁 0 | 💬 1 | 📌 0

5/ 🧵 Data-Efficient Visual Concept Bottleneck Models
Authors: K. Prasse, P. Knab, S. Marton, C. Bartelt, M. Keuper
🔍 Introducing data-efficient visual concept bottleneck models for improved explainability in CV.
📚 arxiv.org/abs/2412.11576

11.06.2025 20:44 | 👍 2 | 🔁 0 | 💬 1 | 📌 0

4/ 🧵 ...
Workshops: What is Next in Multimodal Foundation Models? | Women in Computer Vision
📱 @maheensaleh.bsky.social @ninashv.bsky.social @annakukleva.bsky.social @hildekuehne.bsky.social

11.06.2025 20:44 | 👍 3 | 🔁 1 | 💬 1 | 📌 1

4/ 🧵 HD-VILA-Caption: A Diverse Video-Text Dataset Derived from ASR Narrations
By: M. Saleh, N. Shvetsova, A. Kukleva, H. Kuehne, B. Schiele
🔍 HD-VILA-Caption is a large-scale, diverse video-text dataset with 10M high-quality captions, built from ASR subtitles for video-language pretraining.

11.06.2025 20:44 | 👍 2 | 🔁 0 | 💬 1 | 📌 0
