Antonia Wüst's Avatar

Antonia Wüst

@toniwuest.bsky.social

PhD student at AIML Lab TU Darmstadt Interested in concept learning, neuro-symbolic AI and program synthesis

44 Followers  |  45 Following  |  6 Posts  |  Joined: 18.02.2025  |  1.3139

Latest posts by toniwuest.bsky.social on Bluesky

Post image

Can concept-based models handle complex, object-rich images? We think so! Meet Object-Centric Concept Bottlenecks (OCB) — adding object-awareness to interpretable AI. Led by David Steinmann w/ @toniwuest.bsky.social & @kerstingaiml.bsky.social .
📄 arxiv.org/abs/2505.244...
#AI #XAI #NeSy #CBM #ML

07.07.2025 15:55 — 👍 10    🔁 4    💬 0    📌 0

I'll be at #ICML2025 next week presenting our recent work on VLMs and Bongard Problems! Feel free to reach out, happy to have a chat ☺️

12.07.2025 12:17 — 👍 3    🔁 0    💬 0    📌 0

Work together with my amazing co-authors @philosotim.bsky.social
Lukas Helff @ingaibs.bsky.social @wolfstammer.bsky.social @devendradhami.bsky.social @c-rothkopf.bsky.social @kerstingaiml.bsky.social ! ✨

02.05.2025 08:00 — 👍 4    🔁 1    💬 0    📌 0
Post image

We also identified 10 particularly challenging Bongard Problems that none of the models could solve under any setting. The challenge remains wide open!
3 examples of the challenging BPs:

02.05.2025 07:57 — 👍 2    🔁 1    💬 1    📌 1
Post image

Interestingly, success in solving the BPs (Open Question) doesn't translate to correctly categorizing individual images 👉 the sets of BPs solved in each task are not the same!
This suggests that getting the right final answer doesn’t always mean genuine understanding 🤔

02.05.2025 07:55 — 👍 1    🔁 1    💬 1    📌 0
Post image

Our evaluation shows the top-performing model (o1) solved 43 out of 100 problems, with the others trailing far behind. There’s still a long way to go for current AI models!

02.05.2025 07:53 — 👍 0    🔁 1    💬 1    📌 0
Post image

Excited to share that our paper got accepted at #ICML2025!! 🎉

We challenge Vision-Language Models like OpenAI’s o1 with Bongard problems, classic visual reasoning challenges and uncover surprising shortcomings.

Check out the paper: arxiv.org/abs/2410.19546
& read more below 👇

02.05.2025 07:47 — 👍 24    🔁 10    💬 1    📌 1

@toniwuest is following 20 prominent accounts