Imanol Miranda @imirandam - Bluesky Profile

Imanol Miranda

@imirandam.bsky.social

PhD student at HiTZ Zentroa (@hitz-zentroa.bsky.social) / IXA Group and the University of Basque Country (@upvehu.bsky.social).

14 Followers | 39 Following | 6 Posts | Joined: 20.03.2025 | 1.6033

Latest posts by imirandam.bsky.social on Bluesky

Key takeaway: Adding simple structure at inference-time, through image crops and text segments, is a powerful, training-free way to improve Vision-Language Compositionality performance.

Joint work with @Ander Salaberria @eagirre.bsky.social @gazkune.bsky.social @hitz-zentroa.bsky.social

18.06.2025 11:28 — 👍 2 🔁 1 💬 0 📌 0

Our analysis shows that:
1. There is room to improve the quality of extracted text segments.
2. Our method achieves significant performance gains in Winoground's non-trivial instances.
3. Isolated image crops can lose size and quantity information, leaving room for improvement.

18.06.2025 11:28 — 👍 2 🔁 1 💬 1 📌 0

Why are image crops crucial? 🤔 We found that simply adding text segments isn't enough. The biggest performance gains come when text segments are paired with image crops, proving the power of serial image computing.

18.06.2025 11:28 — 👍 1 🔁 1 💬 1 📌 0

We've evaluated it across three diverse datasets: BiVLC, Winoground (171 instances), and BiSCoR-Ctrl. See the significant improvements by inference-time approach (ITA) on three existing models:

18.06.2025 11:28 — 👍 1 🔁 0 💬 1 📌 0

Our approach is straightforward yet effective:
1. Divide the image into smaller crops.
2. Extract text segments capturing objects, attributes and relations.
3. Use the VLM to find image crops that best fit the text segments.
4. Aggregate matching similarities for the final score.

18.06.2025 11:28 — 👍 2 🔁 1 💬 1 📌 0

#newHitzPaper
Can a simple inference-time approach unlock better Vision-Language Compositionality?🤯
Our latest paper shows how adding structure at inference significantly boosts performance in popular dual-encoder VLMs on different datasets.

Read more: arxiv.org/abs/2506.09691

18.06.2025 11:28 — 👍 6 🔁 3 💬 1 📌 1

@imirandam is following 20 prominent accounts

Danny Ki
@dannykii

Co-founder @ That’s Gonna Help | Growth & Automation Strategist | Web3, FinTech | AI Agents & Data Engineering 🧠 Rare longreads: https://substack.com/@dannyki

Borja Calvo
@b0rx

Lecturer at the Informatics Faculty, University of the Basque Country. Research in machine learning and optimization. Interest in the social impact of AI (and photography, cinematography, computer graphics, ...)

Gorka Urbizu Garmendia
@gorkaurbizu

Izal, Euskal Herria. (Basque Country) Researcher at orainlp.bsky.social -ko ikertzailea (PhD) #NLP: pretraining LMs & low-resource & tokenization 🇵🇲 🇪🇸 🏴󠁧󠁢󠁥󠁮󠁧󠁿 🔜 🇫🇷 🇳🇴 toka/el/he 🍉

Eneko Agirre
@eagirre

Prof. https://ehu.eus Head of HiTZ research center @hitz-zentroa.bsky.social, Spanish Informatics Research Prize 2021. ACL fellow @aclmeeting.bsky.social. Elected member https://jakiunde.eus I post in Basque, Spanish and English https://hitz.eus/eneko

Naiara Perez
@apinhole

#NLP researcher @hitz-zentroa.bsky.social

Oscar Sainz
@osainz

Postdoctoral Researcher at the University of the Basque Country (UPV/EHU).

Arturo
@arturocomputes

NLP PhD student at @naist-nlp.bsky.social

NAIST NLP
@naist-nlp

NLP lab at NAIST in Nara, Japan 🦌 nlp.naist.jp/en/

Lucas Beyer (bl16)
@giffmana.ai

Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://admonymous.co/giffmana 📍 Zürich, Suisse 🔗 http://lucasb.eyer.be

AI Conference DL Countdown
@dlcountdown

Bot. I daily tweet progress towards machine learning and computer vision conference deadlines. Maintained by @chriswolfvision.bsky.social

Hugging Face
@hf.co

The AI community building the future!

ACL
@aclmeeting

The Association for Computational Linguistics (ACL) is a scientific and professional organization for people working on Natural Language Processing/Computational Linguistics. Hash tags: #NLProc #ACL2025NLP

AACL
@aaclmeeting

The Asia-Pacific Chapter of the Association for Computational Linguistics The 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (AACL 2025) https://www.afnlp.org/conferences/ijcnlp2025 #AACL2025 #NLProc #NLP

NAACL
@naaclmeeting

Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics

EMNLP
@emnlpmeeting

EMNLP 2025 - The annual Conference on Empirical Methods in Natural Language Processing Dates: November 5-9, 2025 in Suzhou, China Hashtags: #EMNLP2025 #NLP Submission Deadline: May 19th, 2025

Yoshua Bengio
@yoshuabengio

Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila. A.M. Turing Award Recipient and most-cited AI researcher. https://lawzero.org/en https://yoshuabengio.org/profile/

🇺🇦 Alex Polozov
@alexpolozov.com

Sr. Staff Research Scientist @ Google DeepMind • previously Google X, Microsoft Research, UW • program synthesis, AI for Code and SWE • he/him • alexpolozov.com

Ai2
@ai2

Breakthrough AI to solve the world's biggest problems. › Join us: http://allenai.org/careers › Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm

Dr. Fei-Fei Li
@drfeifei

Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare

Katedra
@katedra.eus

🔹EHUko Kultura Zientifikoko Katedra. Zientzia zabaltzen. 🔹Cátedra de Cultura Científica de la EHU. Difundiendo ciencia. 🔗 https://katedra.eus Blogak/blogs: @culturacientifica.com @mappingignorance.org @zientziakaiera.eus @mujeresconciencia.com