Sardine Lab's Avatar

Sardine Lab

@sardine-lab-it.bsky.social

SARDINE (Structure AwaRe moDelIng for Natural LanguagE) is a research group at Instituto de Telecomunicações and Instituto Superior Técnico, in Lisbon, Portugal, led by André Martins.

27 Followers  |  20 Following  |  3 Posts  |  Joined: 20.12.2024  |  1.3884

Latest posts by sardine-lab-it.bsky.social on Bluesky

Figure 1 of the paper shows that the metrics score the masculine forms higher than the feminine or neutral ones.

Figure 1 of the paper shows that the metrics score the masculine forms higher than the feminine or neutral ones.

As neural metrics are a pillar for #MT, being extensively used for evaluation but also improving translation, we'd want them to be fair.

🚨 Our #ACL2025 paper shows they consistently, unduly favor masculine-inflected translations, or gendered forms, over neutral ones.

arxiv.org/pdf/2410.10995

14.07.2025 14:00 — 👍 11    🔁 8    💬 1    📌 1
Preview
Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding Despite recent progress in vision-language models (VLMs), holistic understanding of long-form video content remains a significant challenge, partly due to limitations in current benchmarks. Many focus...

New paper from
Manos Zaranis, @tozefarinhas.bsky.social
and other sardines!!🚀

Meet MF²: Movie Facts & Fibs: a new benchmark for long-movie understanding

This benchmark focuses on narrative understanding (key events, emotional arcs, causal chains) in long movies.

Paper: arxiv.org/abs/2506.06275

24.06.2025 15:51 — 👍 0    🔁 0    💬 0    📌 0
LxMLS 2025 - The 15th Lisbon Machine Learning Summer School

Applications for the 2025 Lisbon Machine Learning Summer School (LxMLS) are open, with @andre-t-martins.bsky.social as one of the organizers.
LxMLS is a great opportunity to learn from top speakers and to interact with other students. You can apply for a scholarship.

Apply here:
lxmls.it.pt/2025/

28.02.2025 15:35 — 👍 2    🔁 1    💬 0    📌 0

📣 New paper alert! We released a new safety benchmark for VLMs with a core focus on test cases that become unsafe by combining text and images.

TL;DR: many modern VLMs are unsafe across various types of queries and languages.

arxiv.org/abs/2501.10057
huggingface.co/datasets/fel...

22.01.2025 13:43 — 👍 14    🔁 1    💬 0    📌 0
Preview
$\infty$-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Current video-language models struggle with long-video understanding due to limited context lengths and reliance on sparse frame subsampling, often leading to information loss. This paper introduces $...

🎉 New paper by Saul Santos in collaboration with @tozefarinhas.bsky.social and @andre-t-martins.bsky.social!! 🎉

∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation

Paper: arxiv.org/abs/2501.19098

03.02.2025 12:22 — 👍 8    🔁 2    💬 0    📌 0

@sardine-lab-it is following 20 prominent accounts