๐ Can Vision-Language Models plan effectively?
We introduce ViPlan, a benchmark comparing:
๐น VLM-as-planner
๐น VLM-as-grounder
๐ Home robotics
๐งฑ Visual Blocksworld
Spoiler alert:โ Visual-Reasoning
๐ Pre-print here: arxiv.org/abs/2505.13180
#VLM #AI #NLProc
21.05.2025 08:44 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Evalita Llm Leaderboard - a Hugging Face Space by evalitahf
Duplicate this leaderboard to initialize your own!
๐ Exciting News! The Evalita-LLM Leaderboard is now live on Hugging Face! Explore the performance of over 40 Large Language Models on native Italian tasks. Dive in here: huggingface.co/spaces/evali...
@FondazioneBrunoKessler
@igenius
@diunito.bsky.social
#NLProc #LLM #AI #Italian #Benchmarking
08.04.2025 11:08 โ ๐ 0 ๐ 2 ๐ฌ 0 ๐ 0
Don't miss out โผ๏ธ Join us at Wired Health 2025 #WH25๐ญ Our Head Unit, Bernardo Magnini will take the stage to discuss "Artificial Intelligence and Clinical Data: The Future of Emergency Medicine." ๐ฅ
Full program here: lnkd.in/eKi44biT
@wired.com
#AI #healthcare #Innovation
13.03.2025 10:29 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Collab with Cimec (UniTrento)- @raffagbernardi.bsky.social , ItalianNLP_Lab - @alessiomiaschi.bsky.social and UniPisa (Alessandro Lenci, Lucia Passaro and Alessandro Bondielli)
27.02.2025 11:28 โ ๐ 3 ๐ 0 ๐ฌ 0 ๐ 1
All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark
We introduce MAIA (Multimodal AI Assessment), a native-Italian benchmark designed for fine-grained investigation of the reasoning abilities of visual language models on videos. MAIA differs from other...
Welcome to MAIA! ๐ Our new benchmark for evaluating multimodal reasoning in Vision-LMs on videos fully in Italian! ๐ฎ๐น MAIA tests understanding & generation with fine-grained reasoning categories and a brand-new evaluation metric! ๐๐ฅDiscover MAIA here: arxiv.org/abs/2502.16989
#NLProc #evaluation #AI
27.02.2025 09:34 โ ๐ 6 ๐ 2 ๐ฌ 1 ๐ 1
Thank you for supporting this project! ๐ช๐ป
25.02.2025 15:18 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Evalita-LLM: Benchmarking Large Language Models on Italian
We describe Evalita-LLM, a new benchmark designed to evaluate Large Language Models (LLMs) on Italian tasks. The distinguishing and innovative features of Evalita-LLM are the following: (i) all tasks ...
๐ **Exciting News!** ๐ Evalita-LLM is here! ๐ฎ๐น A new benchmark for evaluating LLMsโoffering native Italian tasks, generative challenges, and fair multi-prompt evaluations. Now also available in lm-evaluation harness by @eleutherai.bsky.social !
ArXiv: arxiv.org/abs/2502.02289
#NLProc #LLM #Evaluation
24.02.2025 17:07 โ ๐ 9 ๐ 3 ๐ฌ 0 ๐ 1
Our group leader took the stage at the FBK plenary session to showcase our research interests, ongoing projects, challenges and future plans. An exciting moment to share our vision and push the boundaries of NLP even further!
Hereโs a glimpse of the event! ๐ธโจ
#NLProc #AI #Research #FBK #Innovation
07.02.2025 14:35 โ ๐ 2 ๐ 1 ๐ฌ 0 ๐ 0
A big congratulations to Sofia Lugli, student of our Carlo Strapparava, for receiving a special mention for her thesis at
CLiC-it conference 2 weeks ago! ๐๐
We are incredibly proud of her achievement. Canโt wait to see what she accomplishes next. Well done Sofia!๐
#NLP #AI @ailc-nlp.bsky.social
20.12.2024 09:44 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 0
Data-LLM-Tutorial
You Are what You Eat Processing Data for Training and Evaluating LLMs Giovanni Bonetta and Bernardo Magnini Fondazione Bruno Kessler, Trento, Italy {gbonetta|magnini}@fbk.eu Tutorial at CLiC-it 2024, ...
Exciting news! ๐ If youโre curious about the opening #tutorial at CLiC-it 2024 conference made by our group on processing #data for #training and #evaluating #LLMs , hereโs your chance! ๐ Explore the slides and get inspired!!! ๐๐โจ
docs.google.com/presentation...
@ailc-nlp.bsky.social
17.12.2024 16:15 โ ๐ 3 ๐ 2 ๐ฌ 0 ๐ 1
too sweet darling!!! โค๏ธ
17.12.2024 12:08 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Here are our group leader Bernardo Magnini and postdoc
Giovanni Bonetta kicking off CLiC-it conference in #Pisa 2 weeks ago as invited speakers with an insightful #tutorial on #data: how to collect and use it for various purposes in #AI and #NLP !๐ก๐
Thanks to @ailc-nlp.bsky.social for inviting us!โจ
17.12.2024 10:37 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
#NLP Research group in action! Our group leader Bernardo Magnini, alongside @tizaino.bsky.social, Sofia Brenna, and Giovanni Bonetta, presenting their #paper "Are you a Good Assistant? Assessing LLM Trustability in Task-oriented Dialogues" at CLiC-it '24 #Pisa ๐โจ
@ailc-nlp.bsky.social
#NLProc #AI
16.12.2024 09:05 โ ๐ 7 ๐ 2 ๐ฌ 0 ๐ 1
๐โจ Happy Holidays from our Research Center! โจ๐
Our director @ferruccioresta shares a warm holiday message with all @FBK_research: gratitude for a year of outstanding achievements and best wishes for a joyful festive season and a 2025 filled with groundbreaking discoveries!
11.12.2024 23:25 โ ๐ 6 ๐ 0 ๐ฌ 0 ๐ 0
Exploring how new words convey novel meanings in ERC Consolidator project #BraveNewWord๐ง Unveiling language and cognition insights๐Join our research journey!
https://bravenewword.unimib.it/
#CoNLL2025 (co-located with ACL 2025)
conll.org/2025
July 31 & August 1, 2025
Una universitat compromesa a donar resposta als reptes globals i a desenvolupar talent en un entorn culturalment estimulant.
Gemma Boleda, Marco Baroni, Thomas Brochhagen, Iria de Dios Flores | Computational Linguistics and Linguistic Theory Universitat Pompeu Fabra.
upf.edu/web/colt
Barcelona
The MCML is a joint ยญresearch initiative of LMU Mรผnchen and TU Mรผnchen. It is institutionally funded by the Federal Ministry of Education and Research and the Free State of Bavaria.
IT-Universitetet i Kรธbenhavn er Danmarks fรธrende universitet med fokus pรฅ den digitale verden.
Center for Information and Language Processing (CIS): NLP research group at LMU Munich led by Hinrich Schuetze and @barbaraplank.bsky.social
MaiNLP research lab at CIS, LMU Munich directed by Barbara Plank @barbaraplank.bsky.social
Natural Language Processing | Artificial Intelligence | Computational Linguistics | Human-centric NLP
Prof, Chair for AI & Computational Linguistics,
Head of MaiNLP lab @mainlp.bsky.social, LMU Munich
Co-director CIS @cislmu.bsky.social
Visiting Prof ITU Copenhagen @itu.dk
ELLIS Fellow @ellis.eu
Vice-President ACL
PI MCML @munichcenterml.bsky.social
ELLIS PhD student at University of Valencia and MPI-BGC. ๐ช๐ธ๐ช๐บ๐ฉ๐ช
Passionate language learner and strong supporter of brillant coffee. ๐ฆโ
ELLIS PhD Student in Machine Learning at DTU Copenhagen and Helmholtz AI Munich (formerly Tรผbingen AI Center, Google AR).
Interested in generative modeling, computer vision, and more :)
NLP ELLIS PhD student at University of Copenhagen & Pioneer centre for AI. @belongielab.org
PhD candidate @amlab.bsky.social @ellis.eu
Probabilistic Machine Learning | Sequence Models
PhD student in NLP at Cambridge | ELLIS PhD student
https://lucasresck.github.io/
โขPhD student @ https://www.ucl.ac.uk/gatsby ๐ง ๐ป
โขMasters Theoretical Physics UoM|UCLA๐ช
โขIntern @zuckermanbrain.bsky.social|
@SapienzaRoma | @CERN | @EPFL
https://linktr.ee/Clementine_Domine
(machine) learning @ MPI BGC
โจ https://vitusbenson.github.io/
Assistant Professor for 3D Computer Vision at University of Amsterdam.
3D Human-centric Perception & Synthesis: bodies, hands, objects.
Past: MPI for Intelligent Systems, Univ. of Bonn, Aristotle Univ. of Thessaloniki
Website: https://dtzionas.com
ELLIS PhD student in Computational Biology @Theis lab (Helmholtz Munich)
Secular Bayesian.
Professor of Machine Learning at Cambridge Computer Lab
Talent aficionado at http://airetreat.org
Alum of Twitter, Magic Pony and Balderton Capital