Congrats! Looks like time is a big failure case for these models (cc @neuralnoise.com @aryopg.bsky.social @rohit-saxena.bsky.social )
bsky.app/profile/emil...
17.05.2025 07:07 β π 3 π 2 π¬ 1 π 0
Work done with @neuralnoise.com Frank Keller
10.03.2025 14:19 β π 1 π 0 π¬ 0 π 0
We tested state-of-the-art multimodal LLMs on this challenging taskβand they struggled! π€π
We also propose a new method:
π₯SEGMENT & SUMMARIZE, a training-free approach that outperforms existing models by:
πΉ Segmenting the poster into logical regions
πΉ Performing local & global summarization
10.03.2025 14:19 β π 0 π 0 π¬ 1 π 0
π PosterSum features 16,305 poster-abstract pairs from major ML conferences.
Task: Summarize a research poster image into a concise abstract summary.
10.03.2025 14:19 β π 1 π 0 π¬ 1 π 0
Can multimodal LLMs truly understand research poster images?π
π We introduce PosterSumβa new multimodal benchmark for scientific poster summarization!
π Dataset: huggingface.co/datasets/rohitsaxena/PosterSum
π Paper: arxiv.org/abs/2502.17540
10.03.2025 14:19 β π 8 π 4 π¬ 1 π 0
πββοΈ
20.11.2024 17:19 β π 1 π 0 π¬ 0 π 0
I'd love to be added!
Thanks
20.11.2024 12:15 β π 1 π 0 π¬ 1 π 0
Would love to be added!
20.11.2024 12:08 β π 0 π 0 π¬ 0 π 0
Hello, can you please add me? Thanks
20.11.2024 11:59 β π 1 π 0 π¬ 0 π 0
I'd love to be added!
Thanks
20.11.2024 11:48 β π 0 π 0 π¬ 1 π 0
PhD student @ EdinburghNLP | undergrad+masters @ Georgia Tech
the youngest boomer β’ language models β© knowledge graphs β’ phd cand @UtrechtUniversityβ’ msc in artificial intelligence @KULeuven
https://duyguislakoglu.github.io
Assistant Professor @Mila-Quebec.bsky.social
Co-Director @McGill-NLP.bsky.social
Researcher @ServiceNow.bsky.social
Alumni: @StanfordNLP.bsky.social, EdinburghNLP
Natural Language Processor #NLProc
Professor of Computer Science at UT Austin and Visiting Researcher at Google Deepmind, London. Automated Reasoning + Machine Learning + Formal Methods. https://www.cs.utexas.edu/~swarat
I lead Cohere For AI. Formerly Research
Google Brain. ML Efficiency, LLMs,
@trustworthy_ml.
Assistant Prof @sbucompsc @stonybrooku.
Researcher β @SFResearch
Ph.D. β @ColumbiaCompSci
Human Centered AI / Future of Work / AI & Creativity
Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare
Research Intern @Adobe | PhD at @ApgAsu @ASU | Vision & Language | T2I Diffusion Modeling
maitreyapatel.com
Professor at UW; Researcher at Meta. LMs, NLP, ML. PNW life.
CS PhD @UMassAmherst | Working on Robustness, NLP & Healthcare | Prev. @mckinsey @ShivNadarUniv | Side Quest: Dj & Deadlift | Opinions: Personal
Open-Source Interpretability Toolkit for Generative Language Models π π
https://github.com/inseq-team/inseq
Geometry processor, discrete differentiator of geometry, mesher, directional fielder, finite elementor, and reconstructor. Reader (Associate professor) @ School of Informatics @ University of Edinburgh
human being | assoc prof in #ML #AI #Edinburgh | PI of #APRIL | #reliable #probabilistic #models #tractable #generative #neuro #symbolic | heretical empiricist | he/him
π https://april-tools.github.io
Generative AI @Noah's Ark Lab, Huawei & @TuringInstitute | PhD candidate in Biomedical AI @ University of Edinburgh | Efficient Fine-Tuning in Medical AI, Diffusion Models, Autoregressive Image Generation
Assistant professor in Natural Language Processing at the University of Edinburgh and visiting professor at NVIDIA | A Kleene star shines on the hour of our meeting.
The Conference of the European Chapter of the Association for Computational Linguistics
Next event: Rabat, Morocco, March 24-29, 2026
Hashtags: #EACL2026 #NLProc
Master student at ENS Paris-Saclay / aspiring AI safety researcher / improviser
Prev research intern @ EPFL w/ wendlerc.bsky.social and Robert West
MATS Winter 7.0 Scholar w/ neelnanda.bsky.social
https://butanium.github.io
Interpretable Deep Networks. http://baulab.info/ @davidbau
https://mega002.github.io
AI Safety Research // Software Engineering