Visual-spatial intelligence: we rely on it to perceive, interact with, and navigate our everyday spaces. To what extent do MLLMs possess it? Do they mirror how humans think and reason about space?
Presenting "Thinking in Space: How Multimodal Models See, Remember, and Recall Spaces"! [1/n]
23.12.2024 22:45
TL;DR 🧵
x.com/_ellisbrown/...
10.12.2024 17:31
Heading to #NeurIPS2024 to present Cambrian-1! Catch our oral presentation Friday @ 10am (Oral 5C) and our poster afterwards until 2pm (#3700 in East Hall A-C) 🪼
10.12.2024 17:31