Organized by: Junyu Xie, Ridouane Ghermi, @tengda.bsky.social, Max Bain, Arsha Nagrani, @vickykalogeiton.bsky.social, @gulvarol.bsky.social, Weidi Xie, Ivan Laptev and Andrew Zisserman.
See you in Hawaii! πΊ
@tengda.bsky.social
Researcher at Google DeepMind. Computer vision and machine learning.
Organized by: Junyu Xie, Ridouane Ghermi, @tengda.bsky.social, Max Bain, Arsha Nagrani, @vickykalogeiton.bsky.social, @gulvarol.bsky.social, Weidi Xie, Ivan Laptev and Andrew Zisserman.
See you in Hawaii! πΊ
As a part of the workshop, we have a MovieQA competition based on the SF20K dataset and hosted on HuggingFace @hf.co
Main Track: huggingface.co/spaces/SLoMO...
Plus, we have a special track for small models (< 8B)! huggingface.co/spaces/SLoMO...
Weβre excited to have a fantastic lineup of speakers:
@amypavel.bsky.social, Anna Rohrbach, Mike Zheng Shou, Makarand Tapaswi. Weβll also host a panel discussion with the organizers!
Movies are more than just video clips, they are stories! π¬
Weβre hosting the 1st SLoMO Workshop at #ICCV2025 to discuss Story-Level Movie Understanding & Audio Descriptions!
Website: slomo-workshop.github.io
Competition: huggingface.co/spaces/SLoMO...
Thank @dimadamen.bsky.social for presenting our Orthogonal Optimizer! Itβs a simple modification on standard optimizers for streaming video learning. We have code available at sites.google.com/view/orthogo...
14.06.2025 20:10 β π 5 π 1 π¬ 0 π 0Check out our CVPR 2025 paper: arxiv.org/abs/2504.01961. Work with Dilara Gokay, Joseph Heyward, Chuhan Zhang, Daniel Zoran, Viorica PΔtrΔucean, JoΓ£o Carreira, Dima Damen and Andrew Zisserman, from Google DeepMind
09.04.2025 14:20 β π 2 π 0 π¬ 0 π 1Humans learn from one continuous visual stream, but large video models have to be trained on billions of web videos.
We found that learning from such sequential streams is challenging for video modelsβand we introduce a family of "orthogonal optimizers" to bridge the gap!
It's interesting to see that visual counting remains to be quite challenging for generalist AI models. But this specialist model counts very well. Nice work from @nikigoliai.bsky.social last year!
17.03.2025 17:01 β π 1 π 0 π¬ 0 π 0We are looking for a student researcher to work on video understanding plus 3D, in Google DeepMind London. DM/Email me or pass it to someone if you feel it may be a good fit!
05.03.2025 20:43 β π 20 π 6 π¬ 0 π 0How do you know he is not π€π
25.01.2025 14:14 β π 0 π 0 π¬ 1 π 0From an award candidate... to best paper #ACCV2024
Glad to share that "It's Just Another Day" received the top award at the conference.
@bristoluni.bsky.social @ox.ac.uk
This paper is worth reading :-) based on the reviewers, AC and awards committee. We thank them for their time and effort.