We have released the Stereo4D dataset! Explore the real-world dynamic 3D tracks: github.com/Stereo4d/ste...
15.04.2025 19:59 β π 13 π 3 π¬ 0 π 0
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos
Use stereo videos from the internet to create a dataset of over 100,000 real-world 4D scenes with metric scale and long-term 3D motion trajectories.
See more scenes & details of how it works on our website: stereo4d.github.io
Paper link: arxiv.org/abs/2412.09621
Thanks to the great team! Richard Tucker, @zhengqili.bsky.social , David Fouhey @snavely.bsky.social , @holynski.bsky.social
Please stay tuned for updates on data & code.
13.12.2024 03:13 β π 3 π 1 π¬ 0 π 0
This type of data is ideal for learning the structure and dynamics of the real world.
We gave this a shot β by extending DUSt3R to model 3D motion, and training on our dataset. Given a pair of frames, our model predicts a 3D point cloud, and corresponding 3D motion trajectories.
13.12.2024 03:13 β π 4 π 0 π¬ 1 π 0
Introducing πStereo4Dπ
A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories.
We used Stereo4D to make a dataset of over 100k real-world 4D scenes.
13.12.2024 03:13 β π 59 π 12 π¬ 2 π 3
A fast and accurate method to get camera poses, focal length, and consistent depth map from dynamic casual videos. Checkout this amazing work led by @zhengqili.bsky.social
06.12.2024 18:50 β π 2 π 0 π¬ 0 π 0
Assistant Professor at UC Berkeley
Researcher in generative 3D AI @ Google. PhD from UCL.
CS PhD student at @Cornell
Computer vision, graphics and Machine Learning
gemmechu.github.io
PhD Student at Princeton University https://araistrick.com
Computer Vision with Procedural Graphics Data infinigen.org
Research Scientist in Computer Vision and Generative AI
PhD student at Cornell, interested in 3D generation, reconstruction; prev Princeton '22
https://genechou.com
Student Researcher @ RAI Institute, MSc CS Student @ ETH Zurich
visual computing, 3D vision, spatial AI, machine learning, robot perception.
πZurich, Switzerland
PhD student at Dyson Robotics Lab, Imperial College London
http://edexheim.github.io
PhD Student at Cornell CS
https://www.cs.cornell.edu/~ruojin/
Staff Research Scientist at Google - http://sniklaus.com/
Niantic Spatial, Research.
Throws machine learning at traditional computer vision pipelines to see what sticks. Differentiates the non-differentiable.
πEurope π http://ebrach.github.io
CS PhD Student @ NYU doing 3D computer vision
https://jot-jt.github.io/
Incoming Assistant Professor at Johns Hopkins University | RAP at Toyota Technological Institute at Chicago | web: https://anandbhattad.github.io/ | Knowledge in Generative Image Models, Intrinsic Images, Image-based Relighting, Inverse Graphics
UC Berkeley + Google DeepMind
holynski.org
3D vision fanatic
http://snavely.io
official Bluesky account (check usernameπ)
Bugs, feature requests, feedback: support@bsky.app