Joseph Tung

@jtung.bsky.social

CS PhD Student @ NYU doing 3D computer vision https://jot-jt.github.io/

28 Followers | 72 Following | 1 Posts | Joined: 23.11.2024

Posts by Joseph Tung (@jtung.bsky.social)

Thanks Zhenjun for sharing!

25.04.2025 12:21 — 👍 1 🔁 0 💬 1 📌 0

Late to post, but excited to introduce CUT3R!

An online 3D reasoning framework for many 3D tasks directly from just RGB. For static or dynamic scenes. Video or image collections, all in one!

Project Page: cut3r.github.io
Code and Model: github.com/CUT3R/CUT3R

18.02.2025 17:03 — 👍 34 🔁 6 💬 2 📌 1

🤔Can Generative Video Models Help Pose Estimation?
✅Yes!
We find that generative video models can hallucinate plausible intermediate frames that provide useful context for pose estimators (e.g. DUSt3R), especially for images with little to no overlap.
🔗 inter-pose.github.io

23.12.2024 17:44 — 👍 15 🔁 4 💬 1 📌 1

Introducing 👀Stereo4D👀

A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories.

We used Stereo4D to make a dataset of over 100k real-world 4D scenes.

13.12.2024 03:13 — 👍 59 🔁 12 💬 2 📌 3

Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features

Yuanbo Xiangli, Ruojin Cai, Hanyu Chen, Jeffrey Byrne,
@snavely.bsky.social

tl;dr: new dataset (55K pairs) + Mast3r == PROFIT
arxiv.org/abs/2412.05826

10.12.2024 10:19 — 👍 17 🔁 5 💬 1 📌 0