Vincent Tao Hu's Avatar

Vincent Tao Hu

@vtaohu.bsky.social

LMU postdoc from Ommer-Lab, MCML junior member. UvA PhD, PKU

936 Followers  |  173 Following  |  3 Posts  |  Joined: 19.11.2024  |  1.6373

Latest posts by vtaohu.bsky.social on Bluesky

Post image

Our work received an invited talk at the Imageomics-AAAI-25 workshop of #AAAI25. @vtaohu.bsky.social will be representing us there. Without me being there, I still would like to share our poster with you :D

We also have another oral presentation for DepthFM on March 1, 2:30 pm-3:45 pm.

28.02.2025 17:03 — 👍 3    🔁 1    💬 0    📌 0
Post image

typos

01.03.2025 00:42 — 👍 1    🔁 0    💬 1    📌 0
Our method pipeline

Our method pipeline

🤔When combining Vision-language models (VLMs) with Large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks?

🤨Interested? Check out our latest work at #AAAI25:

💻Code and 📝Paper at: github.com/CompVis/DisCLIP

🧵👇

08.01.2025 15:54 — 👍 15    🔁 8    💬 1    📌 0

Did you know you can distill the capabilities of a large diffusion model into a small ViT? ⚗️
We showed exactly that for a fundamental task:
semantic correspondence📍

A thread 🧵👇

06.12.2024 14:35 — 👍 4    🔁 2    💬 1    📌 2

Your Diffusion Model is secretly an implicit timestep model, no matter discrete or continuous~

04.12.2024 23:42 — 👍 6    🔁 0    💬 0    📌 0

👍

27.11.2024 11:11 — 👍 1    🔁 0    💬 0    📌 0
Video thumbnail

Introducing “MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM”! We do SLAM with novel view synthesis capabilities on multiple simultaneously operating agents!

vladimiryugay.github.io/magic_slam/i...
1/7

27.11.2024 05:34 — 👍 51    🔁 17    💬 3    📌 1

@vtaohu is following 20 prominent accounts