Vladimir Yugay's Avatar

Vladimir Yugay

@vyuga3d.bsky.social

Doing research in 3D Computer Vision. Ph.D. student at the University of Amsterdam. Previously at TUM. https://vladimiryugay.github.io/

543 Followers  |  54 Following  |  32 Posts  |  Joined: 18.11.2024  |  2.6204

Latest posts by vyuga3d.bsky.social on Bluesky

Will you release the slides?πŸ‘€ They're superb

24.06.2025 19:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I will be presenting our previous work at CVPR Nashville. Drop by if you want to chat!

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

This work was conducted in collaboration wit Kersten Thies, @lucacarlone.bsky.social , Theo Gevers, @martin-r-oswald.bsky.social , and Lukas Schmid at the Computer Vision Group of the University of Amsterdam and the SPARKLab of @mit.edu

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We evaluate our method on synthetic and real-world datasets that undergo significant changes, including the movement, removal, and addition of large pieces of furniture, cutlery, a coffee machine, and pictures on the walls

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

GaME detects scene changes and directly manipulates the 3D Gaussians to keep the map up to date. Additionally, our keyframe management system identifies and eliminates pixels that observe stale geometry, thereby minimizing the amount of discarded information

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We found two main problems. First, the 3D Gaussian maps can not easily β€œoptimize out” changes in the geometry on the fly. Second, frames observing the old state of the scene contaminate the optimization process, resulting in visual artifacts and inconsistencies

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Imagine you want ot create a 3DGS map of your apartment. You reconstructed your kitchen and continued to the bedroom. While you are in the bedroom, someone has moved the chair and added a table in the kitchen without telling you. That’s what can happen with your reconstructionπŸ‘‡

10.06.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Introducing β€œGaussian Mapping of Evolving Scenes”! We present an RGBD mapping system with novel view synthesis capabilities that accurately reconstruct scenes that change over time
vladimiryugay.github.io/game/

10.06.2025 12:06 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Post image

Resubmission mentality in marathons

Munich 2023 -> 8 months prep -> COVID -> ❌

Amsterdam 2024 -> 6 months prep -> COVID -> ❌

Leiden 2025 -> 6 months prep -> lfg βœ…

11.05.2025 14:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ”Ή@rerun.io visualisation script for easy debugging, analysis, and replaying of reconstruction results with minimal effort

19.03.2025 18:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Live coding Graph SLAM in Python (Part 1)
YouTube video by Jeff Irion Live coding Graph SLAM in Python (Part 1)

πŸ”ΉFully Pythonic pose graph optimisation module. The core library live coding by the author is tremendously enlightening www.youtube.com/watch?v=yXWk...

19.03.2025 18:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ”ΉPlace recognition module based on a large vision model - no more annoying dependency chains for DBoVW or NetVLAD

19.03.2025 18:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ”ΉSimple yet efficient mechanism for correcting and merging multiple 3D Gaussian Splatting maps into a global map

19.03.2025 18:47 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

⏩Code release for MAGiC-SLAM!
github.com/VladimirYuga...

We vibe-coded hard to make the code as simple as possible. Here are some features you can seamlessly integrate into your 3D reconstruction pipeline right away:

19.03.2025 18:47 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

πŸ”ΉDinoV2-based place recognition module - no more annoying dependency chains of DBoVW or NetVLAD

19.03.2025 18:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ”ΉA simple yet efficient mechanism for correcting and merging multiple 3D Gaussian Splatting sub-maps into a global map

19.03.2025 18:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Fantastic work! Can't wait to try it out!

26.02.2025 08:56 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It feels like a tighter bubble on bsky. It also seems that the more people are aligned, the less they engage

08.02.2025 15:19 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
MonST3R: A Simple Approach for Estimating Geometry in the Presense of Motion

Ye ye. Or monst3r-project.github.io. One can use them as a prior for dynamic envs just like mast3r for static ones

18.12.2024 10:23 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

There's so much progress in there partially bc *3r and splats are inexpensive. GPU poor can iterate fast :)

18.12.2024 10:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Probably more methods for dynamic environments. Smth monst3r-like

18.12.2024 10:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Last year splats, this year *3r

18.12.2024 10:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

This work was done with amazing collaborators Theo Gevers and @martin-r-oswald.bsky.social at the Computer Vision Group of the University of Amsterdam.
7/7

27.11.2024 05:34 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Finally, we extend evaluation to novel view synthesis on real-world datasets. By extracting sequences from the ego-centric Aria dataset to simulate multi-agent operations, we prepared a hold-out test with novel view trajectories, ensuring a comprehensive evaluation of our system's capabilities.
6/7

27.11.2024 05:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Our sub-maps inherently support local pose corrections provided by the loop closure module. Combined with an efficient caching scheme and a two-stage merging process, this allows for fast and precise global map reconstruction.
5/7

27.11.2024 05:34 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Inevitably, agents’ trajectories drift over the run. We tackle this by integrating a loop closure mechanism into our SLAM system. Additionally, we experiment with foundational vision model features for loop detection, with promising results in our benchmarks.
4/7

27.11.2024 05:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Scaling SLAM systems requires a careful balance between computational resources and speed. In our approach, agents manage their local maps independently communicating with a centralized server. We achieve significant performance gains by using 3DGS sub-maps with efficient tracking and caching
3/7

27.11.2024 05:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

With the rise of AR/VR and an ever-growing number of gadgets, NVS-SLAM systems must scale up while achieving greater accuracy. A natural approach is to have multiple agents collaborate - proving that "the whole is greater than the sum of its parts."
But what challenges still stand in the way?
2/7

27.11.2024 05:34 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Introducing β€œMAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM”! We do SLAM with novel view synthesis capabilities on multiple simultaneously operating agents!

vladimiryugay.github.io/magic_slam/i...
1/7

27.11.2024 05:34 β€” πŸ‘ 51    πŸ” 17    πŸ’¬ 3    πŸ“Œ 1

Hey there! I'm working on 3d vision, can you please add me?

23.11.2024 22:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@vyuga3d is following 20 prominent accounts