Slava Elizarov's Avatar

Slava Elizarov

@selizarov.bsky.social

Senior research scientist at Unity | Generative models, Computer Graphics

262 Followers  |  857 Following  |  15 Posts  |  Joined: 12.02.2024
Posts Following

Posts by Slava Elizarov (@selizarov.bsky.social)

Thrilled to share that Geometry Image Diffusion has been accepted to #ICLR2025! πŸš€

Paper: openreview.net/forum?id=Glm...

23.01.2025 21:14 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Object Images 64x An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion

P.P.S. We recommend you check out Omages (omages.github.io), an awesome concurrent work that also explores geometry images (called "Omages") for 3D generation. We believe GIMs have a bright future in deep learning β€” let’s bring it forward together πŸš€

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

P.S.
Thanks to Simon DonnΓ©, @ciararowles.bsky.social, Shimon Vainer, Dante De Nigris, and Konstantin Kutsy for being such an awesome team!
Additional thanks to Alexander Demidko and Dr. Lev Melnikovsky from the Weizmann Institute for all the insightful discussions we had during this project

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation Generating high-quality 3D objects from textual descriptions remains a challenging problem due to computational cost, the scarcity of 3D data, and complex 3D representations. We introduce Geometry Ima...

So whether you’re looking for speed, flexibility, or eco-friendly workflows, Geometry Image Diffusion has you covered. Got curious? Dive into our paper to learn more!
Paper: arxiv.org/abs/2409.03718
Code: coming soon
Site: unity-research.github.io/Geometry-Ima...
(10/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

And we’re not just saving forests. The assets you generate with Geometry Image Diffusion are free from baked-in lighting. Re-light them in any environment to fit your scene and save some energy while you’re at it! πŸ’‘ (9/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

(But I must admit that it’s hard to resist generating thousand barrels because they’re all so different)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Why produce a thousand barrels? Let’s save the forest! 🌳 Just edit the one you’ve already generated (8/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Want an unexpected twist? The generated 3D objects come with meaningful, separable parts, making them easy to edit and manipulate (7/10)

06.12.2024 16:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Our assets can be easily triangulated by connecting neighboring pixels, and come unwrapped with textures included β€” no waste here ♻️ (6/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

(Prompts: Lovecraftian teacup with a tentacle instead of the handle; A steampunk airplane; An avocado-shaped chair)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Our model is trained on a 100k subset of Objaverse β€” smaller than what’s typically used for 3D generation. Yet, it generalizes well across a wide range of prompts (5/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

With a frozen Stable Diffusion model for textures and its trainable copy for geometry, the geometry model can tap into SD’s powerful natural image prior (4/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

At the heart of our method is Collaborative Control. It allows two models to work together β€” one for generating the geometry image and another for creating textures β€” all while sharing information to ensure everything lines up perfectly 🀝(3/10)

06.12.2024 16:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

The secret? We use geometry images, which are essentially 2D representations of 3D surfaces πŸ–ΌοΈ (think of GIMs as UV maps’ close cousins) This lets us recycle existing Text-to-Image models like Stable Diffusion, instead of building complex 3D architectures from scratch (2/10)

06.12.2024 16:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Does 3D generation always have to be either slow or complex and data-hungry?πŸ€” We don’t think so! With Geometry Image Diffusion, we’re all about reusing (and recycling ♻️) what already works β€” making it faster and easier by reducing complexity and data needs πŸš€(1/10)

06.12.2024 16:25 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1