🌺 Just 4 days to go!
Join us in Honolulu for the Instance-Level Recognition and Generation Workshop at #ICCV2025
Oct 19, 8:30am–12:30pm, Room 306 A
We'll have amazing keynotes, plus oral and poster sessions featuring accepted and invited papers.
Don't miss it!
ilr-workshop.github.io/ICCVW2025/
15.10.2025 15:54 · 👍 9 🔁 5 💬 1 📌 1
ReSWD - a Hugging Face Space by stabilityai
Create images using color matching and guidance features. Upload your reference images and get generated images that match the colors and styles.
Thanks to my co-authors Andreas Engelhardt, Simon Donné, Varun Jampani.
Also check out the HF demo huggingface.co/spaces/stabi..., the code github.com/Stability-AI..., and the explainer youtu.be/ckcSgf0s-jI
02.10.2025 12:42 · 👍 0 🔁 0 💬 1 📌 0
This can be used for multiple applications such as color matching or diffusion guidance. Here, we showcase the diffusion process of generating a medieval house with the reference to the right.
02.10.2025 12:42 · 👍 0 🔁 0 💬 1 📌 0
Variance in MC is quite common in computer graphics so we combined ReSTIR -- more precisely the weighted reservoir sampling -- with SWD to keep more impactful random directions in the optimization.
02.10.2025 12:42 · 👍 0 🔁 0 💬 1 📌 0
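To make the weighted reservoir sampling mentioned in the post above a bit more concrete, here is a minimal sketch of the single-sample streaming variant. It is only an illustration and not the ReSWD code: the function name `reservoir_pick` and the idea of feeding it `(item, weight)` pairs are assumptions for this example, and how ReSWD actually scores its random directions is not stated in the post.

```python
import random


def reservoir_pick(candidates, rng=None):
    """Keep one item from a stream of (item, weight) pairs.

    Each item ends up selected with probability proportional to its
    weight, while only the running weight sum and one item are stored.
    """
    rng = rng or random.Random()
    chosen, w_sum = None, 0.0
    for item, weight in candidates:
        if weight <= 0.0:
            continue  # zero-weight candidates can never be selected
        w_sum += weight
        # Replace the kept item with probability weight / w_sum.
        if rng.random() < weight / w_sum:
            chosen = item
    return chosen, w_sum


if __name__ == "__main__":
    # Hypothetical candidates: direction ids with made-up importance weights.
    stream = [("dir_0", 0.1), ("dir_1", 2.5), ("dir_2", 0.4)]
    print(reservoir_pick(stream, random.Random(0)))
```

The appeal of this scheme is that candidates can arrive one at a time, so higher-weight candidates are kept more often without storing the whole stream.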
Happy to announce: ReSWD. Sliced Wasserstein Distances are quite powerful, but they perform a Monte Carlo (MC) integration (over random directions). During an optimization this can lead to noisy gradients due to variance.
Project page: reservoirswd.github.io
02.10.2025 12:42 · 👍 8 🔁 5 💬 1 📌 0
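For readers unfamiliar with the Monte Carlo integration mentioned in the post above, here is a minimal NumPy sketch of a sliced Wasserstein distance between two equally sized point sets. It is a generic textbook-style estimator, not the ReSWD implementation; the name `sliced_wasserstein` and the parameter `n_dirs` are chosen for this example only.

```python
import numpy as np


def sliced_wasserstein(x, y, n_dirs=64, rng=None):
    """Monte Carlo estimate of the squared sliced Wasserstein-2 distance.

    x, y: arrays of shape (n, d) with the same number of points n.
    n_dirs: number of random projection directions (the MC samples).
    rng: seed or numpy Generator, for reproducibility.
    """
    rng = np.random.default_rng(rng)
    d = x.shape[1]
    # Uniform random directions on the unit sphere.
    dirs = rng.normal(size=(n_dirs, d))
    dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
    # In 1D, optimal transport between equal-size sets pairs sorted samples.
    px = np.sort(x @ dirs.T, axis=0)
    py = np.sort(y @ dirs.T, axis=0)
    # Average over points and over directions.
    return float(np.mean((px - py) ** 2))


if __name__ == "__main__":
    pts = np.random.default_rng(0).normal(size=(200, 3))
    shifted = pts + np.array([2.0, 0.0, 0.0])
    # Few directions: the estimate changes noticeably from seed to seed.
    print([round(sliced_wasserstein(pts, shifted, n_dirs=2, rng=s), 3) for s in range(5)])
    # Many directions: the estimate is much more stable.
    print([round(sliced_wasserstein(pts, shifted, n_dirs=512, rng=s), 3) for s in range(5)])
```

Running the two print lines side by side shows the variance issue the post refers to: with few random directions the value (and therefore any gradient derived from it) is noisy.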
AI for Content Creation Workshop
I'll also be sharing these and other works at the AI4CC workshop on the 12th at 11:00. ai4cc.net
11.06.2025 20:35 · 👍 0 🔁 0 💬 0 📌 0
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
SV3D generates novel multi-view synthesis from a single input image.
There was SV3D where we mostly discarded any SDS loss (only for unseen areas). I was mainly working on the 3D part and it required quite a few tricks to make it work. sv3d.github.io
11.06.2025 02:14 · 👍 2 🔁 0 💬 0 📌 0
Material Editing in CLIP Space
3️⃣ MARBLE: Edit materials effortlessly using simple CLIP feature manipulation, supporting exemplar-based interpolation or parametric edits across various styles. Check it out: marblecontrol.github.io
10.06.2025 20:51 · 👍 0 🔁 0 💬 1 📌 0
SPAR3D
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
2️⃣ SPAR3D (follow-up to SF3D): Integrates a fast point diffusion module, enhancing depth, backside modeling, and enabling easier editing. Project page: spar3d.github.io
10.06.2025 20:51 · 👍 0 🔁 0 💬 1 📌 0
I'm at CVPR this week! Looking forward to connecting and discussing all things graphics, 3D, and gen AI. I'll be presenting 3 papers. Stop by and chat!
10.06.2025 20:51 · 👍 0 🔁 0 💬 1 📌 0
Stable Point-Aware 3D - a Hugging Face Space by stabilityai
Discover amazing ML apps made by the community
Check out the HF demo to test the model: huggingface.co/spaces/stabi.... The model (huggingface.co/stabilityai/...) is also available with code and Comfy Nodes github.com/Stability-AI.... We also have a project page available at spar3d.github.io
08.01.2025 19:58 · 👍 9 🔁 0 💬 0 📌 0
An image showcasing editing of the point cloud representation to add a cup to a mug or a tail to a plush toy.
One neat implication is that we can edit the point cloud to fix missing features or wrong scaling. We even created a small gradio component for simple edits in the demo (pypi.org/project/grad...)
08.01.2025 19:58 · 👍 9 🔁 0 💬 1 📌 0
Introducing Stable Point Aware 3D: Real-Time Editing and Complete Object Structure Generation - Stability AI
Stable Point Aware 3D (SPAR3D) introduces real-time editing and complete structure generation of a 3D object from a single image in less than a second.
Happy to announce SPAR3D! A fast (<1s) single-image-to-3D reconstruction model that combines the best of diffusion and regression models: a point diffusion module quickly generates an initial point cloud, which aids 3D understanding for the mesh estimation.
stability.ai/news/stable-...
08.01.2025 19:58 · 👍 18 🔁 0 💬 1 📌 1
A single procedural modeling system is a huge undertaking when you aim for a high quality level. Take SpeedTree for example, which combines procedural aspects with hand-authored elements, and it's an entire company dedicated to that.
16.12.2024 07:33 · 👍 0 🔁 0 💬 0 📌 0
Yes, I agree, for certain things it can work. Simple cities (Manhattan style) and natural landscapes fit rather well and are already explored heavily in video games. Going for interiors or any object is another beast.
16.12.2024 07:32 · 👍 0 🔁 0 💬 2 📌 0
Realistic rendering is not the problem; even full path tracing is doable for room-scale scenes on GPU. It still requires some denoising tho, as otherwise rendering times are too long to generate any meaningful amount of data. But even then, data is the bottleneck.
16.12.2024 06:38 · 👍 0 🔁 0 💬 1 📌 0
It's hard to scale 3D data similarly to image or video. We run around with capable cameras all the time. Only a few people can model 3D, and it takes time and isn't offered for free (rightfully). So even if we paid all the artists in the world, we still wouldn't hit the scale of image and video.
16.12.2024 06:34 · 👍 0 🔁 0 💬 3 📌 0
I recently went with recreating the rooms in Blender. A lot of furniture websites now have 3D viewers, and you can download the models via the browser devtools. They are also metric sized. Then Blender becomes Sims Pro and you can iterate quite fast.
08.12.2024 07:04 · 👍 0 🔁 0 💬 0 📌 0
Would love to be added too ;)
28.11.2024 23:29 · 👍 1 🔁 0 💬 0 📌 0
Looking at ICLR submissions with the lowest score - What a work of art! 🧵
25.11.2024 17:52 · 👍 173 🔁 19 💬 5 📌 9
I used emojis to maximize Twitter/Bluesky parity in my profile. This is definitely pointless, but it's fun.
22.11.2024 16:44 · 👍 17 🔁 3 💬 5 📌 1
bsky.app/profile/cspr... markboss.me/publication/... :D
21.11.2024 19:38 · 👍 0 🔁 0 💬 1 📌 0
I wasn't aware that ads are not that bad as long as they are of good quality and diverse. Now I know.
21.11.2024 08:04 · 👍 0 🔁 0 💬 0 📌 0
Hi Kosta :). Can you also add me?
20.11.2024 12:47 · 👍 1 🔁 0 💬 0 📌 0
I had this account lying around for quite some time. It seems 🦋 is starting to take off. It's great to see many scientists here - and no weird gadget ads in between
20.11.2024 06:45 · 👍 7 🔁 0 💬 1 📌 0
Programmer. Worked on Unity game engine 2006-2021. Now working on Blender. Primarily over at mastodon.gamedev.place/@aras
Professor at @UniFAU, and at UCL. Visual computing, inverse rendering, point-based graphics, computational fabrication, digital humanities - Digital Reality.
Building a new renderer at HypeHype. Former principal engineer at Unity and Ubisoft. Opinions are my own.
Graphics researcher at TU Delft. Formerly Intel, KIT, NVIDIA, Uni Bonn. Known for moment shadow maps, MBOIT, blue noise, spectra, light sampling. Opinions are my own.
https://MomentsInGraphics.de
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Wearing all the hats I possibly can at the Frostbite Rendering team
Math & Art Videos.
* https://youtube.com/Inigo_Quilez
* https://iquilezles.org
Created Shadertoy, Pixar's Wondermoss, Quill, and more.
Trending papers in Vision and Graphics on www.scholar-inbox.com.
Scholar Inbox is a personal paper recommender which keeps you up-to-date with the most relevant progress in your field. Follow us and never miss a beat again!
Applied math and Geometry lover, discrete geometer wannabe. PhD Student at LIRIS.
he/him
Computer Graphics PhD student at TU Berlin.
(differentiable) rendering, inverse graphics, GPGPU
mworchel.github.io
Activision, previously at Unity, Bungie, AMD/ATI
all opinions my own.
Real-Time Rendering Enthusiast. But let's be honest - all rendering. and some ML.
Rendering at Respawn Entertainment. Previously Luxology, The Foundry, Google. Enthusiast landscape photographer (andrewhelmer.com/photography). All views my own. He/him.
Associate Professor, University of Utah
Founder, Cyber Radiance
http://www.cemyuksel.com
Former ML+3D Engineer @ Stability AI
Ex. AMD Research Engineer, RT & Neural Rendering
2021 Graduate, Computer Graphics Group @ University of Tokyo.
https://aaryaman.net
Professor of Computer Vision and AI at TU Munich, Director of the Munich Center for Machine Learning mcml.ai and of ELLIS Munich ellismunich.ai
cvg.cit.tum.de
Computer graphics, systems programming, general methods.
Staff graphics engineer at @unity.com. Formerly Playdead, NVIDIA, Microsoft.
apoorvaj.io
📍Copenhagen
Professor of Computer Vision/Machine Learning at Imagine/LIGM, École nationale des Ponts et Chaussées @ecoledesponts.bsky.social Music & overall happiness 🌳🪻 Born well below 350ppm. Mostly silly personal views
📍Paris https://davidpicard.github.io/