Given N image generation jobs, can we do better than N calls to a text-to-image model? @daledecatur.bsky.social proposes sharing compute across a batch of jobs, achieving higher efficiency at similar quality.
Check out our #ICCV2025 poster #153 today during Poster Session #4 from 2:45-4:45 HST!
22.10.2025 20:42
22.05.2025 02:38
Likewise, I am a big fan!
19.05.2025 20:08
Man-made objects are often repeated in urban scenes. Can we leverage these repetitions to improve 3D reconstruction? Exploration led by the titan Nicolas Violante Grezzi 🧵
13.05.2025 15:38
Great opportunity! This is a dream team, and they are located 20 minutes from Paris.
25.04.2025 16:56
OpenAI Ghibli style + the new FramePack (ControlNet team). I am very impressed by this model, and it was super easy to run. Is it a commoditization moment for video GenAI?
18.04.2025 00:05
Would anyone know the best current code for human keypoint estimation from a video of a single human?
05.03.2025 01:58
Proposal: Reviewers who have not given any sign-of-life to the AC get an automatic flag on the rebuttal of the papers they submitted, to be considered at the discretion of the reviewers of those papers.
17.01.2025 00:28
Best of 2024?
Movies: Perfect Days (runner-up: Anora)
Series: Three-Body Problem
Animated series: Arcane
Research paper: DUSt3R
Manga: Oshi no Ko
What about you?
28.12.2024 12:07
From a few user clicks to 3D material segmentation, in seconds. It's exciting to see so many pieces in 3D generation and analysis starting to work reliably and fast! Super nice work from Michael and team (mfischer-ucl.github.io)
10.12.2024 01:09
This work was led by Amir Barda, in collaboration with Matheus @gadelha.bsky.social, Noam Aigerman @noamiko.bsky.social, Vova Kim @vovakim.bsky.social and Amit Bermano.
Check out our paper for more details: arxiv.org/abs/2412.00518
7/end
04.12.2024 01:55
In the end, with the editing tool becoming FAST, 3D editing becomes really FUN to play with! 6/
04.12.2024 01:49
Do we teach inpainting to a multiview backbone? Or do we teach multiview to an inpainting backbone? We show that the latter is much better: multiview is easier to learn than inpainting. 5/
04.12.2024 01:49
Now, all we need is a multiview inpainting model. How do we train one? Data is always king. We know inpainting masks can't be random; they need to be realistic and close to what users would do. We propose 3 strategies, in 3D, to create Objaverse masks that closely resemble what a user would do. 4/
04.12.2024 01:49
However, SDS remains slow and brittle. Instead, we propose to cast the problem of 3D inpainting as 2D *multiview* inpainting. This is possible thanks to off-the-shelf pre-trained transformer models (LRMs), which reconstruct multiview images back into meshes, Gsplats, and NeRFs. Great! 3/
04.12.2024 01:49
There have been previous attempts to tackle generative mesh editing. Check out Amir Barda's talk on MagicClay, which uses SDS, this Thursday at SIGGRAPH Asia in Japan 🇯🇵. 2/
04.12.2024 01:49
Text-to-3D is awesome! But how do we iterate on the generated 3D model to get just the right result? Do we tweak the prompt endlessly? Revert to traditional 3D modeling techniques?
We propose a solution to "3D inpainting" 🤩
Project: amirbarda.github.io/Instant3dit....
A thread. 🧵 1/
04.12.2024 01:49
Assistant Professor @uchicago @uchicagocs. PhD from @TelAvivUni. Interested in computer graphics, machine learning, & computer vision
CS PhD student @ UChicago
https://ddecatur.github.io/
PhD student @ LIX | BX 21 | MVA 23
Starting Researcher in Visual Computing | GraphDeco Team, Inria
PhD Student at GraphDeco Inria
Computer Vision team of LIGM/A3SI @EcoledesPonts ParisTech (ENPC)
Research faculty @ImagineENPC. https://gulvarol.github.io/
Senior researcher at IMAGINE (ENPC, LIGM).
Machine learning & computer vision for 3D + geospatial + historical data.
loiclandrieu.com
2nd Year PhD Student from Imagine-ENPC/IGN/CNES
Working on Self-supervised Cross-modal Geospatial Learning.
Personal WebPage: https://gastruc.github.io/
27 🇫🇷 Researcher and AI Coordinator at @ignfrance.bsky.social. Interested in unsupervised learning, remote sensing, satellite imagery and scene understanding.
🏳️‍🌈 he/him
PhD student at IMAGINE (ENPC) and GeoVic (Ecole Polytechnique). Working on image generation.
http://nicolas-dufour.github.io
PhD at Imagine (ENPC) and Willow (Inria) under the supervision of @gulvarol.bsky.social and Cordelia Schmid.
Telecommunication Engineer from UPC.
Postdoc in Digital Narratives @ P1 in Copenhagen; PhD in Computer Vision; conceptual artist; tortured-philosopher; ex-poet
PhD student in computer vision at Imagine, ENPC - @imagineenpc.bsky.social
I'm interested in 3D Reconstruction, Radiance Fields, Gaussian splatting, 3D Scene Rendering, 3D Scene Understanding, etc.
Webpage: https://anttwo.github.io/
PhD student in computer vision at Imagine, ENPC and EFEO.
PhD Student at IMAGINE (ENPC)
Working on camera pose estimation
thibautloiseau.github.io
I add noise to images and make neural networks remove them!
Postdoc at @ImagineEnpc Research in Computer Vision | Ph.D at @IMTAtlantique
Research: Text Recognition (OCR, HTR, Chinese HTR)
PhD student in computer vision at Imagine, ENPC
Student Intern ENPC, Paris | Undergraduate Researcher at CVIT, IIIT Hyderabad