The AI for Content Creation workshop is kicking off today at #CVPR2025 - Grand Ballroom A1 - @magrawala.bsky.social Kai Zhang (Adobe), Charles Herrmann (Google), Mark Boss (Stability AI), Yutong Bai (UC Berkeley), Cherry Zhao (Adobe), Ishan Misra (Meta) and @jonbarron.bsky.social ! See you soon!
12.06.2025 13:45 β π 1 π 2 π¬ 0 π 0
AI for Content Creation workshop @ #CVPR2025 - Grand Ballroom A1 - 4pm - panel on "Open Source in AI and the Creative Industry" - with @magrawala.bsky.social (Stanford), Cherry Zhao (Adobe), Ishan Misra (Meta) and @jonbarron.bsky.social (Google) - go go!
12.06.2025 18:56 β π 2 π 1 π¬ 0 π 0
[2/2] Work led by @avalovelace.bsky.social, @kangledeng.bsky.social, Ruixuan Liu, and CMU faculty Changliu Liu and Deva Ramanan. LegoGPT is a small first step towards generative manufacturing of physical objects. Current version is limited to 20x20x20, 21 object categories, and simple brick types.
10.05.2025 03:06 β π 4 π 0 π¬ 0 π 0
[1/2] We've released the code for LegoGPT. Our autoregressive model generates physically stable and buildable designs from text prompts by integrating physics laws and assembly constraints into LLM training and inference.
Code: github.com/AvaLovelace1...
Website: avalovelace1.github.io/LegoGPT/
10.05.2025 03:06 β π 70 π 23 π¬ 4 π 2
Reve Image is our first step towards world-class image generation β and you can experience it for free today π
(π)
26.03.2025 23:31 β π 6 π 4 π¬ 1 π 0
AI4CC 2025
The AI for Content Creation workshop #CVPR2025 is accepting paper submissions. ai4cc.net Deadline March 21st 2025 midnight PST. 4 page extended abstracts, 8 pagers, and previously published work (ECCV, NeurIPS, even CVPR)! Many topics π·πΉπ¬π²βοΈππΌοΈπππ’ - come spend the day with us!
14.03.2025 16:02 β π 9 π 5 π¬ 1 π 0
SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization
Can we generate a training dataset of the same object in different contexts for customization? Check out our work SynCD, which uses Objaverse assets and shared attention in text-to-image models for the same.
cs.cmu.edu/~syncd-proje...
w/ Xi Yin, @junyanz.bsky.social, Ishan Misra, and Samaneh Azadi
11.02.2025 18:12 β π 4 π 1 π¬ 0 π 0
The Illusion of Awareness: Why We See Much Less Than We Think We Do
A few years ago, while walking home, I noticed a dry cleaners across the street from my house. βWas that always there?β I thought, surprised. Iβd walked by that spot many, many times over the years, b...
One day walking home, I noticed a dry cleaners across the street. βWas that always there?β I thought. A little Googling revealed that it was on my street longer than I have.
Here's a blog post on why we often miss what's right in front of us. #visionscience
aaronhertzmann.com/2024/05/09/i...
30.01.2025 19:21 β π 17 π 4 π¬ 0 π 0
Excited to bring the 5th CV4Animals Workshop to #CVPR2025
We welcome submissions in 2 tracks:
1) unpublished work up to 4 pages
2) papers published within last 2 years
Submit by Mar 28 & join us with amazing speakers in Nashville:
www.cv4animals.com
π¦πͺΌπ¬πΏοΈπ¦©π’π¦π¦π¦₯π¦
@cvprconference.bsky.social
01.02.2025 04:24 β π 10 π 4 π¬ 0 π 3
3D content creation with touch!
We exploit tactile sensing to enhance geometric details for text- and image-to-3D generation.
Check out our #NeurIPS2024 work on Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation: ruihangao.github.io/TactileDream...
1/3
11.12.2024 09:08 β π 13 π 6 π¬ 1 π 0
I created a huggingface space for my current work PairCustomization - You can choose from a set of pretrained LoRAs trained with our method, and run inference with our novel style guidance:
huggingface.co/spaces/pairc...
I demo'ed this at #SIGRAPHASIA2024 and it went great! :)
3/3
04.12.2024 22:55 β π 3 π 1 π¬ 0 π 0
Check out Maxwell et al.'s recent SIGGRAPH Asia paper on model customization with a single image pair. The code is available at github.com/PairCustomiz...
05.12.2024 01:40 β π 9 π 1 π¬ 0 π 0
Introducing Generative Omnimatte:
A method for decomposing a video into complete layers, including objects and their associated effects (e.g., shadows, reflections).
It enables a wide range of cool applications, such as video stylization, compositions, moment retiming, and object removal.
26.11.2024 15:55 β π 134 π 20 π¬ 3 π 8
TTIC building. Photo credit, TTIC.
I am recruiting exceptional PhD students & postdocs with an adventurous soul for my π«new TTIC AI labπ«! We aim to understand intelligence, one pixel at a time, inspired by psychology, neuroscience, language, robotics, and the arts. Apply: www.ttic.edu/studentappli...
sites.google.com/ttic.edu/ope...
12.11.2024 19:28 β π 30 π 7 π¬ 0 π 0
Associate Professor @brownvc.bsky.socialβ¬
Affiliate Faculty @uwcse.bsky.social
Chair βͺ@wigraph.bsky.socialβ¬
Computer Science PhD student @ Carnegie Mellon University. Draws things and codes things, sometimes both at the same time.
Assistant Prof. at Georgia Tech | NVIDIA AI | Making robots smarter
Assistant Professor at University of Pennsylvania.
Robot Learning.
https://www.seas.upenn.edu/~dineshj/
Professor at UW-Madison. https://pages.cs.wisc.edu/~yongjaelee/
Assistant Research Professor in the Robotics Institute at Carnegie Mellon University. Working on Computer Vision, Spatial Intelligence, Digital Humans, and Computational Behavior.
Computer Vision PhD at MIT
georgecazenavette.github.io/
Artist, Prof. of Engineering @UCBerkeley, Chief Scientist, @AmbiRobotics & @JacobiRobotics. Interested in robots, rockets, redwoods, rebels.
AI Research Scientist at Meta | posts are my own
Assistant Professor @ University of Virginia
https://craigleili.github.io
MPhil in Computer Graphics @HKU. Visiting @Penn.
CharacterAnimation&Geometry&Topology&Physics-based Animation&Simulation&AIGC.
Homepage: frank-zy-dou.github.io
I doubt existence.
Sr Research Scientist, Adobe Research
PhD Berkeley, BS/MEng Cornell
https://richzhang.github.io/
Assistant Professor @uchicago @uchicagocs. PhD from @TelAvivUni. Interested in computer graphics, machine learning, & computer vision π€
Computer Vision PhD student @ CMU
Professor for Computer Science at TU Darmstadt, Germany
neural-capture.com
MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: rachelg@csail.mit.edu
Professor of HCII and LTI at Carnegie Mellon School of Computer Science.
jeffreybigham.com