Yash Kant's Avatar

Yash Kant

@yashkant.bsky.social

ai phd at university of toronto // prev at meta, snap research and georgia tech // web: https://yashkant.github.io/

11 Followers  |  16 Following  |  1 Posts  |  Joined: 19.02.2025  |  1.3795

Latest posts by yashkant.bsky.social on Bluesky

Post image

I will be at hashtag#CVPR25 in Nashville! โœจ

Please come chat with me and Ethan Weber - during our poster session on Pippo, on Sat 5-7pm (Hall D)! ๐Ÿ˜Š ๐Ÿ‘‹

Web: yashkant.github.io/pippo

CC: @ethanjohnweber.bsky.social, @igilitschenski.bsky.social

10.06.2025 02:18 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Pippo: High-Resolution Multi-View Humans from a Single Image We present Pippo, a generative model capable of producing 1K resolution dense turnaround videos of a person from a single casually clicked photo. Pippo is a multi-view diffusion transformer and does n...

๐Ÿง‘Pippo: High-Resolution Multi-View Humans from a Single Image
@yashkant.bsky.social, Ethan Weber, Jin Kyu Kim, Rawal Khirodkar, Su Zhaoen, Julieta M., Igor Gilitschenski, Shunsuke Saito, Timur Bagautdinov 3/๐Ÿงต
arxiv.org/abs/2502.07785

03.03.2025 19:47 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I am excited to share that my students @kai-he.bsky.social, @yashkant.bsky.social, Ziyi Wu, and Toshiya Yura, our previous research visitor from Sony, will present papers at #CVPR2025. ๐ŸŽ‰ Check out their amazing work! 1/๐Ÿงต

03.03.2025 19:47 โ€” ๐Ÿ‘ 8    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Pippo : High-Resolution Multi-View Humans from a Single Image

TL;DR: 1K Multiview Diffusion Transformer pre-trained on 3B Human images without captions; post-trained on 2.5K studio captures with pixel-aligned control via ControlMLP; generates > 5x views at inference

18.02.2025 10:16 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@yashkant is following 16 prominent accounts