
Agneet Chatterjee

@agneet.bsky.social

Image and Video generation. https://agneetchatterjee.com/

153 Followers  |  37 Following  |  2 Posts  |  Joined: 20.11.2024

Latest posts by agneet.bsky.social on Bluesky

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
Text-to-Image (T2I) and multimodal large language models (MLLMs) have been adopted in solutions for several computer vision and multimodal learning tasks. However, it has been found that such vision-l...

We also develop a benchmark to evaluate the spatial understanding of VLMs. The core idea is to use synthetic images, which avoids any possibility of test-time leakage: arxiv.org/abs/2408.02231

26.11.2024 15:26  |  👍 1    🔁 0    💬 0    📌 0

@csprofkgd.bsky.social could you add me too? Thank you!

24.11.2024 21:11  |  👍 1    🔁 0    💬 1    📌 0
