Roopal Garg's Avatar

Roopal Garg

@roopalgarg.bsky.social

Multimodal Multi-lingual research at Google DeepMind for Gemini post-training. #NLProc #Multimodal

878 Followers  |  882 Following  |  5 Posts  |  Joined: 25.11.2023  |  1.7682

Latest posts by roopalgarg.bsky.social on Bluesky

Post image Post image

πŸ₯Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. πŸ₯‡

25.03.2025 17:25 β€” πŸ‘ 215    πŸ” 65    πŸ’¬ 34    πŸ“Œ 11

folks working on one or more of the following

πŸ–ΌοΈ Image Descriptions to improve Image-Text alignment
AND/OR
πŸ’¬Multi/Cross Lingual image-text understanding/generation
AND/OR
🌏Geo-Cultural representation and learning

Please DM if you are willing to discuss the current state/challenges/future-work.

25.11.2024 06:57 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

New starter pack! go.bsky.app/GZ4hZzu

28.10.2024 09:43 β€” πŸ‘ 42    πŸ” 17    πŸ’¬ 6    πŸ“Œ 5

Too soon but 🀞

24.11.2024 17:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ™‹β€β™‚οΈ Could I be added ? Thanks :)

24.11.2024 16:53 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We had a great experience presenting our work on ImageInWords to the community #EMNLP2024 . Thank you everyone for stopping byπŸ™! Looking forward to future work and seeing image descriptions as a foundational multi-modal task! @emnlpmeeting.bsky.social @deep-mind.bsky.social #NLProc #Multimodal

23.11.2024 22:53 β€” πŸ‘ 9    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc

19.11.2024 03:48 β€” πŸ‘ 107    πŸ” 37    πŸ’¬ 1    πŸ“Œ 3
Preview
Research Engineer, GenMedia Mountain View, California, US

hello new followers! we’re actively hiring on our generative media team in Mountain View: boards.greenhouse.io/deepmind/job...

we work on image, video, audio, etc… come work with us if you’re interested! apply asap :)

22.11.2024 06:08 β€” πŸ‘ 15    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Preview
ImageInWords: Unlocking Hyper-Detailed Image Descriptions Despite the longstanding adage "an image is worth a thousand words," generating accurate hyper-detailed image descriptions remains unsolved. Trained on short web-scraped image text, vision-language mo...

πŸ“’ Excited to unveil our latest research, ImageInWords (IIW)! πŸš€We're pushing the boundaries of image descriptions with a new seeded, sequential, human-in-the-loop approach producing SoTA, articulate, hyper-detailed descriptions.

arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal

21.11.2024 00:26 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

@roopalgarg is following 20 prominent accounts