Roopal Garg @roopalgarg - Bluesky Profile

Latest posts by roopalgarg.bsky.social on Bluesky

Happy new year to everyone...

01.01.2026 21:30 — 👍 0 🔁 0 💬 0 📌 0

🥁Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇

25.03.2025 17:25 — 👍 215 🔁 65 💬 34 📌 11

folks working on one or more of the following

🖼️ Image Descriptions to improve Image-Text alignment
AND/OR
💬Multi/Cross Lingual image-text understanding/generation
AND/OR
🌏Geo-Cultural representation and learning

Please DM if you are willing to discuss the current state/challenges/future-work.

25.11.2024 06:57 — 👍 3 🔁 1 💬 1 📌 0

New starter pack! go.bsky.app/GZ4hZzu

28.10.2024 09:43 — 👍 42 🔁 17 💬 6 📌 5

Too soon but 🤞

24.11.2024 17:24 — 👍 1 🔁 0 💬 0 📌 0

🙋‍♂️ Could I be added ? Thanks :)

24.11.2024 16:53 — 👍 1 🔁 0 💬 0 📌 0

We had a great experience presenting our work on ImageInWords to the community #EMNLP2024 . Thank you everyone for stopping by🙏! Looking forward to future work and seeing image descriptions as a foundational multi-modal task! @emnlpmeeting.bsky.social @deep-mind.bsky.social #NLProc #Multimodal

23.11.2024 22:53 — 👍 9 🔁 0 💬 0 📌 0

All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc

19.11.2024 03:48 — 👍 107 🔁 37 💬 1 📌 3

Research Engineer, GenMedia Mountain View, California, US

hello new followers! we’re actively hiring on our generative media team in Mountain View: boards.greenhouse.io/deepmind/job...

we work on image, video, audio, etc… come work with us if you’re interested! apply asap :)

22.11.2024 06:08 — 👍 15 🔁 4 💬 1 📌 0

ImageInWords: Unlocking Hyper-Detailed Image Descriptions Despite the longstanding adage "an image is worth a thousand words," generating accurate hyper-detailed image descriptions remains unsolved. Trained on short web-scraped image text, vision-language mo...

📢 Excited to unveil our latest research, ImageInWords (IIW)! 🚀We're pushing the boundaries of image descriptions with a new seeded, sequential, human-in-the-loop approach producing SoTA, articulate, hyper-detailed descriptions.

arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal

21.11.2024 00:26 — 👍 7 🔁 1 💬 0 📌 0

@roopalgarg is following 20 prominent accounts

VLMs4All - CVPR 2025 Workshop
@vlms4all

Workshop on Vision Language Models For All: Building Geo-Diverse and Culturally Aware Vision-Language Models @ CVPR 2025 https://sites.google.com/view/vlms4all

Douglas Eck
@douglaseck

Senior Research Director at Google DeepMind in our San Francisco office. I created Magenta (magenta.withgoogle.com) and sometimes find time to be a musician.

@anianruoss

Omar Rivasplata
@omarrivasplata

MIT Press
@mitpress

Committed to the daily re-imagining of what a university press can be since 1962. Website: https://mitpress.mit.edu // The Reader (our home for excerpts, essays, & interviews): https://thereader.mitpress.mit.edu

François Fleuret
@francois.fleuret.org

Research Scientist Meta/FAIR, Prof. University of Geneva, co-founder Neural Concept SA. I like reality. https://fleuret.org

Jeff Dean
@jeffdean

Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...

Georg Ostrovski
@gostrovski

Research Engineer @ Google DeepMind

Milo B. Fasken, Ph.D.
@milofasken

Senior Scientist, Department of Biology, Emory University, Atlanta, GA. Molecular Biologist. RNA Scientist. Yeast Geneticist 🧬. British🇬🇧. Runner 🏃🏻…. Opinions are my own. Interests: RNA Decay, RNA Processing, RNA & Disease.

John Schwartz
@jswatz

UT Austin journalism professor; former NYT, WP. he/him.

Chanda Prescod-Weinstein 🌌
@chanda.blacksky.app

Theoretical Astro/Physicist: https://chanda.science First book: https://tinyurl.com/DisorderedCosmos PREORDER MY NEXT BOOK: https://tinyurl.com/EdgeOfSpaceTime Newsletter: news.chanda.science all Black/all Jewish. 🏳️‍🌈/agender/woman. Posts by/for me🖖🏽

@ranjaykrishna

Hilde Kuehne
@hildekuehne

Professor for CS at the Tuebingen AI Center and affiliated Professor at MIT-IBM Watson AI lab - Multimodal learning and video understanding - GC for ICCV 2025 - https://hildekuehne.github.io/