Ivana Balazevic @ibalazevic

@ibalazevic.bsky.social

Senior Research Scientist at Google DeepMind, working on Gemini. PhD from University of Edinburgh. ibalazevic.github.io

922 Followers | 134 Following | 4 Posts | Joined: 16.11.2024

Latest posts by ibalazevic.bsky.social on Bluesky

Disentanglement is an intriguing phenomenon that arises in generative latent variable models for reasons that are not fully understood.

If you’re interested in learning why, I highly recommend giving Carl’s blog a read!

18.12.2024 17:08 — 👍 3 🔁 1 💬 0 📌 0

Research Scientist, Language | London, UK

I am hiring for RS/RE positions! If you are interested in language-flavored multimodal learning, evaluation, or post-training apply here 🦎 boards.greenhouse.io/deepmind/job...

I will also be at #NeurIPS2024 so come say hi! (Please email me to find time to chat)

06.12.2024 23:07 — 👍 28 🔁 7 💬 1 📌 1

Our big_vision codebase is really good! And it's *the* reference for ViT, SigLIP, PaliGemma, JetFormer, ... including fine-tuning them.

However, it's criminally undocumented. I tried using it outside Google to fine-tune PaliGemma and SigLIP on GPUs, and wrote a tutorial: lb.eyer.be/a/bv_tuto.html

03.12.2024 00:18 — 👍 117 🔁 19 💬 3 📌 2

I think this comes down to the model behind p(x,y). If features of x cause y, e.g. aspects of a website (x) -> clicks (y); age/health -> disease, then p(y|x) is a (regression) fn of x. But if x|y is a distrib'n of different y's (e.g. cats) then p(y|x) is given by Bayes rule (squint at softmax).

02.12.2024 08:20 — 👍 7 🔁 1 💬 1 📌 0
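
The "squint at softmax" remark in the post above can be made concrete with a small sketch. It assumes, purely for illustration, one-dimensional Gaussian class-conditionals p(x|y) with made-up means, standard deviations, and priors. None of these names or numbers come from the post. The sketch computes p(y|x) once directly from Bayes' rule and once as a softmax over the logits log p(x|y) + log p(y), which gives the same posterior.

```python
import math


def log_gaussian(x, mean, std):
    """Log-density of a 1-D Gaussian, used here as a hypothetical p(x|y)."""
    return -0.5 * ((x - mean) / std) ** 2 - math.log(std * math.sqrt(2 * math.pi))


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def posterior_bayes(x, means, stds, priors):
    """p(y|x) = p(x|y) p(y) / sum_y' p(x|y') p(y'), computed directly."""
    joint = [
        math.exp(log_gaussian(x, m, s)) * p
        for m, s, p in zip(means, stds, priors)
    ]
    total = sum(joint)
    return [j / total for j in joint]


def posterior_softmax(x, means, stds, priors):
    """The same posterior, written as softmax(log p(x|y) + log p(y))."""
    logits = [
        log_gaussian(x, m, s) + math.log(p)
        for m, s, p in zip(means, stds, priors)
    ]
    return softmax(logits)
```

Calling both functions with the same arguments, for example `posterior_bayes(0.3, [-1.0, 1.0], [1.0, 1.0], [0.5, 0.5])` and the matching `posterior_softmax` call, should give posteriors that agree up to floating-point error.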

Read our paper:
Context-Aware Multimodal Pretraining

Now on ArXiv

Can you turn vision-language models into strong any-shot models?

Go beyond zero-shot performance in SigLixP (x for context)

Read @confusezius.bsky.social thread below…

And follow Karsten … a rising star!

28.11.2024 17:03 — 👍 36 🔁 4 💬 0 📌 0

We maintain strong zero-shot transfer of CLIP / SigLIP across model size and data scale, while achieving up to 4x few-shot sample efficiency and up to +16% performance gains!

Fun project with @confusezius.bsky.social, @zeynepakata.bsky.social, @dimadamen.bsky.social and @olivierhenaff.bsky.social.

28.11.2024 14:43 — 👍 20 🔁 3 💬 0 📌 1

Just a heads up to everyone: @deep-mind.bsky.social is unfortunately a fake account and has been reported. Please do not follow it nor repost anything from it.

25.11.2024 23:24 — 👍 83 🔁 34 💬 9 📌 3

Could you add me please? :)

24.11.2024 20:33 — 👍 2 🔁 0 💬 2 📌 0

Me too please :)

22.11.2024 00:30 — 👍 1 🔁 0 💬 1 📌 0

@ibalazevic is following 20 prominent accounts

Pauline Luc
@paulineluc

Research Scientist @ Google DeepMind - working on video models for science. Worked on video generation; self-supervised learning; VLMs - 🦩; point tracking.

Jeff Dean
@jeffdean

Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...

Simone Schaub-Meyer
@simoneschaub

Assistant Professor of Computer Science at TU Darmstadt, Member of @ellis.eu, DFG #EmmyNoether Fellow, PhD @ETH Computer Vision & Deep Learning

Adam Wiemerslage
@adamwiemerslage

NLP PhD from CU Boulder. prev: Apple, ETS, Pearson, Army Research Lab. Next: Kensho https://adamits.github.io

Eugene Vinitsky 🍒
@eugenevinitsky

Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU https://emerge-lab.github.io https://www.admonymous.co/eugenevinitsky

Stephanie Chan
@scychan

Staff Research Scientist at Google DeepMind. Artificial and biological brains 🤖 🧠

Hilde Kuehne
@hildekuehne

Professor for CS at the Tuebingen AI Center and affiliated Professor at MIT-IBM Watson AI lab - Multimodal learning and video understanding - GC for ICCV 2025 - https://hildekuehne.github.io/

Dileep George @dileeplearning
@dileeplearning

AGI research @DeepMind. Ex cofounder & CTO Vicarious AI (acqd by Alphabet), Cofounder Numenta. Triply EE (BTech IIT-Mumbai, MS&PhD Stanford). #AGIComics blog.dileeplearning.com

Andrew Saxe
@saxelab

Professor at the Gatsby Unit and Sainsbury Wellcome Centre, UCL, trying to figure out how we learn

Aidan Clark
@aidanclark

I train models @ OpenAI. Previously Research at DeepMind. Hae sententiae verbaque mihi soli sunt.

roon
@tszzl

Noam Brown
@polynoamial

Researching reasoning at OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series / 🍓

Chris Olah
@colah

Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain. Personal account.

Tim Green
@tfgg.me

Research Engineer at Google DeepMind. AlphaFold, LLMs, Physics and Civic Tech. tfgg.me

Sebastien Bubeck
@sbubeck

I work on AI at OpenAI. Former VP AI and Distinguished Scientist at Microsoft.

Gabriel Peyré
@gabrielpeyre

Aida Nematzadeh
@aidanematzadeh

Research scientist at Google DeepMind.🦎 She/her. http://www.aidanematzadeh.me/

Terence Tao
@teorth

Mathematician at UCLA. My primary social media account is https://mathstodon.xyz/@tao . I also have a blog at https://terrytao.wordpress.com/ and a home page at https://www.math.ucla.edu/~tao/

Hal Daumé III
@haldaume3

Human-centered AI #HCAI, NLP & ML. Director TRAILS (Trustworthy AI in Law & Society) and AIM (AI Interdisciplinary Institute at Maryland). Formerly Microsoft Research NYC. Fun: 🧗🧑‍🍳🧘⛷️🏕️. he/him.

Sophia Sanborn
@naturecomputes

Searching for principles of neural representation | Neuro + AI @ enigmaproject.ai | Stanford | sophiasanborn.com