André Susano Pinto's Avatar

André Susano Pinto

@asusanopinto.bsky.social

131 Followers  |  13 Following  |  1 Posts  |  Joined: 02.12.2024  |  1.5826

Latest posts by asusanopinto.bsky.social on Bluesky

Post image

🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.

1/7

05.12.2024 18:16 — 👍 68    🔁 21    💬 1    📌 5
Post image

Our big_vision codebase is really good! And it's *the* reference for ViT, SigLIP, PaliGemma, JetFormer, ... including fine-tuning them.

However, it's criminally undocumented. I tried using it outside Google to fine-tune PaliGemma and SigLIP on GPUs, and wrote a tutorial: lb.eyer.be/a/bv_tuto.html

03.12.2024 00:18 — 👍 116    🔁 18    💬 3    📌 2

Did you ever try to get an auto-regressive transformer to operate in a continuous latent space which is not fixed ahead of time but learned end to end from scratch?

Enter JetFormer: arxiv.org/abs/2411.19722 -- joint work in a dream team: @mtschannen.bsky.social and @kolesnikov.ch

02.12.2024 18:17 — 👍 14    🔁 2    💬 0    📌 0

@asusanopinto is following 13 prominent accounts