Gemma explained: Whatβs new in Gemma 3- Google Developers Blog
Google's Gemma 3 model includes vision-language support and architectural changes for resource-friendly multimodal language models.
Gemma 3 explained: Longer context, image support, and a new 1B model. β goo.gle/4lV8iaw
Other key enhancements:
πΈ Best model that fits in a single consumer GPU or TPU host
πΈ KV-cache memory reduction with 5-to-1 interleaved attention
πΈ And more!
Read the blog for the full details on Gemma 3.
30.04.2025 21:46 β π 22 π 8 π¬ 1 π 0
There's a link to a really nice interactive viewer for a sample of the data (will only make sense after you read the post). There's some examples that I would have expected (where something is implied but not directly stated) but also a surprising number of kind of topical things.
17.12.2024 16:12 β π 3 π 1 π¬ 0 π 0
Want to get started using PaliGemma 2?
π€ developers.googleblog.com/en/introduci...
π€ huggingface.co/blog/paligem...
πΎ kaggle.com/models/googl...
π§ github.com/google-resea...
7/7
05.12.2024 18:19 β π 7 π 1 π¬ 0 π 0
ALTA: Compiler-Based Analysis of Transformers
We propose a new programming language called ALTA and a compiler that can map ALTA programs to Transformer weights. ALTA is inspired by RASP, a language proposed by Weiss et al. (2021), and Tracr (Lin...
Iβm pretty excited about this one!
ALTA is A Language for Transformer Analysis.
Because ALTA programs can be compiled to transformer weights, it provides constructive proofs of transformer expressivity. It also offers new analytic tools for *learnability*.
arxiv.org/abs/2410.18077
24.10.2024 03:31 β π 53 π 16 π¬ 2 π 0
Zed - The editor for what's next
Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Not news, but I recently saw the zed.dev demo and it looks amazing. Has anyone used it or something similar?
25.10.2024 14:43 β π 3 π 0 π¬ 0 π 0
Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence.
Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Researcher in NLP, ML, computer music. Prof @uwcse @uwnlp & helper @allen_ai @ai2_allennlp & familiar to two cats. Single reeds, tango, swim, run, cocktails, ΧΧΦ·ΧΧ’ΦΎΧΧ©ΧΧ, GenX. Opinions not your business.
Stanford Linguistics and Computer Science. Director, Stanford AI Lab. Founder of @stanfordnlp.bsky.social . #NLP https://nlp.stanford.edu/~manning/
LM/NLP/ML researcher Β―\_(γ)_/Β―
yoavartzi.com / associate professor @ Cornell CS + Cornell Tech campus @ NYC / nlp.cornell.edu / associate faculty director @ arXiv.org / researcher @ ASAPP / starting @colmweb.org / building RecNet.io
VP and Distinguished Scientist at Microsoft Research NYC. AI evaluation and measurement, responsible AI, computational social science, machine learning. She/her.
One photo a day since January 2018: https://www.instagram.com/logisticaggression/
Sr. Principal Research Manager at Microsoft Research, NYC // Machine Learning, Responsible AI, Transparency, Intelligibility, Human-AI Interaction // WiML Co-founder // Former NeurIPS & current FAccT Program Co-chair // Brooklyn, NY // http://jennwv.com
NYU professor, Google research scientist. Good at LaTeX.
Parker Distinguished Professor, @UNC. Program Chair #EMNLP2024. Director http://MURGeLab.cs.unc.edu (@uncnlp). @Berkeley_AI @TTIC_Connect @IITKanpur
#NLP #CV #AI #ML
https://www.cs.unc.edu/~mbansal/
Professor at UW; Researcher at Meta. LMs, NLP, ML. PNW life.
Entrepreneur
Costplusdrugs.com
Anti-cynic. Towards a weirder future. Reinforcement Learning, Autonomous Vehicles, transportation systems, the works. Asst. Prof at NYU
https://emerge-lab.github.io
https://www.admonymous.co/eugenevinitsky
Research scientist at FAIR NY β€οΈ LLMs + Information Theory. Previously, PhD at UoAmsterdam, intern at DeepMind + MSRC.
Chief Scientist & Technical Fellow at Microsoft. Professor at UW. Mother to four wild boys. #AI #HCI #Productivity #FutureOfWork
Research Engineer at Google DeepMind
Research at Google DeepMind. Ex-Physicist. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). TLM Veo Capabilities (Ingredients & more).
π San Francisco, CA
data scientist at Google DeepMind
Staff Research Engineer @ Google DeepMind
PhD Candidate @ UCL
π΅πΈ
natural language processing and computational linguistics at google deepmind.
Runner, biker, hiker. Software engineer @DeepMind, and open source enthusiast. Sometimes crafts things out of wood. he/his.