jeffcarp

@jeffcarp.bsky.social

🌉 Running, biking, and drinking coffee around SF 🧑‍💻 ML at Goog, prev. Waymo, fintech, NPR 🐺 Social media manager for @noonathehusky https://www.jeffcarp.com

128 Followers  |  86 Following  |  13 Posts  |  Joined: 21.10.2024  |  2.021

Latest posts by jeffcarp.bsky.social on Bluesky

A Waymo vehicle was driving in a 25mph zone in LA when an oncoming car swerved into its lane while speeding up to over 70mph… 3x the speed means 9x the destructive energy.

Waymo vehicle reacted safely.

Source: Dmitri Dolgov (Co-CEO at waymo)

06.03.2025 03:46 - 👍 37    🔁 9    💬 0    📌 2
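
The "3x the speed means 9x the energy" figure follows from kinetic energy scaling with the square of speed (E = ½mv²). A minimal sketch of that arithmetic, assuming both cars have the same mass (an assumption not stated in the post) and using a hypothetical helper name:

```python
def kinetic_energy_ratio(v_fast: float, v_slow: float) -> float:
    """Ratio of kinetic energies for two objects of equal mass.

    Kinetic energy is 0.5 * m * v**2, so with the mass held equal
    (an assumption) the ratio reduces to (v_fast / v_slow) ** 2.
    """
    return (v_fast / v_slow) ** 2


# Exactly 3x the speed gives 9x the energy.
ratio_3x = kinetic_energy_ratio(3.0, 1.0)

# The speeds quoted in the post: 70 mph against a 25 mph limit.
# 70 / 25 is 2.8, so the ratio is a little under 8; the post says
# "over 70mph", and 75 mph would give the full 9x.
ratio_quoted = kinetic_energy_ratio(70.0, 25.0)

print(f"3x speed: {ratio_3x:.2f}x energy")
print(f"70 vs 25 mph: {ratio_quoted:.2f}x energy")
```

The units cancel in the ratio, so mph can be used directly without converting to m/s.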

Awesome LLM Post-training

This repository is a curated collection of the most influential papers, code implementations, benchmarks, and resources related to Large Language Models (LLMs) Post-Training Methodologies.

github.com/mbzuai-oryx/...

04.03.2025 00:03 - 👍 42    🔁 10    💬 1    📌 0

Are you aware 94% of traffic fatalities are caused by human error?

04.03.2025 00:18 - 👍 1    🔁 0    💬 1    📌 0

Thanks for reposting my video. I think Waymo also solves the critical urban problem of literally not having people dying every day for no reason

03.03.2025 23:58 - 👍 1    🔁 0    💬 1    📌 0

I'll take a look!

18.02.2025 20:59 - 👍 1    🔁 0    💬 0    📌 0

How to Scale Your Model

This book aims to demystify the science of scaling language models on TPUs: how TPUs work and how they communicate with each other, how LLMs run on real hardware, and how to parallelize your models during training and inference so they run efficiently at massive scale.

04.02.2025 18:02 - 👍 12    🔁 2    💬 1    📌 0

That immigrants, from China, India, Iran, Latin America, and so many places choose to come here is a blessing and a gift. That this needs to be said and that politicians ever make them feel otherwise is an eternal disappointment.

01.02.2025 20:30 - 👍 31    🔁 1    💬 2    📌 0

A vision researcher's guide to some RL stuff: PPO & GRPO by Yuge (Jimmy) Shi

This is a deep dive into Proximal Policy Optimization (PPO), which is one of the most popular algorithms used in RLHF for LLMs, as well as Group Relative Policy Optimization (GRPO) proposed by the DeepSeek folks.

31.01.2025 05:56 - 👍 22    🔁 5    💬 1    📌 0

Same, I love seeing spelling errors now, they're so human

26.01.2025 04:07 - 👍 3    🔁 0    💬 0    📌 0
A guide to JAX for PyTorch developers | Google Cloud Blog PyTorch users can learn about JAX in this tutorial that connects JAX concepts to the PyTorch building blocks that they're already familiar with.

The PyTorch developer's guide to JAX fundamentals cloud.google.com/blog/product...

10.01.2025 04:05 - 👍 16    🔁 5    💬 0    📌 0

For next time, consider rsync to copy files in parallel

10.01.2025 00:31 - 👍 0    🔁 0    💬 1    📌 0

#2025 is the sum of the first 9 cubes 🤩

1+8+27+64+125+216+343+512+729 🥂 🎉 🎆 🎇 🧮

31.12.2024 16:13 - 👍 81    🔁 19    💬 9    📌 4
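
The sum in the post above can be checked directly, and it also matches the classical identity that the sum of the first n cubes equals the square of the n-th triangular number. A small sketch, with hypothetical helper names chosen for illustration:

```python
def sum_of_cubes(n: int) -> int:
    """Add up k**3 for k = 1..n by brute force."""
    return sum(k ** 3 for k in range(1, n + 1))


def triangular(n: int) -> int:
    """n-th triangular number: 1 + 2 + ... + n."""
    return n * (n + 1) // 2


total = sum_of_cubes(9)

# Identity: 1^3 + ... + n^3 == (1 + ... + n)^2, here 45^2.
assert total == triangular(9) ** 2 == 2025

print(total)
```

Because 1 + 2 + ... + 9 = 45, this is also why 2025 is a perfect square (45²).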
Charting the Shapes of Stories with Game Theory Stories are records of our experiences and their analysis reveals insights into the nature of being human. Successful analyses are often interdisciplinary, leveraging mathematical tools to extract str...

Would be fascinated to learn how this paper came into being given the authors on it: arxiv.org/abs/2412.05747

31.12.2024 00:21 - 👍 45    🔁 8    💬 2    📌 1
Research scholar program Overview

Applications open on Dec 20 for the #Research Scholar program, which aims to strengthen long-term collaboration with the academic community by supporting early-career professors pursuing research in fields relevant to #Google. Learn more & apply by Jan 27 ↓

research.google/programs-and...

22.12.2024 00:43 - 👍 38    🔁 10    💬 1    📌 0

Good advice from Victor Dibia

19.12.2024 18:20 - 👍 11    🔁 3    💬 2    📌 0
Eugene Vinitsky

A short list of tips for keeping a clean, organized ML codebase for new researchers: eugenevinitsky.com/posts/quick-...

18.12.2024 20:00 - 👍 134    🔁 29    💬 12    📌 3
Fiddle documentation โ€” Fiddle documentation

If you like Pyrallis you might also want to look at Fiddle, which goes one step further and uses the model classes themselves as config (skipping the need for extra dataclasses).
fiddle.readthedocs.io/en/latest/

19.12.2024 16:45 - 👍 1    🔁 0    💬 0    📌 0

If machine learning is the high-interest credit card of technical debt, quantization is the back alley predatory loan

12.12.2024 19:35 - 👍 0    🔁 0    💬 0    📌 0
Release 0.7 · simonw/llm-gemini New Gemini 2.0 Flash model: llm -m gemini-2.0-flash-exp 'prompt goes here'. #28

Gemini 2.0 is out, and there's a ton of interesting stuff about it. From my testing it looks like Gemini 2.0 Flash may be the best currently available multi-modal model - I upgraded my LLM plugin to support that here: github.com/simonw/llm-g...

Gemini 2.0 announcement: blog.google/technology/g...

11.12.2024 17:55 - 👍 129    🔁 17    💬 3    📌 1
Adds initial support for an MLX backend by angeloskath · Pull Request #18962 · keras-team/keras This PR adds an MLX (https://github.com/ml-explore/mlx) backend as mentioned also in #18901 . A lot of things are still missing but my initial goal was to be able to pass the trainer and backend te...

There has been ongoing work to add one, you can follow along here:
github.com/keras-team/k...

09.12.2024 20:27 - 👍 2    🔁 0    💬 0    📌 0

Going full circle: in 2019 Waymo made an April Fools video about a dog-only robotaxi service. Now people are actually sending their dogs in Waymo unattended.
youtu.be/ljbeFpOHvEA?...

09.12.2024 15:54 - 👍 2    🔁 0    💬 0    📌 0

Reinforcement Learning: An Overview

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics.

arxiv.org/abs/2412.05265

09.12.2024 08:37 - 👍 54    🔁 8    💬 0    📌 1
Looking back at the first year of the Gemini era Google's Gemini family of AI models has empowered developers to build innovative applications and explore its journey over the first year.

Happy Birthday to Gemini! ✨🎂

Around this time last year, I was working on the Gemini Launch and it was exciting to have access to such models

After one year I've learned a lot and I'm still amazed by what can be done!

best feature: 2M context window 🤯

developers.googleblog.com/en/looking-b...

07.12.2024 23:56 - 👍 24    🔁 2    💬 3    📌 0

I'm late to the game here, but super impressed this UI is an open source React Native app, the framework has come a long way!

08.12.2024 01:46 - 👍 2    🔁 0    💬 0    📌 0
Google Colab

It's possible to access token embeddings directly in KerasHub models - would something like this work for you?
colab.research.google.com/drive/1HaKXa...

07.12.2024 17:06 - 👍 3    🔁 0    💬 2    📌 0

Derek Sivers once said “Mastery is the best goal because the rich can't buy it, the impatient can't rush it, the privileged can't inherit it, and nobody can steal it. You can only earn it through hard work. Mastery is the ultimate status.”

Does it still hold in the age of LLMs? 😎

06.12.2024 04:27 - 👍 11    🔁 3    💬 2    📌 0

Paligemma2 is out! Bigger models, better results. For the best experience, do not forget to finetune.

Congrats Paligemma2 team!

05.12.2024 18:28 - 👍 13    🔁 1    💬 0    📌 0

Hi Bluesky! Any AI/ML following recommendations?

04.12.2024 21:59 - 👍 1    🔁 0    💬 0    📌 0

@jeffcarp is following 20 prominent accounts