Nathan Lambert's Avatar

Nathan Lambert

@natolambert.bsky.social

A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef Writes http://interconnects.ai At Ai2 via HuggingFace, Berkeley, and normal places

12,501 Followers  |  265 Following  |  1,456 Posts  |  Joined: 30.04.2023  |  1.808

Latest posts by natolambert.bsky.social on Bluesky

this'll be a big week for American momentum with open models, multiple things are going to start to fall into place

03.08.2025 21:28 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

No just chatting with people who have

03.08.2025 19:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

OpenAI is going to release an open model (1st since GPT 2) and GPT-5 within weeks of eachother. The open model is indicative of where impact is possible in the AI space, where GPT-5 isnt a huge step like GPT-4.

Open models & agents/products are big stories of 2025 so far.

03.08.2025 18:47 β€” πŸ‘ 17    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1
Preview
xAI's Grok 4: The tension of frontier performance with a side of Elon favoritism An o3 class model, the possibility of progress, chatbot beige, and the illusiveness of taste.

I discussed some of this in my Grok 4 post on Interconnects.

Will be a fun space to follow. It’s exciting that Gemini had much bigger gains. www.interconnects.ai/p/grok-4-an-...

02.08.2025 21:00 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

There’s so much more to play with in the parallel compute space - raw parallelism (like BoN sampling), independent agents with an orchestrator (deep research), different base models w more finetuning, how much compute they’re willing to throw at one prompt, reward model types.

02.08.2025 21:00 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Was reading Swyx’s AI News summary on Gemini DeepThink and it’s understated how DeepThink, Grok Heavy, and o3 pro are more likely to be more different in how they use parallel compute relative to the underlying models (which end up being similar).

02.08.2025 21:00 β€” πŸ‘ 11    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Case 912901 of academia being disconnected from reality. Getting weird ethics flags on normal ass papers while people in the real world are building MechaHitler.

01.08.2025 16:12 β€” πŸ‘ 11    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

You get AGI, but the only form factor you can use to interact with it is short term videos.
Would you use it?

31.07.2025 20:59 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 2    πŸ“Œ 1

I've finally figured out how to know when OpenAI will release.
Case 1: normal announcement "coming soon" is an Elon style deadline. Totally mysterious, not soon.
Case 2: google has a major announcement, 100% OpenAI lands the exact deadline

Right now we're in case 1

31.07.2025 14:23 β€” πŸ‘ 16    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

I downgraded from OpenAI pro a few days ago. Already hit my rate limits on o3. Turns out I wasn't wasting money after all.

31.07.2025 02:10 β€” πŸ‘ 15    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

The funny thing about Meta sounding like it'll go closed w AI is how Zuck has craved the next "platform" ever since their problems with Apple, eg investing $20B+/yr in VR, but Llama is the closest they have come to making one, and they're just going to let it what, fade away?

30.07.2025 23:29 β€” πŸ‘ 16    πŸ” 0    πŸ’¬ 5    πŸ“Œ 0
Personal Superintelligence Explore Meta's vision of personal superintelligence, where AI empowers individuals to achieve their goals, create, connect, and lead fulfilling lives. Insights from Mark Zuckerberg on the future of…

2025: buff.ly/oxjt89B
2024: buff.ly/kgmaitj

30.07.2025 13:36 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image Post image

New Zuck post, what a difference a few years makes:

Today: "We'll need to be rigorous about mitigating these risks and careful about what we choose to open source."

2024: "Meta is committed to open source AI... and therefore a platform that will be around for the long term."

30.07.2025 13:36 β€” πŸ‘ 23    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1

Recent AI news (Chinese models and OpenAI’s coming releases)
β€œDo and don’t” of LLM training organizations
Reasoning research and academic blind spots
Research people aren’t paying enough attention to
Non language modeling news & other topics

Enjoy!

YouTube: buff.ly/OWMU0qZ
Post: buff.ly/vBBYAHP

29.07.2025 13:45 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Back in the interview chair with a general recap of the AI research ecosystem with one of my favorite people to talk to. Ross Taylor (former Llama reasoning lead) is back for round 2!

In this episode we cover some of everything.

29.07.2025 13:45 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Any day now

27.07.2025 23:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

17 of the top 20 open models on the Artificial Analysis leaderboard are made by Chinese companies. 2 from the US and 1 from Europe. This is what current domination of open models looks like, but it can be changed quickly with focused funding and effort.

Leaderboard: buff.ly/hVyaNl8

27.07.2025 19:13 β€” πŸ‘ 30    πŸ” 6    πŸ’¬ 2    πŸ“Œ 1

I bet pretty soon a Chinese research org drops a LLM scaling laws for RL paper.

Closed frontier labs have definitely done this and wont share it, academics havent mastered the data + infra tweaks yet (or lack the $).

27.07.2025 13:44 β€” πŸ‘ 30    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

GitHub copilot is hit or miss but if it’s the only one available it’s better than nothing. Haven’t tried cline

26.07.2025 17:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sonnet 4 probably fine, but depends on the interface.

26.07.2025 16:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Whenever I start mentoring a new junior student or researcher one of the first things I do is to check that they use sufficiently strong AI.

If they don't I'll pay out of pocket to make sure they aren't shooting themselves in the foot. The payoff is worth it.

26.07.2025 15:38 β€” πŸ‘ 12    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Chinese open models being at the frontier today isn’t thaaat surprising for folks who were following closely. What is more surprising is how quickly Llama disappeared from the clear top spot in the conversation. Two trends combine for massive impact.

25.07.2025 16:04 β€” πŸ‘ 24    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Pretty much lol

24.07.2025 22:53 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Important distinction: open AI models, not OpenAI models

24.07.2025 19:46 β€” πŸ‘ 28    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Slack shipped a summary feature, AGI coming 0.1% sooner, finally man.

23.07.2025 23:34 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

progress is far more valuable than the money cost

23.07.2025 18:09 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Which kind of non profit are you, OpenAI or Ai2?

23.07.2025 18:04 β€” πŸ‘ 15    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

some parts of it are totally fine? I won't speak for all of it, but e.g. open models / research investment are good

23.07.2025 16:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

This is a major shift in the tenor of how the U.S. government feels about and prioritizes open AI models. This narrow piece of the action plan is a huge W overall, with a couple nit picks. Happy to share my thoughts as I scope my own path to building this reality.

buff.ly/Rq5T5Yp

23.07.2025 16:14 β€” πŸ‘ 20    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1
Post image

The new AI action plan is out from the White House
buff.ly/YXNpTtF

23.07.2025 14:23 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 2

@natolambert is following 20 prominent accounts