Chris Paxton's Avatar

Chris Paxton

@cpaxton.bsky.social

AI, robotics, and other stuff. Currently AI @ agility robotics Former Hello Robot, NVIDIA, Meta. Writing about robots https://itcanthink.substack.com/ All opinions my own

5,520 Followers  |  1,386 Following  |  2,754 Posts  |  Joined: 25.02.2024  |  2.1026

Latest posts by cpaxton.bsky.social on Bluesky

Post image

This isn't AI "eating" the economy, its that only one part of the economy is still growing/doing well.

03.08.2025 12:41 β€” πŸ‘ 17    πŸ” 3    πŸ’¬ 1    πŸ“Œ 3

Seems bad tbh

03.08.2025 12:29 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We got a good couple of brutal wars out of it, though, while we figured it out

03.08.2025 12:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

I mean he's not wrong but this would be... bad. This is a bad thing. People should not be doing this.

03.08.2025 03:49 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

What the hell are we doing here

03.08.2025 03:33 β€” πŸ‘ 41    πŸ” 2    πŸ’¬ 4    πŸ“Œ 1

Thanks!

03.08.2025 03:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Gotta practice my whistling and egg-laying to remain hireable

03.08.2025 02:54 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Yeah

03.08.2025 01:48 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Why Blue States Can’t Have Nice Things In the tangle of outsourcing, no one is actually in charge.

Oh this is from the very excellent Persuasion newsletter: open.substack.com/pub/persuasi...

03.08.2025 01:45 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

This gets to the core of it: "What’s missing is the basic expectation that public employees are directly responsible for solving public problems, and the role we as citizens play in creating a culture that makes good governance possible."

You rarely expect public servants to, you know, help

03.08.2025 01:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 1
Post image

We are going through this in thoroughly democratic Pittsburgh where roadwork and maintenance stretches out indefinitely and nothing is ever fixed; random closures every hundred meters, sidewalks shut down, the works. Total failure of state capacity.

03.08.2025 01:40 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Whole body controller from my team. Lean on it, push it, robot doesn't fall. Important to watch the feet: it smartly changes its posture as Jonah pushes down on it.

03.08.2025 01:32 β€” πŸ‘ 36    πŸ” 3    πŸ’¬ 5    πŸ“Œ 0

At least there's no moat deep enough to stop some random Chinese team with like $50 of bootleg gpus lol, Microsoft and Amazon can't figure it out and nvidia still has to fine tune qwen so...

03.08.2025 01:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yeah that tracks

03.08.2025 01:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Genuinely what the fuck, how?

03.08.2025 01:08 β€” πŸ‘ 19    πŸ” 1    πŸ’¬ 3    πŸ“Œ 1
A side-by-side bar chart compares the performance of three modelsβ€”XBai o4 (blue), OpenAI-o3-mini (gray), and Claude Opus 4 (yellow)β€”across two operational modes: Medium Mode and Low Mode. Each mode has results on three benchmarks: AIME24, AIME25, and LiveCodeBench v5.

Medium Mode:
	β€’	AIME24: XBai o4 (85.4), o3-mini (79.6), Claude Opus 4 (75.7)
	β€’	AIME25: XBai o4 (77.6), o3-mini (74.8), Claude Opus 4 (75.5)
	β€’	LiveCodeBench v5: XBai o4 (67.0), o3-mini (66.3), Claude Opus 4 (61.3)

Low Mode:
	β€’	AIME24: XBai o4 (82.4), o3-mini (60.0), Claude Opus 4 (75.7)
	β€’	AIME25: XBai o4 (74.8), o3-mini (48.3), Claude Opus 4 (75.5)
	β€’	LiveCodeBench v5: XBai o4 (66.6), o3-mini (62.0), Claude Opus 4 (61.3)

XBai o4 leads in nearly every category, with particularly strong performance on AIME24 in both modes. Claude Opus 4 closely trails o3-mini in some Medium Mode results but outperforms o3-mini in Low Mode for AIME25.

A side-by-side bar chart compares the performance of three modelsβ€”XBai o4 (blue), OpenAI-o3-mini (gray), and Claude Opus 4 (yellow)β€”across two operational modes: Medium Mode and Low Mode. Each mode has results on three benchmarks: AIME24, AIME25, and LiveCodeBench v5. Medium Mode: β€’ AIME24: XBai o4 (85.4), o3-mini (79.6), Claude Opus 4 (75.7) β€’ AIME25: XBai o4 (77.6), o3-mini (74.8), Claude Opus 4 (75.5) β€’ LiveCodeBench v5: XBai o4 (67.0), o3-mini (66.3), Claude Opus 4 (61.3) Low Mode: β€’ AIME24: XBai o4 (82.4), o3-mini (60.0), Claude Opus 4 (75.7) β€’ AIME25: XBai o4 (74.8), o3-mini (48.3), Claude Opus 4 (75.5) β€’ LiveCodeBench v5: XBai o4 (66.6), o3-mini (62.0), Claude Opus 4 (61.3) XBai o4 leads in nearly every category, with particularly strong performance on AIME24 in both modes. Claude Opus 4 closely trails o3-mini in some Medium Mode results but outperforms o3-mini in Low Mode for AIME25.

XBai-o4: a new supermodel

* Open weights, apache 2
* 32B
* beats o3-mini
* for TTC they train an extra head as a reward model to do binary classification

hf: huggingface.co/MetaStoneTec...
paper: arxiv.org/abs/2507.01951

02.08.2025 21:53 β€” πŸ‘ 36    πŸ” 7    πŸ’¬ 3    πŸ“Œ 2

Thank you!

03.08.2025 00:56 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It depends on what you mean? If you took asimov or someone forward to ten years from now, they'd certainly think we had achieved it, yes. I think a superintelligence doesn't need to be superior in all dimensions, just many.

03.08.2025 00:56 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

An airplane is not a bird, after all, and though a bird could never dream of the speed and altitude of an airplanes flight, there's plenty they can do that planes can't.

Likewise, human intelligence may well have comparative advantage even in an age of superintelligence.

03.08.2025 00:23 β€” πŸ‘ 10    πŸ” 0    πŸ’¬ 2    πŸ“Œ 1
Post image

I confess I read this one a while ago, and it’s very much stuck with me. I keep coming back to it. It’s one of those bits that, once you’ve spent a lot of time around little kids, just feels too real, so it sticks with you and hollows you out a bit inside.

substack.com/home/post/p-...

03.08.2025 00:09 β€” πŸ‘ 10    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

100% agree

The most important skill right now is to be adaptable and open to the fluid and the abstract (specifically, the uncanny).

02.08.2025 15:43 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

For most of human history, there was no (appreciable) technological change within the course of an average lifespan. We live in uniquely dynamic times, and therefore the only way to stay afloat in my opinion is to be willing to change

02.08.2025 15:40 β€” πŸ‘ 19    πŸ” 2    πŸ’¬ 3    πŸ“Œ 0

The "defeat" part is still in question but to me this makes it kind of the series of our era

02.08.2025 14:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is basically Robert Jackson Bennett's new book series (most recent book, a drop of corruption) for what it's worth

02.08.2025 14:59 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

One thing about humanoid robots i think is under appreciated, is that you dont have to make some super crazy training set for some specialized arm on wheels. It can learn from humans performing actions, so this is incredibly useful

01.08.2025 15:12 β€” πŸ‘ 39    πŸ” 5    πŸ’¬ 3    πŸ“Œ 0

The biggest driver of robotics innovation over the last couple of years hasn't been anything clever. It's just been people making cheap robots easier to use. And that's a good thing; we have all the tools we need for a robotics revolution, we just need to actually get people out there using robots.

01.08.2025 16:45 β€” πŸ‘ 81    πŸ” 6    πŸ’¬ 2    πŸ“Œ 1

One thought that's stuck with me is how much of recent robotics research -- so many startups and papers -- have just acknowledged that we should make it really, really easy to use robots and to collect data. I wrote about it here: open.substack.com/pub/itcanthi...

01.08.2025 13:13 β€” πŸ‘ 25    πŸ” 4    πŸ’¬ 1    πŸ“Œ 1
Post image

It seems that many AI companies and research labs β€” including ByteDance β€” are exploring Diffusion-based LLMs.

Seed Diffusion: A large scale language model based on discrete-state diffusion, specializing in code generation, achieves an inference speed of 2,146 token/s, a 5.4x improvement over

01.08.2025 02:55 β€” πŸ‘ 36    πŸ” 6    πŸ’¬ 3    πŸ“Œ 1

seriously lol

01.08.2025 03:20 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

*Reading all the comments under this post*

I thought you guys wanted the AI that did the laundry

01.08.2025 02:55 β€” πŸ‘ 58    πŸ” 8    πŸ’¬ 7    πŸ“Œ 0

@cpaxton is following 20 prominent accounts