Jonathan Cheng

@jonathancheng.bsky.social

English PhD turned Machine Learning Researcher. I medicate my imposter syndrome with coffee. Current: Foundation Models @ Apple. Prev: LLMs/World Models @ Riot Games, RecSys @ Apple. Seattle 🏳️‍🌈

4,592 Followers  |  1,808 Following  |  1,102 Posts  |  Joined: 28.06.2023  |  2.0072

Latest posts by jonathancheng.bsky.social on Bluesky

A ton of attention over the years goes to plots comparing open to closed models.
The real trend that matters for AI impacts on society is the gap between closed frontier models and local consumer models.
Local models passing major milestones will have major repercussions.
buff.ly/ccMJydQ

04.10.2025 18:40 - 👍 56    🔁 8    💬 1    📌 1

Dark blue = most
Gray = middle
Light blue = least?!?

Please.

03.10.2025 22:18 - 👍 1    🔁 0    💬 0    📌 0

Someone get UCLA Law on the line, 'cause this data viz’s color choice is hot garbo.

03.10.2025 22:17 - 👍 6    🔁 0    💬 1    📌 0

If it provides any comfort/assuages imposter syndrome, I just learned about a very smart person who wrote petabytes of NaNs, bc of a bug in their pipeline.

25.09.2025 04:14 - 👍 9    🔁 0    💬 0    📌 0
Assistant Professor - Information - School of Information University of California, Berkeley is hiring. Apply now!

The UC Berkeley School of Information is hiring an assistant professor in the broad field of Information--including areas of info seeking/retrieval, digital humanities, cultural analytics, info viz, & philosophy of information (among others). Deadline Nov 1! aprecruit.berkeley.edu/JPF05014

23.09.2025 14:43 - 👍 76    🔁 74    💬 1    📌 1

Also, if you haven’t imagined being haunted in one of these buildings, you haven’t truly lived in the Midwest.

22.09.2025 22:09 - 👍 1    🔁 0    💬 0    📌 0

They were selling these in the gift shop at the Seattle Art Museum.

And, my god, the restraint I showed? Applaudable. *waits for applause*

But also, these are absolutely stunning.

22.09.2025 22:08 - 👍 4    🔁 0    💬 1    📌 0

“You get 100 flops per minute!”
“Floating point operations?”
“What? Pfft. No. Fish-lopped per minute!”

20.09.2025 22:56 - 👍 4    🔁 0    💬 0    📌 0

You might think TPUs/GPUs are complicated, but have you tried operating a 1910 “fish processing machine”?

20.09.2025 22:51 - 👍 9    🔁 2    💬 0    📌 1

The animals at the Seattle Zoo are *sleepy*

20.09.2025 21:10 - 👍 3    🔁 0    💬 0    📌 0

Big warning for all weebs out there. This is a code red.

17.09.2025 22:09 - 👍 0    🔁 0    💬 0    📌 0

Somewhere, my Korean and Chinese ancestors are like “oh yeah? You like ordering Japanese things? GET TARIFFED, SON!”

17.09.2025 22:09 - 👍 9    🔁 0    💬 1    📌 0

Comparing AI Labs in the United States and China, by Jingyuan Liu

14.09.2025 00:40 - 👍 80    🔁 18    💬 3    📌 2
Qwen Qwen Chat offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.

🔹Multi-Token Prediction → turbo-charged speculative decoding

Demo: chat.qwen.ai
Blog: qwen.ai/blog?id=4074...
Model - Huggingface: huggingface.co/collections/...
Model - Kaggle: www.kaggle.com/models/qwen-...
Alibaba Cloud API: www.alibabacloud.com/help/en/mode...

11.09.2025 22:11 - 👍 4    🔁 2    💬 0    📌 0

I’ve lost power in Seattle for like the third time in a month.

Which is not helping my nostalgia for Chicago’s better transit & stable power grid 😂

Otoh, there are some pretty church buildings in the area.

11.09.2025 03:12 - 👍 3    🔁 0    💬 1    📌 0
Building Machine Learning Systems for a Trillion Trillion Floating Point Operations
YouTube video by Jane Street

Super fun high-level talk on the history of ML frameworks.

18:50 is a nice overview of torch compilers
19:10 has a nice demarcation of user/execution categories
21:30 nicely answers: why GPUs?

Nicely goes between broader historical effects, software, and hardware.

youtu.be/139UPjoq7Kw?...

11.09.2025 01:08 - 👍 8    🔁 2    💬 0    📌 0

I really enjoy working from cafes, and zero-trust network access/IP reputation filtering/geofencing is going to ruin me 😂

07.09.2025 19:44 - 👍 2    🔁 0    💬 0    📌 1

There are some perks of my job switch… but having just visited the Seattle Riot office: work somewhere with a dope IP, the workspaces are so great.

06.09.2025 19:24 - 👍 6    🔁 1    💬 0    📌 0
Praxis - Embedding Driven Text Analysis of Crease’s Stance Towards Chinese Immigrants This notebook aims to demonstrate how machine learning can assist with historical and other humanity research.

At long last, I can post my team's summer project: applied modules to teach how ML/AI tools are changing social science and humanities research: ubcecon.github.io/praxis-ubc/

Highlights:

LegalBERT to analyze anti/pro-immigrant sentiment in 19th c. BC law: ubcecon.github.io/praxis-ubc/d...
🧵1

05.09.2025 21:43 - 👍 53    🔁 21    💬 1    📌 2

My slogan to young DHers would be “vibe code till dawn, there’s intellectual ground for the taking!”

05.09.2025 06:58 - 👍 5    🔁 0    💬 0    📌 0

+1 I just think that a field that has both coded and taught young readers/writers for years thrives in an era when ML papers fumble about their “rephrasing” strategies and preference datasets have not been treated with some basic stylometry.

05.09.2025 06:55 - 👍 9    🔁 1    💬 1    📌 0

In hindsight, what I should’ve done is written a few articles or a book, *and then* joined an outfit where I shouldn’t say much. There’s *a lot* to say here.

05.09.2025 06:36 - 👍 1    🔁 0    💬 0    📌 0

Something something, one person’s language data augmentation is another person’s distant writing is another person’s creative writing project

05.09.2025 06:22 - 👍 1    🔁 0    💬 0    📌 0

Yeah, I know exactly what you mean. Given that *nervous cough* I suspect we’re thinking about very similar problems.

There’s a line to draw from before NaNoGenMo, back translation, synthetic data making, etc. etc.

It’s both interesting and, frankly, very fun.

05.09.2025 06:21 - 👍 1    🔁 0    💬 2    📌 0

Seattle, advertising itself to be the setting of many fog-related thrillers.

04.09.2025 16:45 - 👍 7    🔁 0    💬 0    📌 0

I thought I ordered Chinese food… but I might be receiving way more than I bargained for…

02.09.2025 03:51 - 👍 3    🔁 0    💬 0    📌 0

Just made a playlist called “The Goats of Melancholy,” and I really wish I had the musical talent needed to build a band on that name.

31.08.2025 18:19 - 👍 5    🔁 0    💬 0    📌 0

Now look, I can understand that transparent does not mean weak.

But my primate brain, otoh, will not step on this.

30.08.2025 20:37 - 👍 5    🔁 0    💬 0    📌 0
GitHub - google-deepmind/limit: On the Theoretical Limitations of Embedding-Based Retrieval

"On the Theoretical Limitations of Embedding-based Retrieval"

Paper: arxiv.org/abs/2508.21038
Repo: github.com/google-deepm...

30.08.2025 02:13 - 👍 25    🔁 5    💬 0    📌 0

💯 the artists/designers at Riot Games should really take pride that they were heavily influential in this

30.08.2025 01:58 - 👍 2    🔁 0    💬 0    📌 0