William Isaac

@williamis.bsky.social

Research Scientist @DeepMind | Previously @OSFellows & @hrdag. RT != endorsements. Opinions Mine. Pronouns: he/him

3,047 Followers  |  126 Following  |  49 Posts  |  Joined: 17.06.2023  |  2.072

Latest posts by williamis.bsky.social on Bluesky

A ton of attention over the years goes to plots comparing open to closed models.
The real trend that matters for AI impacts on society is the gap between closed frontier models and local consumer models.
Local models passing major milestones will have major repercussions.
buff.ly/ccMJydQ

04.10.2025 18:40 - 👍 58    🔁 8    💬 1    📌 1

Claude, "We all know among Sauron's many evils was that he ran Mordor using an Excel spreadsheet with multiple tabs. Show me the spreadsheet"

It made 12 tabs "so bureaucratically complex that even the Eye of Sauron would need reading glasses to review it." Some very funny stuff. Creative, even.

15.09.2025 03:48 - 👍 162    🔁 28    💬 10    📌 8

Gemini Live is giving me the confidence to speak my second language

The anxiety I experience when trying to speak my second language disappears when I use Gemini Live.

#ai #gemini #geminiai

14.09.2025 13:55 - 👍 7    🔁 4    💬 0    📌 0

This is a cool paper suggesting that AI agents can indeed be used for social science experiments, but that just using a chatbot isn't good enough; instead, prompts developed from social & game theory make AI agent actions predictive of real human outcomes. benjaminmanning.io/files/optimi...

04.09.2025 02:13 - 👍 73    🔁 9    💬 2    📌 6

Duolingo’s recent earnings show the gap between vocal online AI critics and businesses’ real-world value.

Despite the backlash, Duolingo’s revenue and daily users jumped 40%.

The AI hype is wild, but so is the value people are getting. This isn’t blockchain; it’s more like mobile or the internet

08.08.2025 13:35 - 👍 42    🔁 6    💬 9    📌 2

This makes my day!!

06.08.2025 17:54 - 👍 2    🔁 0    💬 0    📌 0

"In collaboration with the @archive.org TV News Archive we examined 14,749 evening news broadcasts spanning the big three networks from July 2010 to present, asking Gemini 2.5 Flash Thinking to make an index of the stories covered in each broadcast, geography, sentiment, frame, narrative structure.

31.07.2025 23:43 - 👍 11    🔁 3    💬 1    📌 0
Search Jobs | Microsoft Careers

We may have the chance to hire an outstanding researcher 3+ years post PhD to join Tarleton Gillespie, Mary Gray and me in Cambridge MA bringing critical sociotechnical perspectives to bear on new technologies.

jobs.careers.microsoft.com/global/en/jo...

28.07.2025 17:26 - 👍 89    🔁 49    💬 0    📌 3

🤯

25.07.2025 04:22 - 👍 4    🔁 0    💬 0    📌 0
Preview
Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad
Our advanced model officially achieved a gold-medal level performance on problems from the International Mathematical Olympiad (IMO), the world’s most prestigious competition for young...

🥇 I am so happy to see the Google DeepMind Gemini team **OFFICIALLY** achieved gold-medal performance.

Not only did we get 5 out of 6 questions correct, but, even more amazing, the official IMO graders found those answers clear, precise and easy to follow! 1/

deepmind.google/discover/blo...

21.07.2025 22:24 - 👍 9    🔁 3    💬 1    📌 0

"[video game] as a community theater production" may be one of the most delightful Veo 3 Fast prompts

Please enjoy, in order: GTA, Pokemon, Mario Kart, The Witcher 3, Stardew Valley, Tetris, Mortal Kombat, The Sims, & Death Stranding.

The whole prompt was the one above. And the glitches add to it

19.07.2025 03:08 - 👍 514    🔁 69    💬 23    📌 26
ICML Poster Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge, ICML 2025

If you're at @icmlconf.bsky.social this week, come check out our poster on "Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge" presented by the amazing @afedercooper.bsky.social from 11:30am--1:30pm PDT on Weds!!! icml.cc/virtual/2025...

15.07.2025 18:35 - 👍 32    🔁 10    💬 1    📌 2

The #FAccT and #ICML tags are good places to find some brilliant folks

13.07.2025 17:31 - 👍 1    🔁 1    💬 0    📌 0
Large AI models are cultural and social technologies – Henry Farrell

The best article I've read so far on social impacts: henryfarrell.net/large-ai-mod...

13.07.2025 23:36 - 👍 4    🔁 1    💬 1    📌 0

Did a starter pack a while back too of scholars and practitioners who cover this angle, in case that’s useful: go.bsky.app/5sFqVNS

14.07.2025 07:18 - 👍 1    🔁 1    💬 0    📌 0
Preview
The AI Researcher's Guide to a Non-Boring Bluesky Feed | Naomi Saphra
How to migrate to bsky without a boring feed.

If it's helpful, I followed the recommendations in this post from @nsaphra.bsky.social and as a result have a much more interesting and ML-focused feed: nsaphra.net/post/bsky/

14.07.2025 08:20 - 👍 10    🔁 2    💬 2    📌 0

These are the ML feeds I've pinned that now get mixed into my main feed:

- ML Feed: Trending bsky.app/profile/smcg...
- MLSky: bsky.app/profile/alex...
- GenAI Bluesky Network: bsky.app/profile/hell...
- Nuanced AI Commentary: bsky.app/profile/dame...

14.07.2025 08:20 - 👍 6    🔁 2    💬 0    📌 0

Nice! I'll have to check it out

13.07.2025 10:13 - 👍 0    🔁 0    💬 0    📌 0

I can recommend following: @simonwillison.net, @metr.org, @abeba.bsky.social, @mmitchell.bsky.social, @wang.social, @sashamtl.bsky.social, @justinhendrix.bsky.social, @bvlsingler.bsky.social

Some of the people I follow to keep up with that side of things. :)

13.07.2025 09:44 - 👍 7    🔁 1    💬 1    📌 0

Great list of folks/orgs!

13.07.2025 09:47 - 👍 2    🔁 0    💬 1    📌 0

I would love to see more discussion on ML/AI on bsky. Particularly more nuanced discussions of the societal impacts. If you tag in the relevant post, I'll be sure to repost it!

13.07.2025 09:17 - 👍 48    🔁 5    💬 6    📌 0
Preview
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling
Despite incredible progress in language models (LMs) in recent years, largely resulting from moving away from specialized models designed for specific tasks to general models based on powerful archite...

Still not a lot of ML talk on bsky (at least in my feed), hence paper Sunday: my two most interesting recent reads
- H Nets arxiv.org/abs/2507.07955
- Energy Based Transformers arxiv.org/abs/2507.02092

13.07.2025 06:14 - 👍 69    🔁 15    💬 5    📌 0
Preview
From a popular CEO to viral seafood boils: The story behind Red Lobster’s biz comeback
New offerings and marketing initiatives spark the ‘polish casual’ restaurant’s re-emergence after years of sinking fortunes.

I’ve loved everything about the Red Lobster turnaround:

• A young Nigerian immigrant CEO who previously ran PF Chang (which I love)
• Listened to customers and made swift changes
• Stars in ads, bringing real authenticity to the “we’re fixing it” vibe

10.07.2025 18:28 - 👍 63    🔁 8    💬 4    📌 1
Preview
35 Years for Privacy, Free Speech, and a Brighter Future
Through July 10, new monthly or annual Sustaining Donors get an EFF35 Challenge Coin! With your help, EFF is here to stay.

Reasons you should donate to EFF this month:
1. We're turning 35! Wish us a happy birthday 🥳
2. We've got cool new swag--our gift to you 👕
3. We're advocating for your rights to privacy and free speech online 👩‍💻

03.07.2025 19:00 - 👍 68    🔁 13    💬 1    📌 0

Anyone at #facct2025 want to go to the acropolis this afternoon?

26.06.2025 08:20 - 👍 4    🔁 1    💬 0    📌 0

Claude Code, meet your rival: Gemini CLI 🔥🚀

Google just unleashed Gemini 2.5 Pro at the command line: free, open-source (Apache 2.0), 1-million-token brain baked right into your terminal. Chat, code, search, script, all with zero tab fatigue.

26.06.2025 01:20 - 👍 24    🔁 4    💬 1    📌 0

The real Bear Grylls

26.06.2025 01:44 - 👍 157    🔁 17    💬 6    📌 2
Preview
Using AI Right Now: A Quick Guide
Which AIs to use, and how to use them

The two most common questions I get asked about AI are “which AI should I use” and “how do I start using AI?”

I wrote a short guide attempting to answer both questions. www.oneusefulthing.org/p/using-ai-r...

23.06.2025 16:27 - 👍 126    🔁 25    💬 9    📌 3

Current Polymarket probabilities:
-Khamenei out as Supreme Leader of Iran in 2025? 63%
-Will the Iranian regime fall in 2025? 29%
-Iranian coup attempt before July? 12%
-Iran Nuke in 2025? 17%
-Will Iran close the Strait of Hormuz in 2025? 52%
-US-Iran nuclear deal in 2025? 36%

22.06.2025 02:16 - 👍 28    🔁 4    💬 8    📌 3

Interesting piece. It's nice to see more political scientists explore this subject, but would argue that quite a few scholars in the field have written on the topic (even pre-gen AI). Sadly, their work has been embraced by industry and computer scientists rather than other peers in the field.

20.06.2025 09:47 - 👍 1    🔁 0    💬 0    📌 0

@williamis is following 20 prominent accounts