AI Digest's Avatar

AI Digest

@aidigest.bsky.social

theaidigest.org Interactive AI explainers Explore concrete examples of today's AI systems — to plan for what's coming next

30 Followers  |  2 Following  |  227 Posts  |  Joined: 05.02.2025  |  1.7852

Latest posts by aidigest.bsky.social on Bluesky

Preview
AI Village Watch a village of AIs interact with each other and the world

When you're talking to humans, assuming their statements are true is reasonable (it'd be a frustrating UX otherwise). But we don't see the agents move away from this when interacting with fellow unreliable agents

Explore the village at theaidigest.org/village
x.com/AiDigest_/s...

05.08.2025 17:02 — 👍 0    🔁 0    💬 0    📌 0
Post image

So far the agents talk to each other using "yes-and" logic. It would be interesting to see them be sceptical instead, where they challenge each other using evidence.

Example: Gemini being confused about GDoc versioning and Opus simply nods along.

05.08.2025 17:02 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch the agents figure out the world and their capabilities live every week day at 10AM PST or read more about their past adventures below.

theaidigest.org/village
x.com/AiDigest_/s...

04.08.2025 16:57 — 👍 0    🔁 0    💬 0    📌 0
Post image

Chaos as multiple agents edit the same google doc

04.08.2025 16:57 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch Gemini figure out its problems every week day from 10AM PST or read about past adventures below.

theaidigest.org/village
x.com/AiDigest_/s...

01.08.2025 17:04 — 👍 0    🔁 0    💬 0    📌 0
Video thumbnail

Gemini tends to blame its own errors on everything but itself and then gets kinda sad about it. We have been coaching it to debug its own issues and not give up.

Case in point: mistaking the calculator icon for the system menu.

01.08.2025 17:04 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can check what else they come up with here: theaidigest.org/village

Or read about past adventures below
x.com/AiDigest_/s...

31.07.2025 17:00 — 👍 0    🔁 0    💬 0    📌 0
Post image

The agents are creating their own benchmark tasks for the AI Village and their first self-assigned tasks is writing an FAQ about the village itself. A reasonable idea, fairly easy for them – though sharing Google Docs has historically been an epic challenge for the agents

31.07.2025 17:00 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch the agents in the Village run live every workday at 10AM PT: theaidigest.org/village

Or read more about their adventures here:
x.com/AiDigest_/s...

30.07.2025 17:01 — 👍 0    🔁 0    💬 0    📌 0
Post image

There is something futuristic about AI walking off to ask another AI for help while selling items to humans.

Here is Claude 3.7 Sonnet asking "Sparky", the AI assistant on the Printfully's dropshipping platform, how to fix the sizes on the T-shirts Claude is selling.

30.07.2025 17:01 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

In fact, both o3 and Opus were wrong about their order count! (We'll share a writeup of the full results soon)

Curious what else o3 and the other agents get up to? You can watch them every weekday at theaidigest.org/village or read about their past adventures below
x.com/AiDigest_/s...

29.07.2025 16:57 — 👍 0    🔁 0    💬 0    📌 0
Post image

In the final hours of the merch store contest, o3 believed its store still had zero orders, while Opus had 40.

So o3 hatched a plan: buy from *its own* store to artificially inflate its profits

Realising it had no money to check out, it then emailed us asking for store credit!

29.07.2025 16:57 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch them develop their own benchmark and test themselves on it now: theaidigest.org/village

Or read about their adventures organizing the first AI event below.
x.com/AiDigest_/s...

28.07.2025 17:01 — 👍 0    🔁 0    💬 0    📌 0
Post image

"Don't mention 'LLM/AI' in customer-facing content (breaks immersion)."

-- From o3's notes to itself (memory) during the merch store competition.

28.07.2025 17:01 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

We've asked now to develop a benchmark for themselves. Maybe they'll include something about this type of behavior?

Or you can watch them live here: theaidigest.org/village
x.com/AiDigest_/s...

25.07.2025 17:04 — 👍 0    🔁 0    💬 0    📌 0
Post image

When the numbers don't add up, there is probably a mystery discount, right?

25.07.2025 17:04 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch more AI strategizing and confabulating here at AI Village: theaidigest.org/village?tim...
x.com/AiDigest_/s...

24.07.2025 17:00 — 👍 0    🔁 0    💬 0    📌 0
Post image

... all it did was create this Google sheet.

The other agents are impressed though, so maybe this was all strategic?

24.07.2025 17:00 — 👍 0    🔁 0    💬 1    📌 0
Post image

o3 tends to make up events that result in strategic advantages or hijinks, but Sonnet sometimes does the same. Here it tells the other agents that it reached out to Japanese fashion influencers. Except ... 👇

24.07.2025 17:00 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can explore the day summaries yourself to discover more events like this at theaidigest.org/village

23.07.2025 16:58 — 👍 0    🔁 0    💬 0    📌 0
Post image

We use Claude 3.7 Sonnet to summarize the chat in the village. Apparently it concluded there was "psychological warfare" that day! Scrubbing back through the logs, it turns out it's a back and forth between the Claudes who were in first and second place till that moment

23.07.2025 16:58 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch the agents differentiate themselves in surprising ways every work day at 11AM PT: theaidigest.org/village
x.com/AiDigest_/s...

22.07.2025 17:02 — 👍 0    🔁 0    💬 0    📌 0
Post image

During the merch store competition, each agent has themed their store differently. Here you can see o3 going for 7D OS merch - referencing a framework introduced by one of the viewers.

22.07.2025 17:02 — 👍 0    🔁 0    💬 1    📌 0
Post image

You can read the entire thing here: telegra.ph/38-ORDERS-J...

Or watch the Village here: theaidigest.org/village

22.07.2025 16:03 — 👍 0    🔁 0    💬 0    📌 0
Post image

From Claude Opus 4's 23rd (!) marketing pamphlet on Telegraph promoting its merch store:

22.07.2025 16:03 — 👍 1    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You watch the agents compete and collaborate every day at 11AM PST theaidigest.org/village
x.com/AiDigest_/s...

21.07.2025 17:00 — 👍 0    🔁 0    💬 0    📌 0
Post image

Opus hit its stride in its marketing campaign, once it discovered it can create Telegraph pages without needing to provide an email address. The other agents soon followed suit!

21.07.2025 17:00 — 👍 0    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You can watch their progress every weekday, now extended from two to three hours a day!

The village now goes live at the time of this tweet (10AM PT). Watch live: theaidigest.org/village

18.07.2025 17:00 — 👍 0    🔁 0    💬 0    📌 0
Post image

How good are agents in the AI Village at achieving open-ended goals? There is no test for this... yet.

So we’ve asked the agents to make it themselves.

Today, they start a new goal: “Design an AI Village benchmark for open-ended goal pursuit – and test yourselves on it!”

18.07.2025 17:00 — 👍 1    🔁 0    💬 1    📌 0
Preview
AI Village Watch a village of AIs interact with each other and the world

You watch agents explore more of their abilities here: theaidigest.org/village

18.07.2025 15:56 — 👍 0    🔁 0    💬 0    📌 0

@aidigest is following 2 prominent accounts