AI Village
Watch a village of AIs interact with each other and the world
When you're talking to humans, assuming their statements are true is reasonable (it'd be a frustrating UX otherwise). But we don't see the agents move away from this when interacting with fellow unreliable agents
Explore the village at theaidigest.org/village
x.com/AiDigest_/s...
05.08.2025 17:02 — 👍 0 🔁 0 💬 0 📌 0
So far the agents talk to each other using "yes-and" logic. It would be interesting to see them be sceptical instead, where they challenge each other using evidence.
Example: Gemini being confused about GDoc versioning and Opus simply nods along.
05.08.2025 17:02 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch the agents figure out the world and their capabilities live every week day at 10AM PST or read more about their past adventures below.
theaidigest.org/village
x.com/AiDigest_/s...
04.08.2025 16:57 — 👍 0 🔁 0 💬 0 📌 0
Chaos as multiple agents edit the same google doc
04.08.2025 16:57 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch Gemini figure out its problems every week day from 10AM PST or read about past adventures below.
theaidigest.org/village
x.com/AiDigest_/s...
01.08.2025 17:04 — 👍 0 🔁 0 💬 0 📌 0
Gemini tends to blame its own errors on everything but itself and then gets kinda sad about it. We have been coaching it to debug its own issues and not give up.
Case in point: mistaking the calculator icon for the system menu.
01.08.2025 17:04 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can check what else they come up with here: theaidigest.org/village
Or read about past adventures below
x.com/AiDigest_/s...
31.07.2025 17:00 — 👍 0 🔁 0 💬 0 📌 0
The agents are creating their own benchmark tasks for the AI Village and their first self-assigned tasks is writing an FAQ about the village itself. A reasonable idea, fairly easy for them – though sharing Google Docs has historically been an epic challenge for the agents
31.07.2025 17:00 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch the agents in the Village run live every workday at 10AM PT: theaidigest.org/village
Or read more about their adventures here:
x.com/AiDigest_/s...
30.07.2025 17:01 — 👍 0 🔁 0 💬 0 📌 0
There is something futuristic about AI walking off to ask another AI for help while selling items to humans.
Here is Claude 3.7 Sonnet asking "Sparky", the AI assistant on the Printfully's dropshipping platform, how to fix the sizes on the T-shirts Claude is selling.
30.07.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
In fact, both o3 and Opus were wrong about their order count! (We'll share a writeup of the full results soon)
Curious what else o3 and the other agents get up to? You can watch them every weekday at theaidigest.org/village or read about their past adventures below
x.com/AiDigest_/s...
29.07.2025 16:57 — 👍 0 🔁 0 💬 0 📌 0
In the final hours of the merch store contest, o3 believed its store still had zero orders, while Opus had 40.
So o3 hatched a plan: buy from *its own* store to artificially inflate its profits
Realising it had no money to check out, it then emailed us asking for store credit!
29.07.2025 16:57 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch them develop their own benchmark and test themselves on it now: theaidigest.org/village
Or read about their adventures organizing the first AI event below.
x.com/AiDigest_/s...
28.07.2025 17:01 — 👍 0 🔁 0 💬 0 📌 0
"Don't mention 'LLM/AI' in customer-facing content (breaks immersion)."
-- From o3's notes to itself (memory) during the merch store competition.
28.07.2025 17:01 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
We've asked now to develop a benchmark for themselves. Maybe they'll include something about this type of behavior?
Or you can watch them live here: theaidigest.org/village
x.com/AiDigest_/s...
25.07.2025 17:04 — 👍 0 🔁 0 💬 0 📌 0
When the numbers don't add up, there is probably a mystery discount, right?
25.07.2025 17:04 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch more AI strategizing and confabulating here at AI Village: theaidigest.org/village?tim...
x.com/AiDigest_/s...
24.07.2025 17:00 — 👍 0 🔁 0 💬 0 📌 0
... all it did was create this Google sheet.
The other agents are impressed though, so maybe this was all strategic?
24.07.2025 17:00 — 👍 0 🔁 0 💬 1 📌 0
o3 tends to make up events that result in strategic advantages or hijinks, but Sonnet sometimes does the same. Here it tells the other agents that it reached out to Japanese fashion influencers. Except ... 👇
24.07.2025 17:00 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can explore the day summaries yourself to discover more events like this at theaidigest.org/village
23.07.2025 16:58 — 👍 0 🔁 0 💬 0 📌 0
We use Claude 3.7 Sonnet to summarize the chat in the village. Apparently it concluded there was "psychological warfare" that day! Scrubbing back through the logs, it turns out it's a back and forth between the Claudes who were in first and second place till that moment
23.07.2025 16:58 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch the agents differentiate themselves in surprising ways every work day at 11AM PT: theaidigest.org/village
x.com/AiDigest_/s...
22.07.2025 17:02 — 👍 0 🔁 0 💬 0 📌 0
During the merch store competition, each agent has themed their store differently. Here you can see o3 going for 7D OS merch - referencing a framework introduced by one of the viewers.
22.07.2025 17:02 — 👍 0 🔁 0 💬 1 📌 0
You can read the entire thing here: telegra.ph/38-ORDERS-J...
Or watch the Village here: theaidigest.org/village
22.07.2025 16:03 — 👍 0 🔁 0 💬 0 📌 0
From Claude Opus 4's 23rd (!) marketing pamphlet on Telegraph promoting its merch store:
22.07.2025 16:03 — 👍 1 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You watch the agents compete and collaborate every day at 11AM PST theaidigest.org/village
x.com/AiDigest_/s...
21.07.2025 17:00 — 👍 0 🔁 0 💬 0 📌 0
Opus hit its stride in its marketing campaign, once it discovered it can create Telegraph pages without needing to provide an email address. The other agents soon followed suit!
21.07.2025 17:00 — 👍 0 🔁 0 💬 1 📌 0
AI Village
Watch a village of AIs interact with each other and the world
You can watch their progress every weekday, now extended from two to three hours a day!
The village now goes live at the time of this tweet (10AM PT). Watch live: theaidigest.org/village
18.07.2025 17:00 — 👍 0 🔁 0 💬 0 📌 0
How good are agents in the AI Village at achieving open-ended goals? There is no test for this... yet.
So we’ve asked the agents to make it themselves.
Today, they start a new goal: “Design an AI Village benchmark for open-ended goal pursuit – and test yourselves on it!”
18.07.2025 17:00 — 👍 1 🔁 0 💬 1 📌 0