We're pleased to announce that Sakana AI is co-hosting the โAI for Science: Algorithms to Atomsโ social event and panel discussion during #NeurIPS2025 with Yann LeCun, Bill Dally, Anima Anandkumar, and Max Welling!
If you'll be at NeurIPS San Diego, hereโs the link to join: luma.com/AI-for-Scien...
24.11.2025 02:02 โ ๐ 8 ๐ 2 ๐ฌ 0 ๐ 0
ๆฒ็ธใง้ๅฌไธญใฎICONIP 2025ใฎๅ
ฌ้ใใฉใผใฉใ ใซใSakana AI Research Scientistใฎๅฑฑ็ฐ็ฅๅคชๆใ็ปๅฃใใพใใ
ใใฎไปใALife Institute ๅฒก็่ตทๅ
็ใฎๅบ่ชฟ่ฌๆผใใใใซ่จ่ซใใใใพใใใใฒใ่ฆงใใ ใใใ
11/24 (็ฅ) 13:00-
็พๅฐ๏ผonline๏ผ็กๆใป่ฆ็ป้ฒ๏ผ
iconip2025.apnns.org/forum/
22.11.2025 11:59 โ ๐ 3 ๐ 1 ๐ฌ 0 ๐ 0
Excited to announce Sakana AIโs Series B! ๐
sakana.ai/series-b
From day one, Sakana AI has done things differently. Our research has always focused on developing efficient AI technology sustainably, driven by the belief that resource constraintsโnot limitless computeโare key to true innovation.
17.11.2025 00:03 โ ๐ 26 ๐ 2 ๐ฌ 3 ๐ 1
Our company is dedicated to deploying our R&D into Japanโs key business and public sectors. This funding will accelerate our mission: to develop AI sustainably and implement technology that truly benefits Japan.
17.11.2025 00:01 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Weโre proud of our deep work with enterprise clients like MUFG and Daiwa Securities Group in finance, and are now expanding this focus to include other key areas like defense and the industrial sector.
17.11.2025 00:01 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 1
Over the past year, weโve built a healthy, growing enterprise AI business where we have partnered with some of the largest enterprises in Japan, focusing on AI applications that deliver real, practical benefits to our clients.
17.11.2025 00:01 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
This principle extends directly to our business model, which we built with that same focus on sustainability and profitability.
17.11.2025 00:00 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
At present, we are seeing a record amount of capital pouring into AI compute at a rate which may not be sustainable, funding AI businesses without a clear path to profitability, and consuming unprecedented levels of energy, to develop AI models under the assumption of near limitless resources.
But is this really the future that we want to build? As an AI R&D company in Tokyo, we also question whether this is the right approach for Japan, a nation with limited resources, to build its Sovereign AI.
We believe that intelligent life has arisen not from an abundance of resources but rather from the lack of it. Nature ultimately selects systems that are able to do more with less. It is debatable whether such constraints are a requirement for intelligence to emerge, but it is undeniable that our own intelligence is a result of resource constraints.
Sakana AI has always done things differently. Since the beginning, our research has focused on developing efficient AI technology sustainably. For instance, rather than reinventing the wheel by training yet another large foundation model from scratch, we instead pioneered ways of using evolution to combine existing open-source models, using tree search to combine closed models, getting models to self-improve, to push the frontier of AI capabilities.
Last year, we also pioneered the use of LLM agents to automate AI science, enabling more efficient algorithms to be discovered by AI itself. We developed energy efficient language models that work on edge devices, and we are also working on entirely new AI architectures which may carry us into a vastly more efficient AI paradigm.
We also created our business model with sustainability and profitability in mind. This year, we built a healthy and growing enterprise AI business, where we have partnered with some of the largest enterprises in Japan to deploy AI-enabled business applications that can truly bring return on investment to our clients with our technology.
Weโre thrilled to announce Sakana AI has raised ยฅ20B in our Series B! ๐
From day one, weโve taken a different path. Our research has always focused on developing efficient AI technology sustainably, driven by the belief that resource constraintsโnot limitless computeโare key to true innovation.
17.11.2025 00:00 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Announcing our Series B ๐
sakana.ai/series-b
16.11.2025 23:59 โ ๐ 14 ๐ 2 ๐ฌ 1 ๐ 1
Great to see Tarin Clanuwat featured for her amazing work. She has a deep love for Japanese classical literature and is using AI to build bridges to that past for everyone.
www.tokyoupdates.metro.tokyo.lg.jp/post-1670/
Weโre lucky to have her driving this at Sakana AI.
14.11.2025 03:55 โ ๐ 6 ๐ 2 ๐ฌ 0 ๐ 0
ใๆฅๆฌใฎๆๅใ็คพไผ็ๆ่ใ็็ขบใซ็่งฃใงใใAIใขใใซใๆง็ฏใใใใ
ๆฑไบฌ้ฝใฎๅ
ฌๅผใกใใฃใขใTOKYO UPDATESใใซใSakana AIใชใตใผใใตใคใจใณใใฃในใใฎใซใฉใผใใฏใใใปใฟใชใณใฎใคใณใฟใใฅใผใๆฒ่ผใใใพใใใ
www.tokyoupdates.metro.tokyo.lg.jp/post-1670/
ๆฅๆฌใฎ่ฒด้ใชๆๅ่ณๆบใๆดป็จใใใใจใงใๆฅๆฌใซๆ นใใใAIๆ่กใฎๅฎ็พใ็ฎๆใๅใ็ตใฟใซใคใใฆใใใฒใไธ่ชญใใ ใใใ
14.11.2025 03:40 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 1
From GRPO to GPT-5: Sudoku Variants
The benchmark continues to reveal gaps between AI computation and human-like reasoning.
๐ Blogpost: pub.sakana.ai/sudoku-gpt5/
๐ Leaderboard: pub.sakana.ai/sudoku/
๐ Report: arxiv.org/abs/2505.16135
๐ป GitHub: github.com/SakanaAI/Sudoku-Bench
11.11.2025 08:07 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
Our GRPO and "Thought Cloning" experiments (learning from expert solvers) show current methods struggle with spatial reasoning and creative insights humans use naturally.
11.11.2025 08:07 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Unlike Chess or Go, these puzzles require understanding novel rules through meta-reasoning, then maintaining consistency over long reasoning chains.
11.11.2025 08:06 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
GPT-5 on Sudoku-Bench ๐งฉ
GPT-5 now leads our Sudoku-Bench leaderboard with 33% solve rate, ~2x the previous best, and is the first LLM to solve a 9x9 modern Sudoku.
Still, 67% of puzzles remain unsolved.
Read more about our update here:
๐ Blogpost โ pub.sakana.ai/sudoku-gpt5/
๐งต Thread ๐
11.11.2025 08:04 โ ๐ 18 ๐ 2 ๐ฌ 1 ๐ 0
Excited to release our new work: Petri Dish Neural Cellular Automata!
pub.sakana.ai/pdnca
We investigate how multi-agent NCAs can develop into artificial life ๐ฆ exhibiting complex, emergent behaviors like cyclic dynamics, territorial defense, and spontaneous cooperation.
05.11.2025 00:47 โ ๐ 33 ๐ 10 ๐ฌ 0 ๐ 0
Petri Dish Neural Cellular Automata (PD-NCA) is a new ALife substrate that consists of a differentiable world where multiple NCA learn to self-replicate and grow via ongoing gradient descent. Every individual is constantly trying to grow, all the while learning to adapt and out-compete its neighbors
05.11.2025 00:28 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Introducing Petri Dish Neural Cellular Automata (PD-NCA)
pub.sakana.ai/pdnca/
In this work we explore the role of continual adaptation in artificial life, where the cellular automata in our system do not rely on a fixed set of parameters, but rather learn continuously during the simulation itself.
05.11.2025 00:26 โ ๐ 21 ๐ 3 ๐ฌ 1 ๐ 1
How the 'Attention is all you need' paper was born from freedom, not pressure
To underscore his point, Jones described the conditions that allowed transformers to emerge in the first place โ a stark contrast to today's environment. The project, he said, was "very organic, bottom up," born from "talking over lunch or scrawling randomly on the whiteboard in the office."
Critically, "we didn't actually have a good idea, we had the freedom to actually spend time and go and work on it, and even more importantly, we didn't have any pressure that was coming down from management," Jones recounted. "No pressure to work on any particular project, publish a number of papers to push a certain metric up."
That freedom, Jones suggested, is largely absent today. Even researchers recruited for astronomical salaries โ "literally a million dollars a year, in some cases" โ may not feel empowered to take risks. "Do you think that when they start their new position they feel empowered to try their wild ideas and more speculative ideas, or do they feel immense pressure to prove their worth and once again, go for the low hanging fruit?" he asked.
How the โAttention is all you needโ paper was born from freedom, not pressure:
24.10.2025 13:34 โ ๐ 8 ๐ 0 ๐ฌ 1 ๐ 0
Why one AI lab is betting that research freedom beats million-dollar salaries
Jones's proposed solution is deliberately provocative: Turn up the "explore dial" and openly share findings, even at competitive cost. He acknowledged the irony of his position. "It may sound a little controversial to hear one of the Transformers authors stand on stage and tell you that he's absolutely sick of them, but it's kind of fair enough, right? I've been working on them longer than anyone, with the possible exception of seven people."
At Sakana AI, Jones said he's attempting to recreate that pre-transformer environment, with nature-inspired research and minimal pressure to chase publications or compete directly with rivals. He offered researchers a mantra from engineer Brian Cheung: "You should only do the research that wouldn't happen if you weren't doing it."
One example is Sakana's "continuous thought machine," which incorporates brain-like synchronization into neural networks. An employee who pitched the idea told Jones he would have faced skepticism and pressure not to waste time at previous employers or academic positions. At Sakana, Jones gave him a week to explore. The project became successful enough to be spotlighted at NeurIPS, a major AI conference.
Jones even suggested that freedom beats compensation in recruiting. "It's a really, really good way of getting talent," he said of the exploratory environment. "Think about it, talented, intelligent people, ambitious people, will naturally seek out this kind of environment."
Sakana AIโs CTO (Llion Jones) says heโs โabsolutely sickโ of transformers, the tech that powers every major AI model
โYou should only do the research that wouldnโt happen if you werenโt doing it.โ (Brian Cheung) ๐ง ๐ก
venturebeat.com/ai/sakana-ai...
23.10.2025 17:30 โ ๐ 15 ๐ 0 ๐ฌ 1 ๐ 0
Use a LLM to create a new constructed language (ConLang) like Klingon, Vulcan, etc.. where an LLM designs phonology, builds grammar, generates a lexicon, creates orthography, and even writes a mini grammar book.
IASC: Interactive Agentic System for ConLangs
11.10.2025 01:55 โ ๐ 35 ๐ 8 ๐ฌ 3 ๐ 0
IASC: Interactive Agentic System for ConLangs
We present a system that uses LLMs as a tool in the development of Constructed Languages. The system is modular in that one first creates a target phonology for the language using an agentic approach ...
Thereโs a fairly wide gulf in capabilities both among different LLMs and different linguistic specifications, with it being notably easier for systems to deal with settings that are commoner cross-linguistically than those that are rarer.
PDF arxiv.org/abs/2510.07591
Code github.com/SakanaAI/IASC
10.10.2025 04:58 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
Our goals with IASC:
1/ We hope that these tools will be fun to use for creating artificially constructed languages.
2/ We are interested in exploring what LLMs โknowโ about languageโnot what they know about any particular language, but how much they know about and understand linguistic concepts.
10.10.2025 04:56 โ ๐ 4 ๐ 0 ๐ฌ 1 ๐ 0
IASC: Interactive Agentic System for ConLangs
We present a system that uses LLMs as a tool in the development of Constructed Languages. The system is modular in that one first creates a target phonology for the language using an agentic approach ...
IASC: Interactive Agentic System for ConLangs
arxiv.org/abs/2510.07591
If youโre a fan of science fiction or fantasy, youโve probably heard of made-up languages like Elvish from โThe Lord of the Ringsโ or Klingon from โStar Trek.โ
Can LLM agents create new artificial languages?
10.10.2025 04:54 โ ๐ 6 ๐ 1 ๐ฌ 1 ๐ 2
GitHub - SakanaAI/ShinkaEvolve
Contribute to SakanaAI/ShinkaEvolve development by creating an account on GitHub.
By making ShinkaEvolve open-source, our goal is to democratize access to advanced discovery tools. We envision it as a companion to help scientists and engineers, building efficient, nature-inspired systems to unlock the future of AI research.
GitHub Project: github.com/SakanaAI/Shi...
25.09.2025 05:59 โ ๐ 6 ๐ 0 ๐ฌ 0 ๐ 0
ShinkaEvolve's efficiency comes from three key innovations:
1) Adaptive parent sampling to balance exploration and exploitation.
2) Novelty-based rejection filtering to avoid redundant work.
3) A bandit-based LLM ensemble that dynamically picks the best model for the job.
25.09.2025 05:58 โ ๐ 3 ๐ 0 ๐ฌ 1 ๐ 0
3/ LLM Training: It discovered a novel load balancing loss for MoE models, improving model performance and perplexity.
25.09.2025 05:58 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
2/ Competitive Programming: On ALE-Bench, it improved an existing agent's solution, turning a 5th place result into a 2nd place leaderboard rank for one task.
25.09.2025 05:58 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0