hardmaru's Avatar

hardmaru

@hardmaru.bsky.social

Co-Founder & CEO, Sakana AI ๐ŸŽ โ†’ @sakanaai.bsky.social https://sakana.ai/careers

4,247 Followers  |  655 Following  |  153 Posts  |  Joined: 17.11.2024
Posts Following

Posts by hardmaru (@hardmaru.bsky.social)

Instead of forcing models to hold everything in an active context window, we can use hypernetworks to instantly compile documents and tasks directly into the model's weights. A step towards giving language models durable memory and fast adaptation.

Blog: pub.sakana.ai/doc-to-lora/

27.02.2026 04:36 โ€” ๐Ÿ‘ 104    ๐Ÿ” 14    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 4
Preview
How competition is stifling AI breakthroughs Llion Jones cowrote "Attention Is All You Need," the seminal paper that introduced the transformer โ€” the architecture that launched the generative AI revolution. Now he warns that the industry that gr...

ใ€ŒHow Competition is Stifling AI Breakthroughsใ€

Sakana AIๅ…ฑๅŒๅ‰ตๆฅญ่€… Llion Jones ใฎTED AIใƒˆใƒผใ‚ฏใŒๅ…ฌ้–‹ใ•ใ‚Œใพใ—ใŸใ€‚็›ฎๆจ™ใ‚’ๅฎšใ‚ใ™ใŽใชใ„ใ‚ชใƒผใƒ—ใƒณใ‚จใƒณใƒ‰ใช็ ”็ฉถใŒใƒ–ใƒฌใƒผใ‚ฏใ‚นใƒซใƒผใ‚’็”Ÿใ‚€็†็”ฑใ€TransformerใฎๆˆๅŠŸใŒๆฅญ็•Œใซใ‚‚ใŸใ‚‰ใ—ใŸ็Šถๆณใ€ใใ‚Œใ‚’ไน—ใ‚Š่ถŠใˆใ‚‹ๆฌกใฎๆง‹ๆƒณใจๆˆๆžœใ‚’่ชžใ‚Šใพใ—ใŸใ€‚

www.ted.com/talks/llion_...

29.01.2026 02:39 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Our journey at Sakana AI is just getting started.

We are looking for people to help us pioneer the next generation of AIโ€”building from Japan to the world.

Join us: sakana.ai/careers

25.01.2026 04:56 โ€” ๐Ÿ‘ 29    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I founded Sakana AI after my time at Google, so it is incredibly meaningful to be able to partner with them now. It feels like a special connection to be working together again to advance the AI ecosystem in Japan.

sakana.ai/google#en

23.01.2026 13:04 โ€” ๐Ÿ‘ 48    ๐Ÿ” 2    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0
Preview
Sakana AI Sakana AIใ€Googleใจใฎๆˆฆ็•ฅ็š„ใƒ‘ใƒผใƒˆใƒŠใƒผใ‚ทใƒƒใƒ—็ท ็ตใ‚’็™บ่กจ

Our work on The AI Scientist and ALE-Agent has already shown the power of these models.

Now, we are scaling reliable AI in mission-critical sectors like finance and government to ensure the highest security and data sovereignty.

Full details: sakana.ai/google#en

23.01.2026 13:02 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

We are thrilled to announce a strategic partnership with Google!

Google is also making a financial investment in Sakana AI to strengthen this collaboration. We are combining Googleโ€™s world-class products like Gemini and Gemma with our agile R&D to accelerate automated scientific discovery.

23.01.2026 13:02 โ€” ๐Ÿ‘ 17    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
An Unofficial Guide to Prepare for a Research Position Application

Authors: Stefania Druga, Luke Darlow, and Llion Jones

Disclaimer: This guide is written by a few researchers at Sakana AI who have interviewed many candidates, and does not reflect the view of the entire organization. Each team may have their own preferences and styles for interviewing and finding the people that they can work closely with. This document, written by Stefania, Luke, and Llion, provides a glimpse into how some parts of our research org conduct interviews.

An Unofficial Guide to Prepare for a Research Position Application Authors: Stefania Druga, Luke Darlow, and Llion Jones Disclaimer: This guide is written by a few researchers at Sakana AI who have interviewed many candidates, and does not reflect the view of the entire organization. Each team may have their own preferences and styles for interviewing and finding the people that they can work closely with. This document, written by Stefania, Luke, and Llion, provides a glimpse into how some parts of our research org conduct interviews.

We just published an unofficial guide on what we look for when interviewing research candidates at Sakana AI.

Written by Stefania Druga, Luke Darlow, and Llion Jones.

The biggest differentiator? Understanding over implementation.

Read it: pub.sakana.ai/Unofficial_G...

20.01.2026 01:40 โ€” ๐Ÿ‘ 23    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
RePo: Language Models with Context Re-Positioning In-context learning is fundamental to modern Large Language Models (LLMs); however, prevailing architectures impose a rigid and fixed contextual structure by assigning linear or constant positional in...

RePo moves us toward models that intelligently curate their own working memory rather than passively accepting input order.

Read the full breakdown on our website:
pub.sakana.ai/repo/

Paper: arxiv.org/abs/2512.14391

19.01.2026 00:40 โ€” ๐Ÿ‘ 18    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Introducing RePo: Language Models with Context Re-Positioning

Standard LLMs force a rigid linear structure on context, treating physical proximity as relevance. Cognitive Load Theory suggests this is inefficientโ€”models waste capacity managing noise instead of reasoning.

arxiv.org/abs/2512.14391

19.01.2026 00:39 โ€” ๐Ÿ‘ 56    ๐Ÿ” 8    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 4
Post image

2026 is just getting started ๐Ÿš€โœจ

We are hiring. Join our team in Tokyo!

sakana.ai/careers

14.01.2026 13:23 โ€” ๐Ÿ‘ 13    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
ใ‚ตใ‚ซใƒŠAIใฎใƒ‡ใƒ“ใƒƒใƒ‰ใƒปใƒCEOใ€AIๅฐŽๅ…ฅใฏใ€Œ้›‡็”จไธๅฎ‰ๅฐใ•ใ„ๆญฃ็คพๅ“กๅˆถๅบฆใŒๅผทใฟใซใ€ AIใƒ•ใ‚กใƒผใ‚นใƒˆใ‚’ๆŽฒใ’ใ‚‹ไผๆฅญใŒๅข—ๅŠ ใ™ใ‚‹ไธญใ€็ตŒๅ–ถ่€…ใฏAIใฎใƒชใ‚นใ‚ฏใ‚’ๆญฃใ—ใ็†่งฃใ—ใ€้ฉๅˆ‡ใซๅฐŽๅ…ฅใ‚’้€ฒใ‚ใ‚‹ๅฟ…่ฆใŒใ‚ใ‚‹ใ€‚ๅ›ฝๅ†…ๆœ€ๅคง็ดšใฎใƒฆใƒ‹ใ‚ณใƒผใƒณใงใ€ไผๆฅญๅ‘ใ‘ใฎAIใ‚ฝใƒชใƒฅใƒผใ‚ทใƒงใƒณ้–‹็™บใ‚’่กŒใ†Sakana AI๏ผˆใ‚ตใ‚ซใƒŠAIใ€ๆฑไบฌใƒปๆธฏ๏ผ‰ใฎใƒ‡ใƒ“ใƒƒใƒ‰ใƒปใƒๆœ€้ซ˜็ตŒๅ–ถ่ฒฌไปป่€…๏ผˆCEO๏ผ‰ใซใ€ๆ—ฅๆœฌไผๆฅญใฎAIๅฐŽๅ…ฅใซใŠใ‘ใ‚‹่ชฒ้กŒใ‚’่žใ„ใŸใ€‚

AIๅฐŽๅ…ฅใฏใ€Œ้›‡็”จไธๅฎ‰ๅฐใ•ใ„ๆญฃ็คพๅ“กๅˆถๅบฆใŒๅผทใฟใซใ€

ๆ—ฅ็ตŒใƒ“ใ‚ธใƒใ‚นใซใฆใ€Sakana AI CEO @hardmaru.bsky.social
ใฎใ‚คใƒณใ‚ฟใƒ“ใƒฅใƒผใŒๅ…ฌ้–‹ใ•ใ‚Œใพใ—ใŸใ€‚ไผๆฅญใธใฎAIๅฎŸ่ฃ…ใŒๆœฌๆ ผๅŒ–ใ™ใ‚‹2026ๅนดใซใŠใ‘ใ‚‹็พ็Šถใจ่ชฒ้กŒใ€ใใ—ใฆๆ—ฅๆœฌไผๆฅญใฎ็ต„็น”ๆ–‡ๅŒ–ใŒAIๅฐŽๅ…ฅใซใจใฃใฆใƒใ‚ธใƒ†ใ‚ฃใƒ–ใซๅƒใๅฏ่ƒฝๆ€งใซใคใ„ใฆ่ชžใ‚Šใพใ—ใŸใ€‚

business.nikkei.com/atcl/gen/19/...

ใ€่จ˜ไบ‹ใฎใƒใ‚คใƒฉใ‚คใƒˆใ€‘๐Ÿงต

13.01.2026 08:46 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

Reminded me of my older NeurIPS 2021 paper, where we removed the positional encoding entirely, and by doing so, an agent can process an arbitrarily long list of noisy, sensory inputs, in an arbitrary order.

I even made a fun browser demo to play with the agent back then: attentionneuron.github.io

12.01.2026 05:51 โ€” ๐Ÿ‘ 29    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

Introducing DroPE: Extending Context by Dropping Positional Embeddings

We found embeddings like RoPE aid training but bottleneck long-sequence generalization. Our solutionโ€™s simple: treat them as a temporary training scaffold, not a permanent necessity.

arxiv.org/abs/2512.12167
pub.sakana.ai/DroPE

12.01.2026 04:07 โ€” ๐Ÿ‘ 118    ๐Ÿ” 22    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 7

One of my favorite findings: Positional embeddings are just training wheels. They help convergence but hurt long-context generalization.

We found that if you simply delete them after pretraining and recalibrate for <1% of the original budget, you unlock massive context windows. Smarter, not harder.

12.01.2026 04:12 โ€” ๐Ÿ‘ 220    ๐Ÿ” 32    ๐Ÿ’ฌ 8    ๐Ÿ“Œ 1
Post image

We are taking our technology far beyond competitive programming to unlock a new era of AI-driven discovery.

We are hiring. Join our team in Tokyo.

sakana.ai/careers/#sof...

10.01.2026 03:07 โ€” ๐Ÿ‘ 9    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

Weโ€™re hiring.

sakana.ai/careers/#sof...

10.01.2026 03:08 โ€” ๐Ÿ‘ 28    ๐Ÿ” 7    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

When agents compete for limited resources, intelligence reorganizes around survival, not elegance.

09.01.2026 09:26 โ€” ๐Ÿ‘ 19    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Survival of the fittest code!

Our paper explores LLMs driving an evolutionary arms race in Core War, where assembly programs fight each other. We task LLMs with evolving "Warriors" in a virtual machine, producing chaotic, self-modifying code dynamics.

Blog: sakana.ai/drq
Paper: pub.sakana.ai/drq/

08.01.2026 17:11 โ€” ๐Ÿ‘ 41    ๐Ÿ” 8    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2
Video thumbnail

Introducing Digital Red Queen (DRQ): Adversarial Program Evolution in Core War with LLMs.

In this work, we explore how LLMs can drive open-ended adversarial evolution of programs within the Core War environment.

Blog sakana.ai/drq
Website pub.sakana.ai/drq/
ArXiv arxiv.org/abs/2601.03335

Thread:

08.01.2026 17:00 โ€” ๐Ÿ‘ 32    ๐Ÿ” 7    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2

So proud of Team Sakana AI for pulling this off! We managed to get an agent to rank #1 in a difficult heuristic optimization contest. We leaned heavily into test-time inference using a mix of frontier models.

The agent spent $1,300 to autonomously discover an algorithm that beat the human baseline.

05.01.2026 16:01 โ€” ๐Ÿ‘ 36    ๐Ÿ” 4    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0
Post image

Our AI agent has achieved 1st place in a competitive optimization programming contest against over 800 human participants.

Blog: sakana.ai/ahc058

Thread:

05.01.2026 15:53 โ€” ๐Ÿ‘ 20    ๐Ÿ” 2    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2

Happy New Year! โ›ฉ๏ธ

01.01.2026 03:47 โ€” ๐Ÿ‘ 12    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Sakana AIโ€™s office looks like this.

28.12.2025 01:43 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Especially in such times, hackers and tinkerers tend to fare better at harnessing evolving technology with a high level of uncertainty and ambiguity, compared to traditional well-read professional types.

27.12.2025 01:09 โ€” ๐Ÿ‘ 16    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Post image

Software Engineering as a profession will continue to fundamentally change in 2026.

Humans will need to learn to co-adapt to this evolving โ€œalien technologyโ€ which comes with no real manual, and figure out how to operate it.

What a time to be alive โœจ

twitter.com/karpathy/sta...

27.12.2025 01:09 โ€” ๐Ÿ‘ 36    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Merry Christmas! ๐ŸŽ„

Sakanaโ€…AIใงใฏใ€ไบ‹ๆฅญ้–‹็™บใซ้–ขๅฟƒใŒใ‚ใ‚‹ๆ–นๅ‘ใ‘ใฎใ€Œใ‚ซใ‚ธใƒฅใ‚ขใƒซ้ข่ซ‡็ช“ๅฃใ€ใ‚’ใ‚ชใƒผใƒ—ใƒณใ—ใพใ—ใŸ๏ผ้‡‘่žใƒป้˜ฒ่ก›ใƒปใ‚คใƒณใƒ†ใƒชใ‚ธใ‚งใƒณใ‚น้ ˜ๅŸŸใงใ€็งใŸใกใŒใฉใฎใ‚ˆใ†ใช้–‹็™บใซๆŒ‘ใ‚“ใงใ„ใ‚‹ใฎใ‹ใ€‚ไธญใฎไบบใŒ็›ดๆŽฅใŠ่ฉฑใ—ใ—ใพใ™ใ€‚

ๅ‹Ÿ้›†่ท็จฎ๏ผšโ€…ใ‚จใƒณใ‚ธใƒ‹ใ‚ขใ€Projectโ€…Managerใ€Productโ€…Managerโ€…ๅ†…ๅฎน๏ผšโ€…ไบ‹ๆฅญๆˆฆ็•ฅใ€้–‹็™บใฎ่ฃๅดใ€ใƒใƒผใƒ ใฎ้›ฐๅ›ฒๆฐ—ใชใฉ

ๆœ€ๅ…ˆ็ซฏใฎAI้–‹็™บใ‚’็คพไผšๅฎŸ่ฃ…ใ™ใ‚‹ใƒ—ใƒญใ‚ปใ‚นใซ่ˆˆๅ‘ณใŒใ‚ใ‚‹ๆ–นใ€ใœใฒใŠๆฐ—่ปฝใซใ”ๅฟœๅ‹Ÿใใ ใ•ใ„๏ผ

๐Ÿ‘‰ ๅฟœๅ‹Ÿใƒ•ใ‚ฉใƒผใƒ ใฏใ“ใกใ‚‰: forms.gle/sW5wz23SLSvN...

๐Ÿ‘‰ ๅ‹Ÿ้›†่ฆ้ …: sakana.ai/careers/

25.12.2025 07:47 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
โ€œBut perhaps this can be resolved by the realization that while cleverness and intelligence are somewhat correlated traits for humans, they are much more decoupled for AI tools (which are often optimized for cleverness), and viewing the current generation of such tools primarily as a stochastic generator of sometimes clever - and often useful - thoughts and outputs may be a more productive perspective when trying to use them to solve difficult problems.โ€

โ€œBut perhaps this can be resolved by the realization that while cleverness and intelligence are somewhat correlated traits for humans, they are much more decoupled for AI tools (which are often optimized for cleverness), and viewing the current generation of such tools primarily as a stochastic generator of sometimes clever - and often useful - thoughts and outputs may be a more productive perspective when trying to use them to solve difficult problems.โ€

I doubt that anything resembling genuine AGI is within reach of current AI toolsโ€”Terence Tao

mathstodon.xyz/@tao/1157223...

22.12.2025 07:44 โ€” ๐Ÿ‘ 91    ๐Ÿ” 12    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 4
Preview
Robot Vacuum Roomba Maker Files for Bankruptcy After 35 Years iRobot Corp., the company that revolutionized robot vacuum cleaners in the early 2000s with its Roomba model, filed for bankruptcy and proposed handing over control to its main Chinese supplier.

โ€œiRobot Corp., the company that revolutionized robot vacuum cleaners in the early 2000s with its Roomba model, filed for bankruptcy and proposed handing over control to its main Chinese supplier.โ€ ๐Ÿ˜ฅ
www.bloomberg.com/news/article...

15.12.2025 12:26 โ€” ๐Ÿ‘ 17    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 5
Why AGI Will Not Happen โ€” Tim Dettmers If you are reading this, you probably have strong opinions about AGI, superintelligence, and the future of AI. Maybe you believe we are on the cusp of a transformative breakthrough. Maybe you are skep...

โ€œThe US follows the idea that there will be one winner who takes it all. Even coming short of AGI, if you have the best model, almost all people will use your model and not the competitionโ€™s model. The idea is: develop the Biggest, Baddest model and people will come.โ€
timdettmers.com/2025/12/10/w...

14.12.2025 23:20 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
ใ€Œไบบใฎๆˆ้•ทใ‚’ๅŠ ้€Ÿใ•ใ›ใ‚‹ใƒ‘ใƒผใƒˆใƒŠใƒผใ€ใใ‚Œใ“ใใŒ็”ŸๆˆAIใซๆฑ‚ใ‚ใŸใ„ไพกๅ€ค๏ฝœTHE NEXT X ๅค‰้ฉใฎๆ‰‰ - ๆ—ฅ็ตŒใƒ“ใ‚ธใƒใ‚น้›ปๅญ็‰ˆSpecial ็”ŸๆˆAI ๏ผˆไบบๅทฅ็Ÿฅ่ƒฝ๏ผ‰ใ‚’ๆดป็”จใ—ใ€ๆฅญๅ‹™ ๅŠน็އๅŒ–ใฎๅŸŸใ‚’่ถ…ใˆใ‚‹ใ‚ˆใ†ใชๆˆๆžœใ‚’ไธŠใ’ใŸไพ‹ใฏๆฑบใ—ใฆๅคšใใชใ„ใ€‚ใฉใฎใ‚ˆใ†ใซๆดป็”จใ™ใ‚Œใฐใ€ๆ–ฐใŸใชไพกๅ€คใ‚’็”Ÿใฟๅ‡บใ™ใ‚ˆใ†ใชๆˆๆžœใ‚’ๅพ—ใ‚‰ใ‚Œใ‚‹ใฎใ‹โ€•โ€•ใ€‚

ๅ…จๆ–‡ใฏใ“ใกใ‚‰๏ผš
special.nikkeibp.co.jp/atclh/ONB/25...

14.12.2025 23:19 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0