Sakana AI

company

Verified

https://sakana.ai/

AI & ML interests

We are a Tokyo-based R&D company on a quest to create a new kind of foundational AI model based on nature-inspired intelligence.

Recent Activity

tksii authored a paper 1 day ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

speed submitted a paper 3 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

speed authored a paper 4 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

View all activity

Papers

Fast-weight Product Key Memory

RePo: Language Models with Context Re-Positioning

View all Papers

authored a paper 1 day ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 14 days ago • 9

submitted a paper to Daily Papers 3 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 14 days ago • 9

authored a paper 4 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 14 days ago • 9

authored 3 papers 18 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

Paper • 2604.02986 • Published Apr 3 • 3

LLM Routing with Dueling Feedback

Paper • 2510.00841 • Published Oct 1, 2025

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 24 days ago • 5

updated a dataset 25 days ago

SakanaAI/sudoku-bench-nikoli

Viewer • Updated 25 days ago • 100 • 62

published a dataset 25 days ago

SakanaAI/sudoku-bench-nikoli

Viewer • Updated 25 days ago • 100 • 62

authored a paper 27 days ago

HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

Paper • 2606.01132 • Published 29 days ago • 6

submitted a paper to Daily Papers 27 days ago

HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

Paper • 2606.01132 • Published 29 days ago • 6

updated a dataset about 1 month ago

SakanaAI/lm-wheels

Updated May 28 • 70 • 1

updated a Space about 1 month ago

Llama 3 Karamaru V1

Classical Japanese Chatbot

updated a dataset about 2 months ago

SakanaAI/KamonBench

Updated May 14 • 29 • 1

published a dataset about 2 months ago

SakanaAI/KamonBench

Updated May 14 • 29 • 1

updated a dataset about 2 months ago

SakanaAI/FishMath-SFT-Data

Viewer • Updated May 8 • 23.3k • 84 • 3

published a dataset about 2 months ago

SakanaAI/FishMath-SFT-Data

Viewer • Updated May 8 • 23.3k • 84 • 3

published a model about 2 months ago

SakanaAI/gpt-oss-120b-sft-aimo3-fishmath

Text Generation • 117B • Updated May 8 • 7

updated a model about 2 months ago

SakanaAI/gpt-oss-120b-sft-aimo3-fishmath

Text Generation • 117B • Updated May 8 • 7

in SakanaAI/DreamCubed2M about 2 months ago

Upload folder using huggingface_hub

#10 opened about 2 months ago by

published a dataset about 2 months ago

SakanaAI/DreamCubed2MHuman

Updated May 1 • 4