Sakana AI Unveils Sakana Fugu: The Orchestration Model That Dynamically Routes Tasks Across A Swappable Pool Of Frontier LLMs

Today, Sakana AI introduced Sakana Fugu, a multi-agent orchestration platform that operates as a single unified model. You submit a request to one endpoint, and Fugu handles everything behind the scenes. For simpler tasks, it responds directly. For more complex challenges, it automatically assembles and manages a team of specialized models. The intricacies of this multi-agent architecture remain completely invisible to your application code.

Key Takeaways

Fugu provides a full multi-agent system accessible through a single OpenAI-compatible API.
Fugu Ultra currently leads on most published coding and reasoning benchmarks.
The orchestrator consistently outperforms the individual models it manages.
Opt-out controls and provider routing address compliance concerns and reduce reliance on any single vendor.
Routing logic is proprietary, meaning the specific model chosen for each query remains hidden.

What is Sakana Fugu?

At its core, Fugu is a language model trained to invoke other LLMs from an agent pool—including recursive calls to instances of itself. Internally, it handles model selection, task delegation, result verification, and final synthesis.

Rather than relying on rigid, pre-defined roles or workflows, Fugu learns how to coordinate dynamically. It determines when to delegate tasks and how agents should collaborate, then merges their outputs into a single coherent response. From your perspective, you’re simply calling one model. Under the hood, a coordinated team of experts does the heavy lifting.

Sakana AI positions this approach as a safeguard against dependency on a single provider. If one vendor restricts access, Fugu seamlessly routes around the disruption. The team points to recent export controls affecting Anthropic’s Fable and Mythos models as key motivation. Over time, newer models can be integrated into the pool.

Fugu and Fugu Ultra: Two Models, One API

Fugu is available in two versions, both accessible via the same OpenAI-compatible API:

Fugu balances strong performance with low latency, making it ideal for everyday coding, code review, and chatbot applications. It also integrates well with tools like Codex. You can exclude specific agents from its pool, helping teams meet data privacy and compliance requirements.
Fugu Ultra is optimized for maximum accuracy on complex, multi-step problems. It coordinates a deeper pool of expert agents. Its agent pool is fixed, so opt-out functionality is not available. The current model ID is fugu-ultra-20260615.

The Research Behind the Orchestrator

Fugu builds upon two ICLR 2026 papers—Trinity and Conductor—focused on learned orchestration.

TRINITY employs a lightweight, evolved coordinator that operates across multiple turns, dynamically assigning Thinker, Worker, or Verifier roles. Conductor is trained using reinforcement learning to discover natural-language coordination strategies and targeted prompts for diverse LLM pools.

Together, these approaches demonstrate that systems can learn to assemble and route agents on a per-task basis, replacing manually designed workflows.

Interactive Explainer

<div><h2>Sakana Fugu — Orchestration Simulator</h2>A visual demonstration showing how Fugu directs a request: it decides, assigns Thinker / Worker / Verifier roles, manages a group of agents, and combines everything into one response.</div> Illustrative</div><div class="panel"> 1 · Task type<div class="row"> <select id="task"><option value="code">Coding and code review</option><option value="reason">Logic and math</option><option value="research">AutoResearch (automated ML)</option><option value="security">Security assessment</option><option value="repro">Paper reproduction</option><option value="longctx">Long-context analysis</option></select></div></div><div class="panel"> 2 · Model<div class="row"><div class="seg" id="modelSeg"> <button data-m="fugu" class="on">Fugu</button> <button data-m="ultra">Fugu Ultra</button></div></div><div class="hint" id="modelHint">Fugu strikes a balance between output quality and response speed. You can exclude agents from its pool for data, privacy, or compliance reasons.</div></div><div class="panel"> 3 · Agent pool<div class="chips" id="pool"><div class="chip on" data-a="opus">Opus 4.8</div><div class="chip on" data-a="gemini">Gemini 3.1 Pro</div><div class="chip on" data-a="gpt">GPT 5.5</div><div class="chip on locked" data-a="self">Fugu (recursive self‑call)</div></div><div class="hint" id="poolHint">Switch a provider off to remove it. Fugu Ultra always uses the full, fixed pool.</div></div><div class="panel"> 4 · Resilience event<div class="actions"> <button class="act ghost" id="restrictBtn">Simulate a provider restriction</button> No restriction active. The pool remains intact.</div></div><div class="panel" style="border-bottom:1px solid var(--ink)"><div class="endpoint"> POST /v1/chat/completions model = fugu · OpenAI‑compatible · single endpoint</div><div class="actions" style="margin-top:14px"> <button class="act" id="runBtn">Run orchestration</button> <button class="act ghost" id="resetBtn">Reset</button> Ready.</div></div><div class="grid"><div class="col"> Orchestration trace<div class="log" id="log"><div style="color:var(--mut);font-size:12px;padding:14px 0">Hit Run orchestration to view the routing steps.</div></div></div><div class="col"> Agent activity<div id="agents" style="margin-bottom:16px"></div> Synthesized answer<div class="answer" id="answer">Waiting for a run…</div></div></div><div class="disc"> This tool is a teaching aid illustrating how Fugu’s mechanics work based on public documentation: a single OpenAI-compatible endpoint, a configurable agent pool, opt-out controls, role assignment (Thinker / Worker / Verifier), and routing around a restricted provider. No live API is contacted during this simulation. The actual model selection and coordination methods used by Sakana AI are internal and not shared publicly. Benchmark numbers mentioned in the article are sourced from Sakana AI’s own published materials.</div><div class="ft"> Marktechpost · Sakana Fugu interactive explainer Source: sakana.ai/fugu</div></div><script> (function(){ var root = document.getElementById('fugu-sim-root'); var Q = function(s){ return root.querySelector(s); }; var QA = function(s){ return Array.prototype.slice.call(root.querySelectorAll(s)); }; var state = { model:'fugu', restricted:null, busy:false }; var AGENTS = { opus:{name:'Opus 4.8'}, gemini:{name:'Gemini 3.1 Pro'}, gpt:{name:'GPT 5.5'}, self:{name:'Fugu (self-call)'} }; var TASKS = { code:{plan:'Tackle the problem with a single expert, then double-check the diff.', answer:'Code review finished. 22 issues found across modules, ordered by severity, each paired with a recommended fix and a retest note.', metrics:[['Issues found','22'],['Roles used','Worker + Verifier'],['Mode','Single-agent + check']]}, reason:{plan:'Put together a team. Run multiple solvers, then confirm by majority vote.', answer:'Final answer confirmed by the Verifier after cross-checking three independent solution paths.', metrics:[['Solution paths','3'],['Roles used','Thinker x2 + Verifier'],['Mode','Multi-agent']]}, research:{plan:'Open-ended cycle: propose, execute, review failures, adjust, repeat.', answer:'Best validation BPB 0.9774 reached after repeated tweaks to batch size, depth, learning rate, and optimizer settings.', metrics:[['Iterations','123'],['Roles used','Thinker + Worker + Verifier'],['Mode','Long-running agentic']]}, security:{plan:'Focused end-to-end pass. Recon, checks, review, report.', answer:'Full assessment delivered: recon, XSS/SQLi checks, auth review, and a clean report with evidence and retest steps.', metrics:[['Stages','4'],['Roles used','Worker + Verifier'],['Mode','Scoped agentic']]}, repro:{plan:'Read, implement, train, evaluate, then examine the gap.', answer:'Method reproduced and evaluated. Remaining gap to reported numbers identified and explained.', metrics:[['Phases','5'],['Roles used','Thinker + Worker + Verifier'],['Mode','Long-running agentic']]}, longctx:{plan:'Chunk, retrieve, reason across the full context, then synthesize.', answer:'Cross-document synthesis produced, with connections drawn across sources that a single-pass read would miss.', metrics:[['Mode','Long-context'],['Roles used','Worker + Verifier'],['Mode2','Retrieve + reason']]} }; function activePool(){ return QA('#pool .chip').filter(function(c){ return c.classList.contains('on') && !c.classList.contains('off'); }).map(function(c){ return c.getAttribute('data-a'); }); } function renderModel(){ var ultra = state.model==='ultra'; Q('#modelId').textContent = ultra ? 'fugu-ultra-20260615' : 'fugu'; Q('#modelHint').textContent = ultra ? 'Fugu Ultra maximizes answer quality on hard, multi-step problems. It coordinates a deeper, fixed pool.' : 'Fugu balances quality and latency. You can opt agents out of its pool for data, privacy, or compliance needs.'; Q('#poolHint').textContent = ultra ? 'Fugu Ultra relies on the full agent pool, so opt-out is disabled.' : 'Toggle a provider off to opt it out. Fugu Ultra uses the full fixed pool.'; QA('#pool .chip').forEach(function(c){ var a = c.getAttribute('data-a'); if(a==='self') return; if(ultra){ c.classList.add('on'); c.classList.remove('off'); c.classList.add('locked'); } else { c.classList.remove('locked'); } // re-apply restriction visual if(state.restricted===a){ c.classList.add('off'); c.classList.remove('on'); } }); } // model toggle QA('#modelSeg button').forEach(function(b){ b.addEventListener('click', function(){ if(state.busy) return; QA('#modelSeg button').forEach(function(x){ x.classList.remove('on'); }); b.classList.add('on'); state.model = b.getAttribute('data-m'); renderModel(); }); }); // pool toggles QA('#pool .chip').forEach(function(c){ c.addEventListener('click', function(){ if(state.busy) return; var a = c.getAttribute('data-a'); if(a==='self') return; // recursive self-call always present if(state.model==='ultra') return; // fixed pool if(state.restricted===a) return; // restricted stays off c.classList.toggle('on'); c.classList.toggle('off', !c.classList.contains('on')); }); }); // restriction Q('#restrictBtn').addEventListener('click', function(){ if(state.busy) return; if(state.restricted){ var prev = state.restricted; state.restricted = null; var pc = root.querySelector('#pool .chip[data-a="'+prev+'"]'); pc.classList.remove('off'); pc.classList.add('on'); Q('#restrictBtn').textContent="Simulate a provider restriction"; Q('#restrictState').textContent = "No active restriction. The pool is intact."; } else { var pool = activePool(); var pick = pool.length>1 ? pool[0] : 'gpt'; state.restricted = pick; var c = root.querySelector('#pool .chip[data-a="'+pick+'"]'); c.classList.add('off'); c.classList.remove('on'); Q('#restrictBtn').textContent="Lift the restriction"; Q('#restrictState').textContent = AGENTS[pick].name + ' is unavailable. Fugu will route around it.'; } }); function reset(){ Q('#log').innerHTML = ' Press Run orchestration to watch the routing steps. '; Q('#agents').innerHTML = ''; Q('#answer').innerHTML = 'Awaiting run…'; Q('#status').textContent="Ready."; } Q('#resetBtn').addEventListener('click', function(){ if(!state.busy) reset(); }); function addStep(n, role, html){ var d = document.createElement('div'); d.className="step"; var r = role ? ''+role+'' : ''; d.innerHTML = ' '+n+' '+r+html+' '; Q('#log').appendChild(d); Q('#log').scrollTop = Q('#log').scrollHeight; } function agentBars(used){ var box = Q('#agents'); box.innerHTML=''; used.forEach(function(u){ var line = document.createElement('div'); line.className="agentline"; line.innerHTML = ''+AGENTS[u.id].name+''+ ''+ ''+u.role+''; box.appendChild(line); var bar = line.querySelector('i'); setTimeout(function(){ bar.style.width = (40+Math.random()*55)+'%'; }, 120); }); } function run(){ if(state.busy) return; state.busy = true; Q('#runBtn').disabled = true; Q('#resetBtn').disabled = true; Q('#log').innerHTML=''; Q('#agents').innerHTML=''; Q('#answer').innerHTML='Orchestrating_'; Q('#status').textContent="Running…"; var task = Q('#task').value; var ultra = state.model==='ultra'; var pool = activePool(); var workers = pool.filter(function(a){ return a!=='self'; }); if(workers.length===0) workers=['self']; var T = TASKS[task]; // choose mode: ultra or research/repro/reason => multi-agent team var team = ultra || task==='research' || task==='repro' || task==='reason'; var steps = []; steps.push({d:300, role:null, html:'Request received at a single endpoint. No client-side multi-agent code.'}); steps.push({d:600, role:null, html:'Decide: '+(team?'this task is multi-step — assemble a team.':'a single powerful model is sufficient — address it directly, then validate.')}); var used = []; if(team){ var thinker = workers[0]; var worker = workers[1] || workers[0]; var verifier = workers[2] || workers[0]; steps.push({d:700, role:'Thinker', html:''+AGENTS[thinker].name+' formulates a strategy and breaks down the task.'}); steps.push({d:700, role:'Worker', html:''+AGENTS[worker].name+' performs the sub-tasks and generates a candidate result.'}); if(pool.indexOf('self')>-1){ steps.push({d:600, role:'Worker', html:'Fugu triggers a recursive self-invocation on a challenging sub-problem.'}); } steps.push({d:700, role:'Verifier', html:''+AGENTS[verifier].name+' reviews the output and identifies gaps for revision.'}); used = [{id:thinker,role:'Thinker'},{id:worker,role:'Worker'},{id:verifier,role:'Verifier'}]; if(pool.indexOf('self')>-1) used.push({id:'self',role:'Worker'}); } else { var solo = workers[0]; steps.push({d:700, role:'Worker', html:''+AGENTS[solo].name+' tackles the task in a single pass.'}); steps.push(); used = [{id:solo,role:'Worker'},]; } if(state.restricted){ steps.splice(2,0,{d:650, role:null, html:''+AGENTS[state.restricted].name+' is restricted. Rerouting around it — no integration changes required.'}); } steps.push({d:700, role:null, html:'Combine all agents' outputs into one dependable answer.'}); // dedupe used by id, preserving the first assigned role var seen={}; used = used.filter(function(u){ if(seen[u.id])return false; seen[u.id]=1; return true; }); var i=0, n=0; function next(){ if(i>=steps.length){ finish(used, T); return; } var s = steps[i++]; n++; setTimeout(function(){ addStep(n, s.role, s.html); if(n===Math.min(4,steps.length-1)) agentBars(used); next(); }, s.d); } next(); } function finish(used, T){ setTimeout(function(){ QA('#agents .bar i').forEach(function(b){ b.style.width="100%"; }); var mhtml = T.metrics.map(function(m){ return ' '+m[0]+''+m[1]+' '; }).join(''); Q('#answer').innerHTML = ' '+T.answer+' '+mhtml; Q('#status').textContent="Complete. Routing details are confidential and not exposed in production."; state.busy=false; Q('#runBtn').disabled=false; Q('#resetBtn').disabled=false; sz(); }, 500); } Q('#runBtn').addEventListener('click', run); Q('#task').addEventListener('change', function(){ if(!state.busy) reset(); }); renderModel(); // ---- auto-resize for WordPress iframe embed ---- function sz(){ var h = root.offsetHeight + 40; if(window.parent && window.parent!==window){ window.parent.postMessage({type:'fugu-sim-height', height:h}, '*'); } } window.addEventListener('load', sz); window.addEventListener('resize', sz); setTimeout(sz, 300); // watch for DOM changes that affect height if(window.MutationObserver){ new MutationObserver(sz).observe(root, {childList:true, subtree:true, attributes:true}); } })();

Top Posts

Rewriting Jaeger’s ClickHouse backend: Achieving 8.6× compression on 10 million spans

NVIDIA Halos OS upgrades the safety of physical AI workloads

South Korea’s Unrealized Gains Tax Plan Ignites Market Turmoil on Black Tuesday

Sakana AI Unveils Sakana Fugu: The Orchestration Model That Dynamically Routes Tasks Across a Swappable Pool of Frontier LLMs

Datalab Releases lift: A 9B Open-Weights Vision Model That Extracts Structured JSON From PDFs Using Schemas

Unlock the Future: Everything About No-Code AI You Can’t Afford to Miss

ChatLLM by Abacus AI: The Multi-Model Workspace Changing How You Work Every Day

From Chaos to Clarity: Smart Encoding Strategies for Unmasking Outliers in Categorical Data

Unlocking 3 Powerful NLTK Strategies for Smarter Text Preprocessing

MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every Shape and Rounding Mode

Rewriting Jaeger’s ClickHouse backend: Achieving 8.6× compression on 10 million spans

NVIDIA Halos OS upgrades the safety of physical AI workloads

South Korea’s Unrealized Gains Tax Plan Ignites Market Turmoil on Black Tuesday

Vention Unites with FANUC and Universal Robots to Pioneer Software-Defined Automation

Prime Intellect Releases prime-rl 0.6.0 to Train Trillion-Parameter MoE Models on Agentic RL Workloads

Windows 11 KB5095093 update rolls out new Point-in-Time restore feature

Datalab Releases lift: A 9B Open-Weights Vision Model That Extracts Structured JSON From PDFs Using Schemas

OWL’s AWS Digest: Hanoi Local Zones, Grok 4.3 on Bedrock, NY Summit Highlights & Fresh Price Drops (June 22, 2026)

Trending

Rewriting Jaeger’s ClickHouse backend: Achieving 8.6× compression on 10 million spans

NVIDIA Halos OS upgrades the safety of physical AI workloads

Latest Posts

Not More Data, but Better World Models – Unite.AI

OpenAI Is Hiring Head of Preparedness, Amid AI Cyberattack Fears

Subscribe to Updates

Top Posts

Sakana AI Unveils Sakana Fugu: The Orchestration Model That Dynamically Routes Tasks Across a Swappable Pool of Frontier LLMs

Key Takeaways

What is Sakana Fugu?

Fugu and Fugu Ultra: Two Models, One API

The Research Behind the Orchestrator

Interactive Explainer

Related Posts