"One Job, Many Minds: Harnessing A Team Of Claudes For Every Task"

1.

Throughout much of 2024 and 2025, the go-to approach was straightforward: assign the job to a single AI agent, make use of the largest context window you could find, and hope for the best. It occasionally succeeded. Just as often, the model silently lost focus midway through the task.

Anthropic addressed this challenge head-on: tasks that span many steps demand sustained coherence, which frequently exceeds what any single context window can handle dependably. Expanding the window improved things, but didn’t fully fix the underlying issue.

Anthropic had already delivered several tools to manage this. Subagents allowed the primary agent to hand off smaller tasks to separate helper agents, each operating with a clean, independent context and feeding condensed summaries back into the main thread. Skills let users bundle recurring procedures into reusable Markdown guides — essentially playbooks Claude could pull up whenever needed. Agent teams pushed the idea further: multiple distinct Claude instances, each maintaining its own context window, working together via a shared to-do list and direct messaging.

All of these tools represented genuine advances. Yet they all shared the same core limitation.

When using subagents, the lead Claude still orchestrates the full plan. Every report or summary returned from a helper flows back into the main session’s context window. With subagents, skills, and agent teams alike, Claude acts as the central coordinator: it determines step-by-step what task to launch or delegate next, and everything piles up in its context. This causes the orchestrator’s context to balloon as more worker agents come online, until it eventually maxes out. At that point, performance deteriorates the same way it always does — the very same breakdown patterns resurface.

Anthropic pinpointed three recurring failure patterns that emerge whenever a single context window — whether tied to one lone agent or a coordinator managing a small crew — shoulders a task too complex to manage cleanly. These are the three predictable ways things fall apart (Figure 1).

Figure 1. One mind, one context window — and the three ways it quietly collapses on a large task. Image by author help by ChatGPT

The first is Agentic Laziness — the agent begins a task but fails to see it through completely. It might quit prematurely, overlook certain files, or presume the outstanding pieces are comparable enough not to matter. Then, with misplaced confidence, it declares the entire task finished. Think of someone skimming just a portion of a massive spreadsheet but officially marking the whole document as verified.

The second is Self-preferential Bias. AI tends to grade its own work with a generous hand. If you prompt it with, “Did you follow the instructions properly?” it frequently responds positively, naturally inclined to give itself the benefit of the doubt. It may overlook errors in its own output or overstate how well it actually performed.

The third is Goal Drift. As a task stretches longer, the AI steadily lets the original objective slip out of focus. It might still remember the general aim but lose track of specifics like “do not include X,” “don’t skip any file,” or “only use this format.” The more extended the conversation or task grows, the more severe the drift becomes.

These aren’t glitches. They are the natural consequences of treating a plan as a fleeting thought — and thoughts fade.

The price of ignoring this became starkly evident in early 2026, when Jarred Sumner, the creator of Bun, set out to migrate roughly 750,000 lines of Zig code to Rust — one file at a time. Previously, an undertaking of this scale would have consumed a team of developers for months. Sumner’s strategy, however, was elegant in its simplicity: complete one unit of work, run an adversarial review on it, then apply the changes. He would later call Dynamic Workflows “the state of the art today for reliably using agents to complete medium-to-large projects.” The outcome: 750,000 lines of Rust, 99.8% of the original test suite still passing, and just 11 days elapsed from the initial commit to final merge.

The core insight is that Claude never has to carry the entire plan in memory. The workflow externalizes the plan as executable code. The script owns the looping logic, the decision branches, and the intermediate outputs. Claude is responsible only for the current step and the concluding synthesis. The plan exists as a JavaScript file — and a file doesn’t forget, drift, or prematurely declare victory.

This is the exact gap Dynamic Workflows were engineered to fill. And this article will walk you through it.

By the time you finish reading, you’ll understand precisely where subagents, skills, and agent teams hit their boundaries — and why — not as a vague gut feeling, but as a clear structural argument you can apply to your own work. You’ll know the six composition patterns that handle the vast majority of practical workflow challenges, how to write a workflow prompt that yields a genuinely effective harness, and how to sidestep the two costliest mistakes people make when getting started. You’ll also recognize when a workflow is the wrong approach — because Dynamic Workflows burn through significantly more tokens than an ordinary session, and pulling them out for the wrong job is its own form of failure.

2. What a Dynamic Workflow Is

A Dynamic Workflow is like swapping out one overwhelmed individual for a small, specialized team.

Rather than loading one AI with the entire project from beginning to end, you break the work into distinct, manageable chunks. One agent tackles one specific job. A second reviews the output. A third advances the process further. In this setup, nobody grows fatigued midway and starts taking shortcuts. Nobody awards themselves top marks simply because they produced the answer. And nobody loses sight of the original instructions, because each agent only needs to focus on its own clearly defined piece of the puzzle.

Claude’s Dynamic Workflow enables exactly this. It distributes the job across a team of Claude instances, each starting with a clean slate of context. Each one handles a discrete segment, a separate layer scrutinizes the results, and everything is consolidated back into a single delivered output for you.

The key concept here is harness. A harness is the framework wrapped around the model — the layer responsible for deciding how a task is planned, partitioned, verified, and carried out. The default Claude Code harness is designed primarily for software development tasks. Anthropic’s team discovered that these dynamic harnesses can be “sometimes even more powerful for non-technical work.” From there, Anthropic constructed a harness tailored to whatever unique task you hand it.

Before diving deeper, it’s worth clarifying a handful of terms that tend to blur together. Tools, agents, harnesses, and workflows are often tossed around as synonyms. They’re not. The cleanest way to distinguish them — borrowing this framing from AlphaSignal — poses a single question: who holds the plan? (Figure 2)

	Subagent	Agent team	Dynamic workflow
Who holds the plan	the main Claude (orchestrator), internally	the peers, shared among them	a JavaScript program
Lifecycle	fire-and-forget, single task	long-running, ongoing	runs once, returns one answer
Talk to each other?	no — the orchestrator routes everything, and a subagent can’t even spawn its own subagents	yes — they coordinate as peers over time	no — agents work in the background through script variables; only the final result is returned
Feels like	an intern handed one task	colleagues collaborating on a shared project	an assembly line you’ve designed

Top Posts

“One Job, Many Minds: Harnessing a Team of Claudes for Every Task”

3 Critical Insights from Microsoft Build 2026 That Every Leader Must Know

The Physical AI Revolution: Reshaping Our Connected World

“One Job, Many Minds: Harnessing a Team of Claudes for Every Task”

Synthetic Data: Transforming Virtual Experiments into Groundbreaking Biomedical Discoveries

Google’s Gemini-SQL2 Achieves 80.04% on BIRD Leaderboard with Gemini 3.1 Pro

Why Decade-Old Residual Connections Still Dominate AI—And Why That’s Holding Us Back

When PyMuPDF Misses the Table: Unlocking PDF Parsing for RAG with Azure Layout

“Unlock 3 Powerful NumPy Tricks to Supercharge Your Numerical Performance”

Pioneering Otitis Media Diagnosis: The 4DO-DETR Breakthrough

“One Job, Many Minds: Harnessing a Team of Claudes for Every Task”

3 Critical Insights from Microsoft Build 2026 That Every Leader Must Know

The Physical AI Revolution: Reshaping Our Connected World

Unlocking Claude Code’s Full Potential with Local Model Pairings

Anthropic Terminates Fable 5 and Mythos 5 Access Following US Export Restrictions

U.S. Halts Anthropic’s Fable 5 and Mythos 5 Access for Foreign Nationals

Five NDAA proposals poised to reshape life for DoD civilian employees

The Complete Guide to Securing Domains for Your IoT Innovations

Trending

“One Job, Many Minds: Harnessing a Team of Claudes for Every Task”

3 Critical Insights from Microsoft Build 2026 That Every Leader Must Know

Latest Posts

Not More Data, but Better World Models – Unite.AI

OpenAI Is Hiring Head of Preparedness, Amid AI Cyberattack Fears

Subscribe to Updates

Top Posts

“One Job, Many Minds: Harnessing a Team of Claudes for Every Task”

1.

2. What a Dynamic Workflow Is

3. The real test

3.1 Patterns that make dynamic workflows useful

3.2 Dynamic Workflow on a non-technical problem

3.3 Enable dynamic workflows

3.4 Let’s test them out

3.4.1 Default approach

3.4.2 Cheaper model for agents

3.4.3 Adjusting the workflow before execution

3.4.4 Comparison with a single-agent approach

6. Save the workflow only if it’s worth keeping

Sources

Related Posts