How Claude Managed Agents Work: A Non Technical Guide
25 May 2026 | David and Goliath
Quick answer
Claude Managed Agents is Anthropic's API for running long lived Claude agents in production without your team operating the underlying infrastructure. Launched May 2026, it handles state, retries, scheduling, and tool execution for agents that need to run reliably across hours or days. Most enterprises adopt it for customer facing agents, internal support, and any workflow where uptime matters more than maximum customisation.
- Anthropic launched the Managed Agents API in May 2026
- Handles state, retries, scheduling, and tool execution by default
- Pricing per agent run rather than per token, predictable for production budgets
- Strong fit for customer support, internal helpdesk, and any always on workflow
- Less customisable than direct Claude API, traded for operational simplicity
Mentioned: Anthropic, Claude, Claude Managed Agents, Claude Code, Vercel, AWS
Anthropic released the Managed Agents API in May 2026. If you have spent 12 months keeping a Claude agent running reliably in production, you already know why it matters. This guide gives the non technical answer on what it is, when to use it, and what to budget for.
What is Claude Managed Agents?
Claude Managed Agents is Anthropic's API for running long lived Claude agents in production without your team operating the underlying infrastructure (Source: Anthropic Managed Agents documentation, May 2026). It handles the operational layer underneath an agent: state persistence, retries, scheduling, tool execution, and the orchestration to keep an agent running across hours or days rather than seconds.
The distinction matters. Direct Claude API calls are stateless. Anything that needs to persist across calls (conversation state, intermediate results, tool outputs, queued work) has to be built and operated by you. For an agent that runs a multi step workflow over a long time horizon, retries failed tool calls, and picks up where it left off after a restart, the operational burden is significant. Managed Agents removes that burden by running the agent runtime as a service inside Anthropic's infrastructure.
You describe, configure, and instrument the agent, but do not host it. The trade off is less flexibility than running the runtime yourself. For most production use cases, that is a sensible trade.
How does Claude Managed Agents differ from the standard Claude API?
The standard Claude API is a stateless request and response interface. Managed Agents is a stateful runtime on top of it. The standard API gives you a model call. Managed Agents gives you a managed process that calls the model, holds state, calls tools, and persists across interruptions.
The standard Claude API is the right choice when you are building a chat interface, a single shot summarisation workflow, or any function where one request maps to one response. The infrastructure is simple and the operational footprint is essentially zero.
Managed Agents is the right choice when a workflow does any of three things. First, runs over a long time horizon with multiple model calls, tool calls, and intermediate state. Second, recovers gracefully when a tool fails or a dependency is unavailable. Third, is scheduled, triggered on an event, or kept warm so response time is predictable. Building those properties on the standard API takes engineering effort. Managed Agents ships them as part of the service.
When does Claude Managed Agents make sense?
Managed Agents makes sense when uptime and operational simplicity matter more than maximum customisation. The clearest fits are three patterns: customer facing agents, internal helpdesk and support automation, and any always on workflow where downtime is visible.
A customer facing agent is the canonical case. It handles enquiries, runs continuously, and recovers from transient failures without losing context. Building that on the standard API means provisioning state storage, a retry layer, monitoring, and operations. On Managed Agents, you describe the agent, configure tools, and point your application at the managed runtime.
Internal helpdesk and support automation follow the same pattern at smaller scale. An agent handling IT or HR queries runs continuously, remembers context within a session, and recovers from tool failures. The volume is lower but the operational shape is the same.
Always on workflows are the third pattern. An agent that monitors a queue, processes incoming documents, or runs scheduled work fits well. The standard API would require you to operate the scheduler, the worker pool, and the state store. The managed runtime handles all three.
What can Claude Managed Agents do that direct API agents cannot?
The Managed Agents runtime ships several capabilities you would otherwise build yourself. State persistence means the agent remembers context across calls and runtime restarts (Source: Anthropic Managed Agents documentation, May 2026). Retries with backoff are applied to tool calls that fail transiently. Scheduling lets you trigger an agent on a cron expression, on an event, or on an external webhook. Tool execution is sandboxed inside the managed runtime.
The capability that matters most is run resumption. If the runtime restarts mid run, or a tool call fails after a long running step, the agent resumes from the last checkpoint rather than restarting from scratch. Building that on the standard API takes weeks of engineering effort.
The constraint is real: the agent runs inside Anthropic's runtime. Tools that reach into your internal systems do so through outbound calls from the runtime to your endpoints. That is fine for most patterns, but you cannot run the agent inside your network perimeter where outbound calls from a third party runtime are not acceptable. For those workflows, the standard API plus your own orchestration is still the right pattern.
How does pricing work?
Pricing for Managed Agents is structured around the agent run rather than the underlying token spend (Source: Anthropic Managed Agents documentation, May 2026). A run is a single invocation that may include many model calls, tool calls, and intermediate steps. The model gives finance teams a predictable unit cost per business event (an enquiry handled, a ticket triaged, a document processed) rather than a per token spend that varies with each interaction.
There is a token component inside the per run pricing, and it scales with the work the agent does. Heavy reasoning workflows cost more per run than lightweight triage. For exact rates, see Anthropic's published pricing page, as rates may shift as the product matures.
The practical implication is that Managed Agents is easier to forecast for production workloads. You estimate the volume of business events, multiply by an average cost per run, and arrive at a monthly spend finance can plan against. That is a meaningful improvement on the standard API, where engineering time on operating the runtime is the dominant hidden cost.
What about data residency and governance?
Managed Agents inherits Anthropic's enterprise data handling baseline. Customer content processed by an agent is not used to train future Anthropic models and is retained only as required to operate the runtime (Source: Anthropic Enterprise Terms, May 2026). For ANZ deployments, the managed runtime can run within the AWS Sydney region so that agent state and customer content stay onshore.
The governance work on top of the baseline matches any production Claude deployment. A Privacy Impact Assessment is needed where personal information is in scope, aligned to the Privacy Act reforms taking effect in December 2026 (Source: Attorney General's Department, Privacy Act Review Report 2023, reforms enacted 2025). For APRA regulated entities, Managed Agents counts as a material service provider under CPS 230, which takes effect on 1 July 2026, so the third party register and operational resilience evidence must capture the dependency (Source: APRA, CPS 230 final standard, 2024).
The advantage for governance is that the operational layer is documented and assured by Anthropic. The trade off is less visibility into the runtime internals than if you ran the agent yourself. For most enterprise use cases, the balance favours the managed runtime. For workflows where you need to evidence specific runtime behaviour to a regulator, the direct API plus your own orchestration may still be the better answer.
How does this fit with the Claude Activation Programme?
The Claude Activation Programme delivers one production agent in 10 business days. Managed Agents changes the implementation pattern, not the structure. The activation modules (Knowledge Foundation, Workflow Automation, Governance, Activation) are unchanged. What changes is the runtime the first agent runs on.
For workflows where Managed Agents fits (customer facing, always on, predictable run volume), the activation builds the agent against the managed runtime. The Workflow Automation module configures the agent, tools, and scheduling. The Governance module captures the material service provider documentation, the Privacy Impact Assessment, and the audit trail configuration.
For workflows where the standard API plus orchestration fits better, the activation still delivers a production agent at day 10, but the runtime is operated inside your environment or on a partner managed platform. The choice is made at scope definition based on the workflow, data residency requirements, and integration shape.
Where do most teams get this wrong?
The most common mistake is choosing Managed Agents for a workflow that does not need it. A single shot summarisation function, a chat interface, or a one off document analysis does not benefit from the managed runtime. The standard Claude API is simpler, cheaper for that pattern, and has a smaller operational surface to govern. Teams that default to Managed Agents because it is the newest product end up with a more complex deployment than the workflow requires.
The second mistake is the inverse: building a long running agent on the standard API because it feels more flexible. The flexibility costs weeks of engineering effort on state, retries, scheduling, and observability. Teams often spend three months building infrastructure that Managed Agents would have shipped on day one.
The third mistake is treating Managed Agents as a way to skip governance. The runtime is managed, but the deployment is not. The Privacy Impact Assessment, material service provider documentation, audit trail, and human oversight design are still your responsibility. Teams that assume the managed runtime covers governance find the gap surfaced on the next privacy or APRA review.
What should ANZ teams do this quarter?
Identify the workflows where uptime and operational simplicity matter more than maximum customisation. Customer facing agents are the obvious candidates. Internal helpdesk and always on document processing are the next tier. Single shot or experimental workflows should stay on the standard API.
For each candidate, scope a 10 day activation that delivers a production agent on the managed runtime, with governance documentation as a deliverable. The output is a working agent, an internal AI register entry that satisfies the December 2026 Privacy Act baseline and the 1 July 2026 CPS 230 deadline, and a reference deployment your team can extend.
To scope a Managed Agents activation for your organisation, visit davidandgoliath.ai/claude-activation or book a scoping call at davidandgoliath.ai/claude-activation/start.
Sources: Anthropic Managed Agents documentation, May 2026. Anthropic Enterprise Terms, May 2026. APRA, CPS 230 final standard, 2024. Attorney General's Department, Privacy Act Review Report 2023, reforms enacted 2025.
Ready to move from reading to shipping?
Ten business days. Four modules. One agent live by the end.