Agent Registries Are Not Runtime Governance

The security review starts with a simple question: how many agents do we have in production?

The answer is not simple. Some agents live in the support platform. Some are embedded in developer tools. Some are marketplace skills wrapped around scripts. Some run as scheduled workflows. A few were created by business teams and never entered the normal application inventory.

That is why agent registries matter.

A registry gives the organization a place to record what agents exist, who owns them, what environment they run in, and which lifecycle state they are in. Microsoft Agent 365 is positioned around this kind of enterprise problem: observe, govern, and secure agents through centralized registry, lifecycle management, access control, compliance, and audit.

That is necessary infrastructure. It is not the same thing as runtime governance.

The registry tells you what the agent is supposed to be. Runtime governance decides whether the next action should happen now.

What a Registry Is Good At

An agent registry is an inventory and lifecycle control point. It helps answer questions that every production governance program eventually asks.

Registry question	Example answer
Which agents exist?	`support-refund-agent-prod`, `sales-research-agent`
Who owns them?	Support platform team, revenue operations
Where do they run?	Production, staging, developer workstation
What is their lifecycle state?	Draft, approved, suspended, retired
Which identity or key is assigned?	Tenant-scoped runtime credential
Which tools are approved?	CRM read, ticket create, email send
Which compliance review applies?	Data handling, access review, audit retention

These are real controls. Without them, incident response becomes discovery. Operators spend the first hour of an incident figuring out whether the agent is sanctioned, who owns it, and where to disable it.

A registry is also the right place to manage approval workflows. Before an agent reaches production, someone should know its owner, purpose, environment, data classes, toolsets, and expected operating envelope.

What the Registry Does Not Decide

A registry usually stores intended state. It does not automatically know what has already happened inside a live run.

Runtime question	Why registry state is insufficient
Has this tenant exhausted its budget?	Requires live ledger state
Has this workflow already used its risk allowance?	Requires cumulative action accounting
Is this the 1st or 201st email in the run?	Requires per-run action history
Is this delegated child agent narrower than the parent?	Requires scoped authority at delegation time
Should the agent degrade instead of stop?	Requires ALLOW_WITH_CAPS semantics
Is the reservation safe under concurrency?	Requires atomic reserve-commit logic

The difference is timing. Registry review happens before deployment or during lifecycle management. Runtime governance happens at the moment an action is proposed.

Those moments have different information.

At registration time, you know the intended owner and approved capabilities. At runtime, you know the current tenant, workflow, toolset, estimate, remaining budget, previous actions, retry behavior, and accumulated exposure.

The Approved-Agent Failure Mode

Many production failures do not come from unknown agents. They come from approved agents behaving badly under a specific input, failure mode, or workload spike.

Approved state	Runtime failure
Agent is registered	It loops on a retryable error
Owner is assigned	The owner is not paged until the budget is exhausted
Tool is approved	The tool is called too many times
Key is valid	The key is used outside the intended workflow
Policy is documented	The action exceeds the live risk budget
Audit is enabled	The log shows what happened after the side effect

This is why "approved" cannot be the last governance decision.

An approved support agent may be allowed to send customer emails. It should not send 400 emails in one run. An approved coding agent may be allowed to edit source files. It should not use the same authority to modify deployment scripts. An approved research agent may be allowed to call web search. It should not spend the entire tenant budget on one task.

The registry answers whether the agent belongs in the system. Runtime authority answers whether this action still belongs in this run.

The Two-Control-Plane Pattern

The safer pattern is to compose lifecycle governance and runtime authority.

text

Agent registry
  -> records owner, purpose, lifecycle state, approved toolsets
Runtime authority
  -> evaluates each proposed action against budget, risk, and scope
Tool execution
  -> proceeds only when the live decision allows it
Audit stream
  -> preserves both lifecycle context and runtime decision context

That split keeps each layer honest.

The registry should not try to become a high-frequency budget ledger. It is optimized for inventory, review, lifecycle, and compliance workflows.

Runtime authority should not try to become the enterprise system of record for agent ownership. It is optimized for fast, atomic, per-action decisions.

Together, they produce a better control plane than either layer can provide alone.

What to Pass from Registry to Runtime

The useful integration is not "registry versus runtime." It is "registry metadata attached to runtime decisions."

A runtime authority request can carry:

Field	Example
`agent_id`	`support-refund-agent-prod`
Owner	`support-platform-team`
Lifecycle state	`approved`
Tenant	`acme`
Environment	`production`
Toolset	`refund.issue`
Risk tier	`high`
Budget scope	`tenant:acme/workflow:refund-run-4821`

Some of this metadata is enforcement input; some is audit context. The important part is that it travels with the runtime decision.

The server can then evaluate live policy against a meaningful identity and scope. The audit trail can show not just "a key made a request," but which registered agent, under which tenant, in which workflow, consumed which budget or RISK_POINTS.

That is the bridge between lifecycle governance and runtime authority.

Incident Response Gets Cleaner

When a registry and runtime ledger are connected, incident response becomes more direct.

The registry tells operators:

which agent exists
who owns it
which environment it runs in
which credentials and approved toolsets are associated with it
whether it should be suspended, retired, or reviewed

Cycles records and application telemetry together can tell operators:

which actions were allowed, capped, or denied
which budget or risk scope was consumed
whether the incident is isolated to one tenant, workflow, or agent
whether the application mapped child agents to narrower budget scopes
whether the host honored a rejection before the side effect happened

That makes fleet operations safer. A registry can help identify the affected population. Bulk actions can suspend tenants, pause webhooks, or adjust budgets for that population. Cycles events, balances, reservation records, and application logs show the budget decisions and settlement; the host remains the source of truth for tool authorization and external outcomes.

Where Cycles Fits

Cycles is not an agent registry. It does not replace Microsoft Agent 365, an internal CMDB, a marketplace approval flow, or an IAM platform.

Cycles sits at the budget decision point. It uses scoped budgets, reserve-commit semantics, and idempotency keys to decide whether the submitted estimate remains within configured budget bounds, then records applicable audit and event data. The host separately authorizes the action and enforces the result.

That makes it complementary to registry systems. The registry defines the agent's intended envelope. Cycles meters and enforces the envelope while the agent runs.

The same pattern appears in Zero Trust for AI Agents: every consequential action needs a policy decision before execution. Registry approval is one input to that decision. It is not the whole decision.

The Takeaway

Agent registries are becoming necessary enterprise infrastructure. They give teams inventory, ownership, lifecycle state, access review, and audit context.

But runtime governance asks a different question: should this specific next action proceed, given the budget, risk, scope, and history already consumed?

Production systems need both. The registry says what the agent is supposed to be. Runtime authority decides what the agent is still allowed to do.

Sources

Microsoft Learn: Overview of Microsoft Agent 365 - agent registry, lifecycle, access control, compliance, and audit positioning
Microsoft Security Blog: Zero Trust for AI - zero trust framing for the AI lifecycle and agent behavior
OWASP Top 10 for Agentic Applications 2026 - agentic risk framework for autonomous systems
Zero Trust for AI Agents - local context on pre-execution policy decisions
Runtime Authority vs Runtime Authorization - how identity policy and bounded runtime authority compose
API Key Management in Cycles - runtime credential scoping and rotation
Runtime Authority Byproducts: Audit Trail and Attribution by Default - audit value created by runtime enforcement

Agent Registries Are Not Runtime Governance ​

What a Registry Is Good At ​

What the Registry Does Not Decide ​

The Approved-Agent Failure Mode ​

The Two-Control-Plane Pattern ​

What to Pass from Registry to Runtime ​

Incident Response Gets Cleaner ​

Where Cycles Fits ​

The Takeaway ​

Sources ​

More from the Blog