What is Cycles?

Cycles is a runtime authority for autonomous agents. It enforces hard limits on agent spend and actions — before they happen, not after.

python

@cycles(estimate=5000, action_kind="llm.completion", action_name="openai:gpt-4o")
def ask(prompt: str) -> str:
    return openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    ).choices[0].message.content
# Cycles are reserved before the action runs. If unavailable, execution is denied.

The problem

Autonomous systems fail differently than traditional software. A runaway agent does not just burn dollars — it creates unbounded exposure.

That exposure can be financial: thousands of dollars in LLM calls accumulated before anyone notices. But it can just as easily be operational: records deleted, files overwritten, emails sent, orders placed, deployments triggered. In these cases, the damage is not measured primarily in cost, but in consequence.

Rate limiters control velocity — requests per second. They do not control total exposure: the cumulative cost, risk, or irreversible side effects a system is allowed to create before execution is halted. Nor do they constrain what each individual action is permitted to do.

By the time an alert fires, the system has already acted. Observation is useful for visibility. It is not enforcement.

See it in action

The Demos page has self-contained scenarios you can run in 60 seconds — no LLM API key required:

Runaway Agent Demo — same agent, same bug, two outcomes: without Cycles the agent burns ~$10 before being force-killed. With Cycles it stops cleanly at $1.00.
Action Authority Demo — a support agent handles a billing dispute in four steps. Cycles allows internal actions but blocks the customer email before it executes.

How Cycles solves it

Cycles enforces a budget decision before agent actions execute — LLM calls, tool invocations, API requests. Every action follows a Reserve-Commit lifecycle:

Cycles enforces where you instrument it. Uninstrumented code paths are unaffected.

1. Reserve    →  Lock estimated amount before the action runs
2. Execute    →  Call the LLM / tool / API
3. Commit     →  Record actual usage; unused budget is released automatically

If the budget is exhausted, the reservation is denied before the action executes.

PythonTypeScript

python

from runcycles import CyclesClient, CyclesConfig, cycles, set_default_client

client = CyclesClient(CyclesConfig.from_env())
set_default_client(client)

@cycles(estimate=5000, action_kind="llm.completion", action_name="openai:gpt-4o") 
def ask(prompt: str) -> str:
    return openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    ).choices[0].message.content

# Cycles are reserved before the action, committed after, released on failure.
result = ask("Summarize this document")

typescript

import { CyclesClient, CyclesConfig, withCycles, setDefaultClient } from "runcycles";

const client = new CyclesClient(CyclesConfig.fromEnv());
setDefaultClient(client);

const ask = withCycles( 
  { estimate: 5000, actionKind: "llm.completion", actionName: "openai:gpt-4o" }, 
  async (prompt: string) => {
    const res = await openai.chat.completions.create({
      model: "gpt-4o",
      messages: [{ role: "user", content: prompt }],
    });
    return res.choices[0].message.content;
  },
);

const result = await ask("Summarize this document");

Key guarantees

Guarantee	What it means
Atomic reservation	Budget is locked across all affected scopes in one operation — no partial locks
Concurrency-safe	Multiple agents sharing a budget cannot oversubscribe
Idempotent	Retries are safe; the same action cannot settle twice
Pre-enforcement	Budget is denied before the expensive action, not after

Multi-level scoping

Budgets are applied hierarchically. A single reservation can enforce limits at every level simultaneously:

tenant → workspace → app → workflow → agent → toolset

For example, a reservation with tenant=acme, workspace=prod, app=chatbot checks budget at:

tenant:acme
tenant:acme/workspace:prod
tenant:acme/workspace:prod/app:chatbot

All three must have sufficient budget for the reservation to succeed.

Architecture

Your Application@cycles / withCycles / MCP

HTTP (port 7878) · X-Cycles-API-Key

Cycles ServerRuntime enforcement · Port 7878

Cycles Admin ServerTenants / budgets / keys · Port 7979

Redis 7+Budget state · reservations · tenants · audit logs

BRPOP

Cycles Events ServiceWebhooks (optional) · Port 7980

Your application talks to the Cycles Server for runtime budget checks. The Admin Server manages tenants, API keys, and budget ledgers. The Events Service (optional) delivers webhook notifications asynchronously — see Deploying the Events Service.

Who uses Cycles

Platform teams building multi-tenant agent runtimes
Framework authors integrating budget enforcement into SDKs
Enterprise operators needing audit-grade cost accountability
Teams building agents that call paid APIs autonomously

Get started

New to Cycles? Start here.

The End-to-End Tutorial takes you from zero to a working budget-guarded app in ~10 minutes — deploy the stack, create a tenant, mint an API key, and run your first reservation. Do this first, then pick a language client below.

Already have a running server? Pick your client

Stack	Guide	Time
Python	Python Quickstart	~5 min
TypeScript / Node.js	TypeScript Quickstart	~5 min
Spring Boot / Java	Spring Boot Quickstart	~5 min
Rust	Rust Quickstart	~5 min
Claude / Cursor / Windsurf	MCP Server Quickstart	~3 min

Need to deploy the server?

Scenario	Guide	Time
Single-server local	Self-hosting the Cycles Server	~5 min
Full stack (runtime + admin + events)	Deploy the Full Stack	~10 min
Webhooks/events only	Deploy the Events Service	~5 min
Admin dashboard (web UI)	Deploy the Admin Dashboard	~10 min

Next steps

Ballpark this for your workload — the cost calculator takes ~30 seconds and produces a shareable URL with your numbers
Choose a First Rollout — decide your adoption strategy
Architecture Overview — how the runtime, admin, and events components fit together
How Cycles Compares — vs. rate limiters, observability, provider caps

What is Cycles? ​

The problem ​

See it in action ​

How Cycles solves it ​

Key guarantees ​

Multi-level scoping ​

Architecture ​

Who uses Cycles ​

Get started ​

Already have a running server? Pick your client ​

Need to deploy the server? ​

Next steps ​