Integrating Cycles with Pydantic AI

This guide shows how to guard Pydantic AI agent runs with Cycles budget reservations so that every agent invocation is cost-controlled and observable.

Prerequisites

bash

pip install runcycles pydantic-ai

Set environment variables:

bash

export CYCLES_BASE_URL="http://localhost:7878"
export CYCLES_API_KEY="your-api-key"   # create via Admin Server — see note below
export CYCLES_TENANT="acme"
export OPENAI_API_KEY="sk-..."         # or whichever provider your agent uses

Need an API key? Create one via the Admin Server — see Deploy the Full Stack or API Key Management.

60-Second Quick Start

python

from pydantic_ai import Agent
from runcycles import CyclesClient, CyclesConfig, cycles, set_default_client

set_default_client(CyclesClient(CyclesConfig.from_env()))

agent = Agent("openai:gpt-4o", system_prompt="You are a helpful assistant.")

@cycles(estimate=1_500_000, action_kind="llm.completion", action_name="gpt-4o")
def ask(prompt: str) -> str:
    result = agent.run_sync(prompt)
    return result.data

print(ask("What is budget authority?"))

Every call is now budget-guarded. If the budget is exhausted, BudgetExceededError is raised before the agent runs. Read on for production patterns with tool calls and structured output.

Guarding model calls

Use @cycles to wrap an agent run with automatic reserve, execute, and commit:

python

from pydantic_ai import Agent
from runcycles import (
    CyclesClient, CyclesConfig, CyclesMetrics,
    cycles, get_cycles_context, set_default_client,
)

set_default_client(CyclesClient(CyclesConfig.from_env()))

PRICE_PER_INPUT_TOKEN = 250       # $2.50 / 1M tokens in microcents
PRICE_PER_OUTPUT_TOKEN = 1_000    # $10.00 / 1M tokens in microcents

agent = Agent("openai:gpt-4o", system_prompt="You are a research assistant.")

@cycles(
    estimate=2_000_000,
    actual=lambda result: (
        result["usage"]["input_tokens"] * PRICE_PER_INPUT_TOKEN
        + result["usage"]["output_tokens"] * PRICE_PER_OUTPUT_TOKEN
    ),
    action_kind="llm.completion",
    action_name="gpt-4o",
    unit="USD_MICROCENTS",
    ttl_ms=60_000,
)
def research(question: str) -> dict:
    result = agent.run_sync(question)

    ctx = get_cycles_context()
    if ctx:
        ctx.metrics = CyclesMetrics(
            tokens_input=result.usage().request_tokens,
            tokens_output=result.usage().response_tokens,
        )

    return {
        "answer": result.data,
        "usage": {
            "input_tokens": result.usage().request_tokens,
            "output_tokens": result.usage().response_tokens,
        },
    }

Tool call budget scoping

When your Pydantic AI agent uses tools, you can budget each tool invocation separately by wrapping tool functions with @cycles:

python

from pydantic_ai import Agent, RunContext

agent = Agent("openai:gpt-4o", system_prompt="Use tools to answer questions.")

@agent.tool
@cycles(estimate=50_000, action_kind="tool.search", action_name="web-search")
def search_web(ctx: RunContext[None], query: str) -> str:
    """Search the web for information."""
    # Your search implementation
    return perform_search(query)

@agent.tool
@cycles(estimate=20_000, action_kind="tool.lookup", action_name="db-lookup")
def lookup_database(ctx: RunContext[None], record_id: str) -> str:
    """Look up a record in the database."""
    return db.get(record_id)

@cycles(estimate=2_000_000, action_kind="llm.completion", action_name="gpt-4o")
def ask_with_tools(prompt: str) -> str:
    result = agent.run_sync(prompt)
    return result.data

Each tool call gets its own reservation, so you have fine-grained visibility into what the agent spends on LLM calls versus tool invocations.

Structured output with budget control

Pydantic AI excels at returning structured data. Combine this with Cycles to budget-guard typed responses:

python

from pydantic import BaseModel
from pydantic_ai import Agent
from runcycles import cycles

class MovieReview(BaseModel):
    title: str
    rating: float
    summary: str

review_agent = Agent(
    "openai:gpt-4o",
    result_type=MovieReview,
    system_prompt="You are a film critic. Return structured reviews.",
)

@cycles(estimate=1_500_000, action_kind="llm.completion", action_name="gpt-4o")
def review_movie(movie_name: str) -> MovieReview:
    result = review_agent.run_sync(f"Review the movie: {movie_name}")
    return result.data

The decorator does not interfere with the return type. Your function still returns a MovieReview instance; Cycles only manages the budget lifecycle around it.

Error handling

When the budget is insufficient, BudgetExceededError is raised before the agent runs:

python

from runcycles import BudgetExceededError

try:
    answer = ask("Summarize recent ML papers")
except BudgetExceededError:
    answer = "Budget exhausted — please try again later."

For agents with tool calls, each tool decorated with @cycles can independently raise BudgetExceededError. Handle this at the outer call to catch failures at any level:

python

try:
    result = ask_with_tools("Find the latest sales data and summarize it")
except BudgetExceededError as e:
    print(f"Budget limit hit: {e}")
    result = fallback_response()

See Degradation Paths for patterns like queueing, model downgrade, and caching.

Key points

Decorator wraps any function. The @cycles decorator works with agent.run_sync(), async runs, and tool functions alike.
Tool-level budgets. Decorate individual @agent.tool functions for per-tool cost visibility.
Structured output is preserved. Cycles does not alter your function's return type or Pydantic models.
The agent never runs on DENY. If the budget is exhausted, the LLM is never called, saving both cost and latency.

Next steps

Error Handling Patterns in Python — handling budget errors in Python
Handling Streaming Responses — budget-managed streaming
Testing with Cycles — testing budget-guarded code
Production Operations Guide — running Cycles in production
Integrating with OpenAI — if your Pydantic AI agent uses OpenAI models
Integrating with Anthropic — if your Pydantic AI agent uses Anthropic models

Integrating Cycles with Pydantic AI ​

Prerequisites ​

Guarding model calls ​

Tool call budget scoping ​

Structured output with budget control ​

Error handling ​

Key points ​

Next steps ​

Integrating Cycles with Pydantic AI

Prerequisites

Guarding model calls

Tool call budget scoping

Structured output with budget control

Error handling

Key points

Next steps