Claude vs GPT Cost Calculator

A free interactive calculator that compares per-call, per-day, per-month, and per-year cost across the major Claude and OpenAI models for any token volume and call rate.

Tip: Open fullscreen ↗ for a wider table, share/export buttons, and a shareable URL that preserves your configuration. The same toolbar is also available below.

Workload name (optional)Short description (optional)

Input tokens / callOutput tokens / callCalls / day

Per call	Per day	Per month	Per year
$0.0340	$340	$10,200	$124,100
$0.0170	$170	$5,100	$62,050
$0.0051	$51.00	$1,530	$18,615
$0.0014	$14.00	$420	$5,110
$0.0300	$300	$9,000	$109,500
$0.0300	$300	$9,000	$109,500
$0.0180	$180	$5,400	$65,700
$0.0060	$60.00	$1,800	$21,900

Defaults reflect published list pricing as of 2026-04 from OpenAI and Anthropic; verify before relying on these for budgeting and edit any cell to match your contracted rate. Calculation: (input_tokens × input_$/M + output_tokens × output_$/M) ÷ 1,000,000 × calls/day. Excludes prompt caching, batch discounts, fine-tuning, fast-mode, and data-residency premiums.

How the calculation works

Model prices change frequently. The defaults are starting points, not a pricing guarantee — edit any rate to match the current provider pricing page or your contracted rates.

The cost for a single LLM call is:

cost_per_call = (input_tokens × input_price_per_M + output_tokens × output_price_per_M) ÷ 1,000,000

Per-day, per-month, and per-year columns multiply by calls_per_day, then by 30 and 365 respectively.

The pricing rates default to widely-published reference values, but they are user-editable above so you can plug in current or contracted rates. The cheapest per-year row is highlighted.

What the calculator does not include

Prompt caching discounts. Anthropic's cache_control blocks bill cache hits at a fraction of base input pricing (for example, Opus 4.7 cache hits at $0.50 / MTok vs $5 / MTok base input). OpenAI also offers cached-input and Batch API pricing on supported models, but the discount varies by model. If a meaningful share of your prompts are reused, your real cost may be 30–60% lower than the calculator suggests.
Batch API discounts. Both providers offer batch-processing discounts (Anthropic at 50%; OpenAI varies).
Fine-tuning costs. Per-token rates differ for fine-tuned model variants.
Reserved or committed-use pricing. Enterprise contracts often beat list pricing materially.
Context-window pricing tiers. Some providers charge differently for very long contexts.

For accurate enterprise planning, treat the calculator as a directional estimate and verify with your provider account manager.

Why estimates are not the same as runtime authority

A common pattern: a team uses a calculator like this to project monthly cost at $4,000, sets up an alert for "$5,000 exceeded," and then loses $40,000 in a weekend to an agent that loops while the on-call team sleeps.

Calculators answer "what should we spend?" Cycles answers a broader question — "what *should be allowed to happen at all?" — by enforcing both budgets and action authority at runtime, before each call or tool invocation leaves your application. Cost is one dimension of that. The other dimensions matter just as much in production:

Blast radius. A single agent action — a deploy, an email blast, a database mutation — can cost more in damage than the agent's entire month of LLM bills. Action authority caps what agents do, not just how much they spend.
Risk-weighted authorization. Not every tool call is equal. Reading a file is not the same as sending a refund or executing code. RISK_POINTS lets you allocate authority by danger, not by token count.
Multi-tenant boundaries. A noisy tenant cannot drain shared headroom from quiet ones. See Multi-tenant SaaS.

If your projected $4,000/month is the actual constraint, the right response is not an alert at $5,000 but a pre-execution gate that will not let calls proceed beyond the cap — and an action-authority layer that prevents the catastrophic single mistake that no cost calculator can predict.

Why Cycles — cost, action authority, multi-tenant isolation, and governance, together
Cost Estimation Cheat Sheet — how to size budgets accurately
Action authority: controlling what agents do — the blast-radius dimension
Debugging sudden LLM cost spikes — what to do when the calculator was wrong
Cycles vs Provider Spending Caps — why provider caps do not bound multi-tenant spend

Claude vs GPT Cost Calculator ​

How the calculation works ​

What the calculator does not include ​

Why estimates are not the same as runtime authority ​

Related ​

Claude vs GPT Cost Calculator

How the calculation works

What the calculator does not include

Why estimates are not the same as runtime authority

Related