How to Get an Anthropic (Claude) API Key — And What It Really Costs in 2026
Ampliflow
Advanced AI frontier lab and business growth agency. Helping UK businesses deploy agentic AI systems.

Getting a Claude API key takes about five minutes. Understanding what it will cost you takes a little longer — and that is the part worth getting right. This guide covers both: the steps to a working key, and the pricing mechanics that decide whether your bill is sensible or silly.
Last updated: May 2026 · Covers Claude Opus 4.7, Sonnet 4.6, Haiku 4.5
TL;DR: You get a Claude API key from the Claude Console — create an organisation, add a small amount of credit, and generate a key in seconds. New accounts start on Tier 1 (a $5 deposit unlocks a $500/month spend limit) and advance automatically as you spend. Pricing is per token: Opus 4.7 at $5/$25 per million input/output tokens, Sonnet 4.6 at $3/$15, Haiku 4.5 at $1/$5. Two features cut real money — prompt caching (cache reads cost 10% of the input price) and the Batch API (50% off). Most UK businesses do not need the raw API at all; a Claude.ai or Claude Code subscription covers using Claude. The API is for building on Claude. This guide shows you how to get set up safely and forecast the cost before you commit.
Contents
- Do you actually need an API key?
- How to get a Claude API key, step by step
- How usage tiers and spend limits work
- What does the Claude API actually cost?
- The two levers that cut your bill
- The cost traps to know before you build
- How to keep your API key secure
- Frequently asked questions
Do you actually need an API key?
Most people who think they need the Claude API need a subscription instead. It is worth thirty seconds to be sure, because the two are billed completely differently.
- If you want to chat with Claude, research, or write — you want a Claude.ai plan, starting around £17/month.
- If you want Claude to write and ship code in your terminal — you want Claude Code, bundled into a flat per-seat subscription.
- If you want to build Claude into your own product, automation, or agent — then you want the API, billed per token, controlled by an API key.
The API is the raw material. A subscription is the finished tool. If your goal is "use Claude", buy the tool. If your goal is "build something that uses Claude", read on — the key is what you need, and the cost mechanics below are what will keep your bill under control.
How to get a Claude API key, step by step
The whole process lives in the Claude Console, and it is fast. Here is the path from nothing to a working key.
- Create an account at the Claude Console. Sign up with a work email. This creates your organisation — the billing and access boundary for everything that follows.
- Add a payment method and a small amount of credit. A minimum of $5 puts you on Tier 1. Billing is in US dollars regardless of where you are; for a UK business, treat it as a US-dollar software cost and let your accountant handle the FX and expensing.
- Generate an API key. In the Console, go to Settings → API keys and create one. Copy it immediately — you will not be shown the full key again. It looks like
sk-ant-.... - Set a spend limit. Before you write a line of code, go to Settings → Limits and set a customer-defined spend cap below your tier ceiling. This is your seatbelt. A runaway loop in test code cannot cost more than the cap you set.
- Use workspaces to isolate projects. If you are running more than one project or client on the same organisation, create a workspace per project, each with its own key and its own spend and rate limits. A bug in one workspace cannot drain another.
That is it. You now have a key, a spend ceiling, and a clean separation between projects. New accounts also receive a small amount of free credit to test with before you spend anything real.
How usage tiers and spend limits work
Anthropic uses a tiered system that raises your monthly ceiling as your cumulative spend grows — automatically, with no application. You do not ask to move up. You move up by depositing.
| Usage tier | Cumulative credit purchased | Monthly spend limit |
|---|---|---|
| Tier 1 | $5 | $500 |
| Tier 2 | $40 | $500 |
| Tier 3 | $200 | $1,000 |
| Tier 4 | $400 | $200,000 |
| Monthly invoicing | By arrangement | No limit |
The deposit figure is cumulative — the total you have ever added, not a single payment. Cross the threshold and you advance immediately. Each tier also raises your rate limits: the requests and tokens per minute you are allowed to push. On Tier 1, Opus models allow 50 requests per minute and 500,000 input tokens per minute; by Tier 4 that is 4,000 requests and 10 million input tokens a minute.
For most UK SMEs building a first product or internal tool, Tier 1 or Tier 2 is plenty. You only need the higher tiers when you are serving real production traffic at volume. When you outgrow Tier 4, monthly invoicing removes the cap entirely and switches you to Net-30 terms — the point at which you should be talking to a human about volume pricing anyway.
What does the Claude API actually cost?
You pay per token, input and output billed separately, and output always costs five times the input rate. That last fact is the one that shapes a sensible system: a model that answers concisely is dramatically cheaper than one that thinks out loud, even at the same headline price.
| Model | Input ($/M tokens) | Output ($/M tokens) |
|---|---|---|
| Opus 4.7 | $5 | $25 |
| Sonnet 4.6 | $3 | $15 |
| Haiku 4.5 | $1 | $5 |
A token is roughly three-quarters of an English word. To make that concrete, here is a real worked example from Anthropic's own documentation:
Processing 10,000 customer-support conversations (averaging ~3,700 tokens each) on Haiku 4.5 costs about $37 in total. The same workload on Opus 4.7 would cost roughly five times as much, for a task Haiku handles perfectly. Choosing the right model is not a technical preference. It is the single biggest line item you control.
Read that number again: ten thousand customer conversations, handled by frontier-grade AI, for the price of lunch. A year ago that did not exist at any price.
It quietly reframes the whole question — from can we afford to use AI here to can we afford not to.
This is why model routing matters more than model choice. We cover the full decision framework in the companion piece, Claude Models Explained: Opus 4.7 vs Sonnet 4.6 vs Haiku 4.5 — but the short version is: default to Sonnet, push high-volume simple work down to Haiku, and reserve Opus for the genuinely hard tasks.
The two levers that cut your bill
Prompt caching and the Batch API are the difference between an API bill that scales sensibly and one that does not. Both are free to turn on. Most teams discover them too late.
Prompt caching lets you store a large, fixed chunk of context (a system prompt, a knowledge base, a codebase, a long document) and reuse it across requests at a fraction of the price. The economics:
| Cache operation | Cost vs base input price |
|---|---|
| Cache read (a "hit") | 0.1× — a 90% discount |
| 5-minute cache write | 1.25× |
| 1-hour cache write | 2× |
A cache hit costs one-tenth of the standard input price. For any agent that re-reads the same context on every turn, which is almost all of them, this is the largest single saving available. There is a second, quieter benefit: on most Claude models, cached input tokens do not count towards your rate limit. A team with an 80% cache hit rate can effectively process five times more input per minute than their tier nominally allows.
The Batch API processes non-urgent work asynchronously at a flat 50% discount on both input and output. If the job does not need to happen this second — overnight reporting, bulk data enrichment, large backfills, classification runs — batch it and halve the cost. The two discounts stack: batch a cached workload and the savings compound.
The cost traps to know before you build
Three things catch teams out, and all three are avoidable if you know them in advance.
The new tokenizer on Opus 4.7. Opus 4.7 uses a different tokenizer to earlier models — and Anthropic notes it can use up to 35% more tokens for the same text. If you are migrating a cost estimate from Opus 4.6 or Sonnet, do not assume the token counts carry over. Re-measure on the model you will actually ship, or you will under-budget by up to a third.
Server-side tools have their own meter. If your agent uses Claude's built-in web search, that is billed at $10 per 1,000 searches on top of tokens. Web fetch has no surcharge — you pay only for the tokens of whatever it pulls in. Code execution gives you 1,550 free container-hours a month, then $0.05 per hour. These are small, but they are separate from the per-token price, and they surprise people who budgeted on tokens alone.
Output is the expensive half. Because output costs 5× input, a verbose system is structurally more expensive than a concise one at the same volume. Designing prompts that produce tight, structured answers — and reserving long generations for when they genuinely add value — is a real cost lever, not a stylistic nicety.
How to keep your API key secure
An API key is a credential that spends your money. Treat it like a password, because that is exactly what it is. The basics are not optional:
- Never commit a key to a repository. Store it in an environment variable or a secrets manager. In CI, use the platform's secret store — for example, a GitHub Actions secret named
ANTHROPIC_API_KEY, referenced as${{ secrets.ANTHROPIC_API_KEY }}, never pasted into the workflow file. - Set a spend limit on every key. The Console lets you cap monthly spend below your tier ceiling. This turns "a bug drained my account" into "a bug hit a cap and stopped."
- Use one workspace per project. Isolation means a compromised or buggy key in one project cannot spend another project's budget.
- Rotate keys when people leave or when in doubt. Generating a new key and revoking the old one takes seconds.
These are the same disciplines we apply on every production system we ship. The cost of getting them wrong (a leaked key, a runaway loop, a drained account) is far higher than the minute it takes to get them right.
Frequently asked questions
Is the Anthropic API the same as ChatGPT's API?
No. The Anthropic (Claude) API gives you access to the Claude family of models — Opus 4.7, Sonnet 4.6, Haiku 4.5 — billed per token. It is a separate service from OpenAI's API, with its own keys, pricing, and console. The two are not interchangeable, though both speak broadly similar request formats.
How much does the Claude API cost per month?
There is no fixed monthly fee — you pay only for what you use, in tokens. A light internal tool might cost a few dollars a month; a high-volume production agent, hundreds or thousands. Your ceiling is set by your usage tier (from $500/month on Tier 1 up to $200,000 on Tier 4), and you can set a lower customer-defined cap to control spend. Prompt caching and the Batch API can cut the real cost substantially.
Is there a free Claude API tier?
New accounts receive a small amount of free credit to test with, but there is no permanent free tier — the API is pay-as-you-go. If you only want to use Claude rather than build on it, the free Claude.ai plan covers chat at no cost.
Do I need the API, or will a subscription do?
If you want to use Claude — chat, research, coding — a subscription is enough and usually cheaper. Claude.ai plans start around £17/month and Claude Code bundles model access for engineering. You only need the API when you are building Claude into your own product, automation, or agent and want to control routing, caching, and integration yourself.
How do I lower my Claude API bill?
Four levers, in order of impact: (1) route each task to the cheapest capable model — Haiku for high-volume simple work, Sonnet for most things, Opus only for the hard cases; (2) turn on prompt caching for any repeated context (cache reads are 90% cheaper); (3) use the Batch API for non-urgent jobs (50% off); (4) design prompts for concise output, since output costs 5× input.
Can I use the Claude API from the UK?
Yes. The API is global and available to UK businesses. Billing is in US dollars, so account for FX, and treat it as a deductible software expense — your accountant can advise on VAT treatment for a US-supplied digital service. There are no UK-specific restrictions on access.
Related reading
- ↔ Claude Models Explained: Opus 4.7 vs Sonnet 4.6 vs Haiku 4.5 — which model to point your key at, and why routing beats defaulting
- ↔ The Claude Agent SDK: Building Production Agents — the fastest way to turn an API key into a working agent
- ↔ What Is MCP (Model Context Protocol)? — connecting your Claude-powered app to real tools and data
- ↔ Claude Code Pricing 2026 — Real Cost for UK Businesses — the subscription route, for when you want the tool not the raw API
- ↔ The Latest Claude Code Features — what the subscription tool can do before you reach for the API
What should you do next?
If you are building on the Claude API, the expensive mistakes are made in the first week — the wrong model on a high-volume path, no caching, no spend cap. They are easy to avoid and painful to unwind.
We help UK businesses design Claude-powered systems that are cheap to run and safe to scale — the right model routing, caching from day one, and spend controls that mean a bug costs you a coffee, not a salary.
See how Ampliflow builds on Claude in production →
Or start with a free audit of your use case, your projected volumes, and the model routing that fits: Book a free AI audit →
The API is the cheapest frontier intelligence has ever been. Spend it deliberately, and it pays for itself many times over.
Ampliflow is a UK AI frontier lab and growth agency based in Solihull, West Midlands. We ship production AI systems for UK SMEs and enterprises using Claude, the Amplex orchestration framework, and reviewing-model verification. Our case studies are named, our methodology is published, and our team builds with Claude daily.