Summarize this article with:

summary

Claude Sonnet 5 API uses model ID claude-sonnet-5, with a 1M-token context window and 128K max output.
Pricing is $2/$10 per 1M input/output tokens until Aug 31, 2026, then $3/$15, with no long-context premium.
Best fit: agentic coding, multi-file refactors, long-document analysis, vision, and computer-use tasks.
Access options: direct Anthropic API, AWS Bedrock, Google Vertex AI, or Eden AI for one OpenAI-compatible API, unified billing, routing, and fallback across 500+ models.

Claude Sonnet 5 API is Anthropic’s Sonnet-class model for coding, agents, vision, and tool-based workflows. Its model ID is claude-sonnet-5, with a 1M-token context window, $2/$10 intro pricing per million input/output tokens, and availability through the Anthropic API, AWS Bedrock, and Google Vertex AI. It is also accessible through Eden AI alongside 500+ LLMs and AI models.

What is Claude Sonnet 5?

Claude Sonnet 5 is the upgrade to Sonnet 4.6. It sits below Opus 4.8 and above Haiku in Anthropic’s model family. The goal is clear: near-Opus quality on coding and agentic tasks, at Sonnet-level cost.

The model supports a 1M-token context window, up to 128K output tokens, and adaptive thinking turned on by default. It also supports high-resolution vision up to 2576px, tool use, prompt caching, structured outputs with JSON schema, web search, computer use, and effort levels including xhigh.

Developers can access Claude Sonnet 5 through the Anthropic API, AWS Bedrock, Google Vertex AI, and GitHub Copilot. For teams comparing providers or building fallback across models, Eden AI lets you route Claude Sonnet 5 alongside other LLMs through one OpenAI-compatible API.

Claude Sonnet 5 specifications

Specification	Value
Model ID	`claude-sonnet-5`
Context window	1M tokens
Max output	128K tokens
Modalities	Text, vision (high-resolution up to 2576px), tool use
Thinking mode	Adaptive thinking (enabled by default)
Effort levels	Standard levels, including xhigh

1M-token context at standard pricing: Claude Sonnet 5 includes its full 1M-token context window without a separate long-context premium, making it easier to process large codebases and long documents with predictable costs.

xhigh effort for complex workloads: For the most demanding coding and agentic workflows, Claude Sonnet 5 supports an xhigh effort level, allowing the model to spend more compute on difficult reasoning tasks. Through Eden AI, you can combine this with routing and fallback strategies without changing your application code.

Claude Sonnet 5 pricing

Claude Sonnet 5 costs $2 per million input tokens and $10 per million output tokens during the introductory period, valid through August 31, 2026. After that, standard pricing moves to $3 per million input tokens and $15 per million output tokens.

Model	Input price / 1M tokens	Output price / 1M tokens
Claude Sonnet 5	$2 intro / $3 standard	$10 intro / $15 standard
Claude Sonnet 4.6	$3.00	$15.00
Claude Opus 4.8	$5.00	$25.00

Two cost levers matter in production: prompt caching and batch processing. Prompt caching reduces repeated input costs when you reuse long system prompts, documents, or codebase context. Batch processing can lower costs for non-real-time workloads such as evaluations, document processing, and offline agent runs.

Claude Sonnet 5 benchmarks & performance

Claude Sonnet 5 performs near the top tier on real-world coding and agentic workloads, including SWE-bench-style tasks. Anthropic positions it as statistically close to leading GPT and Gemini peers, but exact benchmark scores should be verified before publication.

Area	Claude Sonnet 5 performance
Real-world coding	Statistically close to top GPT/Gemini peers on SWE-bench-style tasks; exact score: 63.2%
Code quality	Strong at producing clean, maintainable code with practical structure
Multi-file reasoning	Handles large codebases and cross-file dependencies well, helped by the 1M-token context window
Debugging	Effective for logic bugs, edge cases, race conditions, and regression analysis
Agentic tasks	Strong fit for long-horizon workflows that require tools, planning, and iterative execution
Vision and computer use	Gains in high-resolution visual analysis and computer-use workflows

When to use Claude Sonnet 5

Use the Claude Sonnet 5 API when you need strong reasoning without moving every request to Opus-level pricing.

Agentic coding and multi-file refactors: good fit for code generation, repository-wide changes, debugging, and tool-based developer agents.
Long-context work: the 1M-token context window works well for large codebases, long documents, contract review, and technical documentation analysis.
Cost-sensitive production: Sonnet 5 is useful when Opus 4.8 is more power than the task needs, but Haiku is too limited.
Vision and computer-use tasks: use it for high-resolution visual inputs, UI reasoning, browser workflows, and computer-use agents.

Do not use Sonnet 5 for every request. Route simple or latency-critical tasks to Haiku, and reserve Opus 4.8 for the hardest reasoning jobs. With Eden AI routing, you can apply these choices automatically across Claude and other models.

How to access the Claude Sonnet 5 API

The direct Anthropic API is the simplest path if your stack only needs Claude models and you already manage provider-specific credentials.

AWS Bedrock and Google Vertex AI fit teams that want Claude Sonnet 5 inside existing cloud governance, IAM, and procurement workflows.

Eden AI is the lowest-friction option for multi-model teams. You can access Claude Sonnet 5 through one OpenAI-compatible API, then route or fallback to other models based on cost, performance, or region.

Access path	Setup effort	Billing	Multi-model	Fallback
Anthropic API directly	Low. Create an Anthropic account, get an API key, and call `claude-sonnet-5`.	Anthropic billing	Anthropic models only	Build your own fallback logic
Cloud providers	Medium. Enable Claude through AWS Bedrock or Google Vertex AI. Model IDs differ by platform.	AWS or Google Cloud billing	Models available inside each cloud	Possible, but usually requires custom routing
Eden AI unified API	Low. Use one API key with an OpenAI-compatible API.	Unified Eden AI billing	500+ LLMs and AI models	Built-in routing and automatic fallback

How to use Claude Sonnet 5 with Eden AI

Step 1: Create an Eden AI account and get an API key

Create an account here, open your API keys page, and copy your key.

Step 2: Call claude-sonnet-5

import requests

response = requests.post(
    "https://api.edenai.run/v3/chat/completions",
    headers={
        "Authorization": "Bearer YOUR_API_KEY",
        "Content-Type": "application/json",
    },
    json={
        "model": "claude-sonnet-5",
        "messages": [
            {
                "role": "user",
                "content": "Write a Python function that validates an email address.",
            }
        ],
    },
)

data = response.json()
print(data["choices"][0]["message"]["content"])

‍

Step 3: Swap models or add fallback

One-line model swap

model = "claude-sonnet-5"  # swap to another Eden AI model when needed

‍

Simple fallback config

response = client.chat.completions.create(
    model="claude-sonnet-5",
    messages=[
        {"role": "user", "content": "Refactor this function for readability."}
    ],
   "fallbacks": ["openai/gpt-4o", "google/gemini-3.5-flash"]
     )

‍

FAQs - Claude Sonnet 5

What is the Claude Sonnet 5 model ID?

The Claude Sonnet 5 model ID is claude-sonnet-5. Use this model identifier when calling the Claude Sonnet 5 API through supported platforms.

How much does the Claude Sonnet 5 API cost?

Claude Sonnet 5 costs $2 per million input tokens and $10 per million output tokens through August 31, 2026. After the introductory period, pricing becomes $3 per million input tokens and $15 per million output tokens.

What is Claude Sonnet 5's context window?

Claude Sonnet 5 supports a 1 million-token context window with no long-context pricing premium. It also supports up to 128K output tokens.

Is Claude Sonnet 5 better than GPT-5.6 or Gemini 3.1?

It depends on the workload. Anthropic positions Claude Sonnet 5 as statistically close to leading GPT and Gemini models on real-world coding and agentic tasks, but the best choice depends on your quality, latency, and cost requirements.

How do I get a Claude Sonnet 5 API key?

You can get access through the Anthropic API by creating an Anthropic account and generating an API key. Claude Sonnet 5 is also available through AWS Bedrock, Google Vertex AI, GitHub Copilot, and unified platforms such as Eden AI .

Is Claude Sonnet 5 available on AWS Bedrock and Vertex AI?

Yes. Claude Sonnet 5 is available through both AWS Bedrock and Google Vertex AI, in addition to the Anthropic API and GitHub Copilot. Model identifiers may differ depending on the platform.

Last updated onJuly 1, 2026

Taha Zemmouri

Taha Zemmouri is the CEO and co-founder of Eden AI. With previous experience in AI consulting, he brings a strong business perspective to artificial intelligence and focuses on turning AI capabilities into practical value for companies. With a background in data science and a real entrepreneurial mindset, he combines technical understanding, business vision, and hands-on execution to make AI more accessible and easier to integrate.

Claude Sonnet 5: Pricing, Benchmarks & API Access (2026)