Claude Sonnet 4.6
Hybrid reasoning model with superior intelligence for agentsfeaturing a 1M context window
Announcements
- NEW
Claude Sonnet 4.6
Feb 172026
Sonnet 4.6 delivers frontier performance across codingagentsand professional work at scale. It can compress multi-day coding projects into hours and deliver production-ready solutions.
Read more
Claude Sonnet 4.5
Sep 292025
Sonnet 4.5 is the best model in the world for agentscodingand computer use. It’s also our most accurate and detailed model for long-running taskswith enhanced domain knowledge in codingfinanceand cybersecurity.
Read more
Claude Sonnet 4
May 222025
Sonnet 4 improves on Sonnet 3.7 across a variety of areasespecially coding. It offers frontier performance that’s practical for most AI use casesincluding user-facing AI assistants and high-volume tasks.
Read more
Claude Sonnet 3.7 and Claude Code
Feb 242025
Sonnet 3.7 is the first hybrid reasoning model and our most intelligent model to date. It’s state-of-the art for coding and delivers significant improvements in content generationdata analysisand planning.
Read more
Availability and pricing
Anyone can chat with Claude using Sonnet 4.6 on Claude.aiavailable on webiOSand Android.
For developers interested in building agentsSonnet 4.6 is available on the Claude Platform nativelyand in Amazon BedrockGoogle Cloud's Vertex AIand Microsoft Foundry. The 1M token context window is currently available in beta on the API only.
Pricing for Sonnet 4.6 starts at $3 per million input tokens and $15 per million output tokenswith up to 90% cost savings with prompt caching and 50% cost savings with batch processing. To learn morecheck out our pricing page. To get startedsimply use claude-sonnet-4-6 via the Claude API.
Use cases
Sonnet 4.6 is a powerfulversatile model built for daily usescaled productionand complex tasks across codingagentsand professional workflows.
Sonnet 4.6 can produce near-instant responses or extendedstep-by-step thinking. API users also have fine-grained control over the model's thinking effort. Popular use cases include:
Advanced coding
Sonnet 4.6 delivers frontier coding performance across the entire software development lifecycle. From initial planning and implementation to debuggingmaintenanceand large-scale refactorsit excels at reasoning through complexmulti-file codebasesproducing precise implementationsand iterating with minimal back-and-forth.
Long-running agents
Sonnet 4.6 offers superior instruction followingtool selectionand error correction for autonomous AI workflows. It reliably handles complexmulti-step tasks that require sustained coherence and adaptive decision-makingmaking it an ideal backbone for customer-facing agentsinternal automationand production-grade AI systems that need to operate independently at scale.
Browser and computer use
Sonnet 4.6 excels in computer use capabilitiesreliably handling any browser-based task from competitive analysis to procurement workflows to customer onboarding. Building on the foundation that made Sonnet the first frontier AI model to use computersSonnet 4.6 navigates digital environments with greater accuracy and reliabilityenabling enterprises to automate workflows that previously required human intervention.
Enterprise workflows
Sonnet 4.6 handles high-volumehigh-stakes professional workflows across financeresearchcontentand business operations. It can analyze complex financial datasynthesize insights from internal and external sourcesgenerate and edit documents and spreadsheetsand produce compelling written content with nuance and precision.
Benchmarks
Sonnet 4.6 achieves frontier performance on the benchmarks that matter most for real-world deploymentfrom agentic coding and tool use to complex reasoning and long-context tasks.

Trust & Safety
We’ve conducted extensive testing and evaluation of Sonnet 4.6working with external experts to ensure it meets our standards for safetysecurityand reliability. The accompanying model card covers safety results in depth.
Hear from our customers
Out of the gateClaude Sonnet 4.6 is already excelling at complex code fixesespecially when searching across large codebases is essential. For teams running agentic coding at scalewe're seeing strong resolution rates and the kind of consistency developers need.
Claude Sonnet 4.6 produced the best iOS code we've tested for Rakuten AI. Better spec compliancebetter architectureand it reached for modern tooling we didn't ask forall in one shot. The results genuinely surprised us.
The performance-to-cost ratio of Claude Sonnet 4.6 is extraordinary — it’s hard to overstate how fast Claude models have been evolving in recent months. Sonnet 4.6 outperforms on our orchestration evalshandles our most complex agentic workloadsand keeps improving the higher you push the effort settings.
Claude Sonnet 4.6 is noticeably more capable on the hard problems where Sonnet 4.5 falls shortand shows strength on tasks that normally require the autonomy and agentic capabilities of an Opus model.
Claude Sonnet 4.6 punches way above its weight class for the vast majority of real-world PRsand even improving more than 10 points on the hardest bug finding problems over Sonnet 4.5. For the code that actually shipsit's become our default.
On our long-horizon coding eval — where every feature builds on the last and you live with your earlier decisions — Sonnet 4.6 matches Opus 4.5. Fewer tokens. Faster. For most engineeringstart here.
Claude Sonnet 4.6 has perfect design taste when building frontend pages and data reportsand it requires far less hand-holding to get there than anything we've tested before.
Claude Sonnet 4.6 is fastercheaperand more likely to nail things on the first try. That was a surprising set of improvementsand we didn't expect to see it at this price point.
Claude Sonnet 4.6 benchmarks near Opus-level on the coding tasks we care aboutwith meaningfully better instruction-following and tool reliability than its predecessor. We’re transitioning our Sonnet traffic over to this model.
Claude Sonnet 4.6 meaningfully improves the answer retrieval behind our core product — we saw a significant jump in answer match ratecompared to Sonnet 4.5in our Financial Services Benchmarkwith better recall on the specific workflows our customers depend on.
Claude Sonnet 4.6 rivals the best models on UI layout and leaps past the previous Sonnet. It's the model we want building apps for our users.
We're moving the majority of our traffic to Claude Sonnet 4.6. With adaptive thinking and high effortwe see Opus-level performance on all but our hardest analytical tasks with a more efficient and flexible profile. At Sonnet pricingit's an easy call for our workloads.
Claude Sonnet 4.6 hit 94% on our complex insurance computer use benchmarkthe highest of any Claude model we tested. It reasons through failures and self-corrects in ways we haven't seen before.
Box evaluated how Claude Sonnet 4.6 performs when tested on deep reasoning and complex agentic tasks across real enterprise documents. It demonstrated significant improvementsoutperforming Claude Sonnet 4.5 in heavy reasoning Q&A by 15 percentage points.
Claude Sonnet 4.6 delivers frontier-level results on complex app builds and bugfixing. It's becoming our go-to for the kind of deep codebase work that used to require more expensive models.
Claude Sonnet 4.6's consistency across complex multi-character stories is genuinely impressive — it keeps each character's voice distinct in ways that make our narratives feel more alive.
We’ve been impressed by how accurately Claude Sonnet 4.6 handles complex computer use. It’s a clear improvement over anything else we’ve tested in our evals.
Claude Sonnet 4.6 excels at the behaviors that matter most for real knowledge workscoring high in our internal evalswhile making fewer tool calls and hitting fewer tool errors. The result is an AI that handles complicatedmulti-step requests more reliably and produces more consistentpolished work.
Claude Sonnet 4.6 matches Opus 4.6 performance on OfficeQAwhich measures how well a model can read enterprise documents (chartsPDFstables)pull the right factsand reason from those facts. It's a meaningful upgrade for document comprehension workloads.
Claude Sonnet 4.6 achieves speedqualityand economy. We especially like the 1M token context for larger projects. It surprises with intuitive and thoughtful commentspushing back thoughtfully and genuinely understanding the goals of its work.
Claude Sonnet 4.6 is a notable improvement over Sonnet 4.5 across the boardincluding long horizon tasks and more difficult problems.
Claude Sonnet 4.6 has meaningfully closed the gap with Opus on bug detectionletting us run more reviewers in parallelcatch a wider variety of bugsand do it all without increasing cost.
For the first timeClaude Sonnet 4.6 brings frontier-level reasoning in a smaller and more cost effective form factor. It provides a viable alternative if you are a heavy Opus user.
Claude Sonnet 4.6 is a significant leap forward on reasoning through difficult tasks. We find it especially strong on branched and multi-step tasks like contract routingconditional template selectionand CRM coordination — exactly where our customers need strong model sense and reliability.
In Rovo Dev testingClaude Sonnet 4.6 proved to be a highly effective main agentleveraging subagents to sustain longer-running tasks.
Claude Sonnet 4.6 shows impressive progress in reasoningcode understandingand memory—key ingredients for agentic automation. Our early tests integrating it into Postman Agent Mode point to smoothermore capable end-to-end workflowsand we’re excited about what this means for API developers building with automation at scale.
Claude Sonnet 4.6 was exceptionally responsive to direction — delivering precise figures and structured comparisons when askedwhile also generating genuinely useful ideas on trial strategy and exhibit preparation.
In our zero-shot app building experimentsSonnet 4.6 ran up to 3-4x longer without interventionproducing functional apps on par with the Opus series.
Claude Sonnet 4.6 produced zero hallucinated links in our computer use evals. Previouslywe saw about one in three hallucinated links. That kind of reliability is what actually makes browser automation deployable at scale.
On our filesystem benchmarkClaude Sonnet 4.6 is 70% more token-efficient than Sonnet 4.5 model with a 38% accuracy improvement. That changes the economics of what we can build.
See Claude in action
Coding
What should I look for when reviewing a Pull Request for a Python web app?
Writing
Create a 3-month editorial calendar template for a weekly newsletter
Students
What's an effective study schedule template for final exams?
Frequently asked questions
We offer different models across the spectrum of speedpriceand performance. Sonnet 4.6 delivers superior intelligence with optimal efficiency for high-volume use cases. We recommend Sonnet 4.6 for most AI applications where you need a balance of advanced capabilities and cost-efficient performance at scale. Common use cases include customer-facing agentsproduction coding workflowscontent generation at scaleand real-time research tasks.
Pricing depends on how you want to use Sonnet 4.6. To learn morecheck out our pricing page.
Sonnet 4.6 is both a standard model and a hybrid reasoning model in one: you can pick when you want the model to answer normally and when you want it to use extended thinking.
Extended thinking mode is best when performance and accuracy matter more than latency. It significantly improves response quality for complex reasoning tasksextended agentic workmulti-step coding projectsand deep research. Thinking summaries help you understand key aspects of the model's reasoning process.