Claude 3.7 Sonnet

Claude 3.7 Sonnet
Claude 3.7 Sonnet

Hybrid reasoning model, state-of-the-art coding skills, computer use, and 200K context window

Announcements

  • New

    Claude 3.7 Sonnet and Claude Code

    Feb 24, 2025

    Claude 3.7 Sonnet is the first hybrid reasoning model and our most intelligent model to date. It’s state-of-the art for coding and delivers significant improvements in content generation, data analysis, and planning.

  • Claude 3.5 Haiku and a new Claude 3.5 Sonnet

    Oct 22, 2024

    Claude 3.5 Sonnet, the predecessor to Claude 3.7 Sonnet, offered state-of-the-art skills for real-world software engineering tasks, agentic capabilities, and computer use in public beta.

Availability and pricing

For developers interested in building custom AI solutions with Claude 3.7 Sonnet, it is available on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.

For business users and consumers who want to collaborate with Claude 3.7 Sonnet using a simple chat experience, Claude 3.7 Sonnet is available on Claude.ai for all users across the web, iOS, and Android.

Pricing for Claude 3.7 Sonnet starts at $3 per million input tokens and $15 per million output tokens, with up to 90% cost savings with prompt caching and 50% cost savings with batch processing. To learn more, check out our pricing page.

Use cases

Claude 3.7 Sonnet can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data. Combined with state-of-the-art coding, vision, and writing skills, you can use Claude 3.7 Sonnet for a variety of use cases.

Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users also have fine-grained control over how long the model thinks for. Popular use cases include:

Code generation

Claude 3.7 Sonnet is state-of-the-art for agentic coding, and can complete tasks across the entire software development lifecycle—from initial planning to bug fixes, maintenance to large refactors. It offers strong performance in both planning and solving for complex coding tasks, making it an ideal choice to power end-to-end software development processes.

Claude 3.7 Sonnet supports up to 128K output tokens (beta)—over 15x longer than before. This is particularly valuable for rich code generation and planning.

Computer use

By integrating Claude via API, developers can direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking buttons, and typing text. Claude 3.5 Sonnet was the first frontier AI model to be able to use computers in this way. Claude 3.7 is our most accurate model to reliably use computers in this way—albeit experimentally in public beta—and we expect the capability to improve over time.

Advanced chatbots

With enhanced reasoning and a warm, human-like tone, Claude 3.7 Sonnet is ideal for chatbots that need to connect data and take action across a variety of systems and tools.

Knowledge Q&A

Claude 3.7 Sonnet offers a large context window and low rates of hallucination, making it ideal for answering questions around large knowledge bases, documents, and codebases.

Visual data extraction

Claude 3.7 Sonnet is able to extract information from visuals like charts, graphs, and complex diagrams with ease—making it an ideal AI model for data analytics and data science tasks.

Customer-facing agents

Claude 3.7 Sonnet offers superior instruction following, tool selection, error correction, and advanced reasoning for customer-facing agents and complex AI workflows.

Content generation and analysis

Claude 3.7 Sonnet excels at writing and is able to understand nuance and tone to generate more compelling content and analyze content on a deeper level.

Robotic process automation

Automate repetitive tasks or processes with Claude 3.7 Sonnet. It offers industry-leading instruction following and is capable of handling complex processes and operations.

Benchmarks

Claude 3.7 Sonnet offers state-of-the-art performance across a variety of coding, vision, and reasoning tasks.

Benchmark table comparing frontier reasoning models
Claude 3.7 Sonnet excels across instruction-following, general reasoning, multimodal capabilities, and agentic coding, with extended thinking providing a notable boost in math and science. Beyond traditional benchmarks, it even outperformed all previous models in our Pokémon gameplay tests.

Trust & Safety

We've conducted extensive testing and evaluation of Claude 3.7 Sonnet, working with external experts to ensure it meets our standards for safety, security and reliability. In the safety card for this release, we discuss new safety results in several categories, including emerging risks from computer use and potential safety benefits from reasoning models.

What customers are saying

There's a reason Claude is the default model for all Cursor users—Anthropic's approach to building models delivers on real-world tasks. During our extensive testing of Claude 3.7 Sonnet, we've seen significant improvements in the model's ability to understand and handle complex codebases and multi-step tasks. Now with two ways to think, Claude cements its place as the industry leader for coding.

Michael TruellCursor, CEO

Claude 3.7 Sonnet has been very impressive at planning code changes—we've seen cases where it's far better than any other model. Even with thinking off, it effectively handles medium-size features requiring both frontend and backend updates.

Walden YanCofounder, Cognition

What impresses us most about Claude 3.7 Sonnet is its instruction following and multi-turn tool calling accuracy. Coupled with its extended output window, 3.7 Sonnet is transformative for code generation and agentic workflows.

Jared PalmerVP of Product, AI, Vercel

Testing Claude 3.7 Sonnet's abilities building web apps zero-to-one reveals advancements far beyond other models. From consumer sites to dashboards, it's delivering sophisticated frontends with remarkable design quality—creating powerful applications that instantly demonstrate their value.

Michele CatastaPresident, Replit

Claude 3.7 Sonnet is absolutely epic for coding—it delivers complete production-grade code with genuine design taste while maintaining context through complex iterations. This will enable us to empower everyone to add interactivity and smarts to their Canva designs, even without coding skills.

Danny WuHead of AI Products, Canva

On my favorite mathematical test question, Claude 3.7 Sonnet performed better than any other model by a significant margin, at least to my taste. It shows a level of genuine understanding we have not yet seen from AI models—it explains concepts clearly, creates intuitive analogies, and can apply principles across different domains, even in standard mode.

Craig FallsHead of Quantitative Research, Jane Street

Results from our early testing of Claude 3.7 Sonnet in GitHub Spark demonstrate two large improvements from its predecessor: it generates higher quality apps (e.g. feature set, user interface) from a brief natural language description, and in thinking mode it is more successful at generating passing code across iterations.

Alice LiStaff Researcher, Machine Learning, GitHub

Enterprises look to Box as their platform for AI-driven intelligent content management. With Claude 3.7 Sonnet’s advanced reasoning capabilities our customers can unlock even more value from their unstructured content and create new efficiencies across their business. We’re excited to further integrate with Anthropic and bring 3.7 Sonnet’s capabilities to Box AI.

Yashodha BhavnaniVP of AI Product Management, Box

Testing Claude 3.7 Sonnet across Slack and Salesforce shows significant improvements against older models: 30% better summarization, 24% enhanced information retrieval, and a deeper understanding of organizational context and social dynamics.

Curtis AllenSenior Staff Engineer, Slack

Our evaluation shows Claude 3.7 Sonnet enhancing Notion's assistant capabilities, with notable gains in adhering to formatting constraints and more accurate information filtering. Our metrics show chat accuracy improving from 56% to 70%.

Simon LastCofounder, Notion

What sets Claude 3.7 Sonnet apart is its deep understanding of human collaboration. It not only stays focused and completes tasks thoroughly, but interacts in a way that feels natural—while passing nearly 100% of our knowledge graph evaluations.

Rodrigo DaviesDirector of AI Product Management, Asana

Our early testing of Claude 3.7 Sonnet has been extremely encouraging, especially on our most difficult tasks. The quality is exceptional—we are exploring how these gains can be leveraged across more nuanced legal tasks.

Joel HronCTO, Thomson Reuters

Claude 3.7 Sonnet is particularly impressive in codebase understanding. We saw 10+% gains on agentic QA, essentially solving our benchmarks even without reasoning mode enabled—a change on par with the step from Haiku to Sonnet.

Guy Gur-AriCofounder & Chief Scientist, Augment Code

We're excited to continue our partnership with Anthropic in bringing Claude 3.7 Sonnet to Poe, which combines deep reasoning and performance with engaging conversation skills. 3.7 Sonnet will be especially helpful for our users who want to solve real-world coding tasks.

Spencer ChenHead of Poe Product, Quora

What stands out about Claude 3.7 Sonnet is its natural and human-like approach to organizing information. From thoughtful code documentation to well-structured summaries, it consistently focuses on what matters most.

Denis YaratsCofounder & CTO, Perplexity

Claude 3.7 Sonnet is hands down the best model we've used for programming, especially Deno. It creates complex applications and handles extensive refactors in a single turn, setting a new bar for AI assistance across our 30,000+ users across the enterprise!

Justin WattsDistinguished Engineer, TELUS

In our benchmark of complex financial analysis tasks, Claude 3.7 Sonnet delivers a substantial 20% increase in accuracy over other reasoning models. At Endex, 3.7 Sonnet enables automations for the most time consuming and detail-oriented tasks like spreading financials and company performance analysis.

Tarun AmasaCEO, Endex

Claude 3.7 Sonnet demonstrates a complex, transparent chain of thought in financial calculations, showing its reasoning in ways no other model has achieved. This builds essential trust in both process and results, which is why we're excited to integrate it across the BlueFlame platform.

Raj BakhruCEO, BlueFlame

Claude 3.7 Sonnet absolutely transforms application development by combining real-world understanding with exceptional code generation. For building agentic systems, this is the first model I’ve seen that can iterate for long durations with zero errors.

Ash EdwardsCEO, Fern Labs

Claude 3.7 Sonnet has transformed AI-powered software engineering. At iGent AI, we've measured 7x success on complex tasks and 2.5x fewer context errors. Engineers now build in days what took weeks, with a thought partner that preserves creative control.

Sean WardCEO & Cofounder, iGent

What we're seeing with Claude 3.7 Sonnet in our development environment is very impressive. From multi-file edits to enhanced reasoning output, it's bringing unprecedented capabilities to complex coding workflows for Bolt.new.

Dominic ElmFounding Engineer, StackBlitz

Per our internal testing results, we're seeing delightful improvements using Claude 3.7 Sonnet for web story generation—it understands narrative context deeply and delivers more natural dialogue than other models we've tried. We are not just seeing richer and longer outputs, but more enhanced emotional intelligence and cultural adaptability.

DJay LeeCPO & Cofounder, WRTN

Claude 3.7 Sonnet represents a leap in computer use—its ability to determine correct agentic actions is significantly improved over GPT-4o and prior Claude versions. Complex actions show far lower error rates with less guidance needed, reducing 4-hour automated testing to just 10 minutes.

David ColwellVP of Artificial Intelligence & Machine Learning, Tricentis

What impresses us about Claude 3.7 Sonnet is how it combines rapid analysis with nuanced code understanding. When planning code updates, it considers subtle context such as files being generated from a template. It also filters out 21% more noisy security alerts than our previous model.

Bence NagyAI Lead Engineer, Semgrep

See Claude in action

Coding

What should I look for when reviewing a Pull Request for a Python web app?

Writing

Create a 3-month editorial calendar template for a weekly newsletter

Students

What's an effective study schedule template for final exams?

Frequently asked questions

We offer a family of Claude models across the spectrum of speed, price, and performance. Claude 3.7 Sonnet is our most intelligent model to date. We recommend Claude 3.7 Sonnet for critical use cases where you want frontier intelligence, like customer-facing AI experiences.

Pricing depends on how you want to use Claude 3.7 Sonnet. To learn more, check out our pricing page.

Instead of restricting Claude to use APIs, we're teaching it general computer skills—allowing it to use a wide range of standard tools and software programs. Developers can use this beta capability to automate repetitive tasks, perform software testing and Q/A, and perform open-ended tasks like research.

To make this possible, we've built an API that allows Claude to perceive and interact with computer interfaces. Developers can integrate this API to enable Claude to translate prompts (e.g., “find me a hotel in Colorado”) into specific computer commands.

Claude 3.7 Sonnet is both an ordinary LLM and a reasoning model in one: you can pick when you want the model to answer normally and when you want it to think longer before answering.

Extended thinking mode is best for use cases where performance and accuracy matter more than latency. It improves response quality across many tasks, including instruction following, math, physics, and coding—and the visible thought process also helps you verify how the model arrived at its response.