克劳德 Sonnet 3.5

克劳德 Sonnet 3.5
Claude Sonnet 5

原始链接: https://www.anthropic.com/news/claude-sonnet-5

Anthropic 发布了 Claude Sonnet 5，这是其迄今为止最具代理能力（agentic）的模型。该模型专为处理复杂的多步骤任务而设计，在推理、编码和自主工具使用方面较上一代 Sonnet 4.6 有了显著提升，进一步缩小了与高端 Opus 级别模型之间的性能差距。早期反馈强调了该模型在完成端到端工作流、进行自我修正以及高效驾驭复杂技术环境方面的能力。尽管功能有所增强，但 Sonnet 5 依然保持了强大的安全性，其不良行为和幻觉的发生率低于过往版本。虽然它比 Sonnet 4.6 更强大，但在敏感的网络安全任务上，其表现仍不及更高端的 Opus 模型。 Claude Sonnet 5 现已在所有方案（包括免费版、Pro、Team 和企业版）以及 Claude API 中提供。该模型在 2026 年 8 月 31 日前提供优惠定价，输入 Token 每百万个 2 美元，输出 Token 每百万个 10 美元；此后将执行每百万个 Token 3 美元/15 美元的标准定价。通过平衡高性能与成本效益，Sonnet 5 为开发者在日常自动化和复杂软件工程方面提供了一个强大的新工具。

Anthropic 发布 Claude Sonnet 3.5 在 Hacker News 上引发了褒贬不一且普遍持怀疑态度的反应。用户对该模型的定位感到困惑，指出基准测试数据表明，尽管由于采用了效率较低的新分词器可能导致成本上升，但其能力（尤其是在网络安全任务方面）仍不如现有的 Opus 3.5 模型。讨论中的一个反复出现的主题是高性能模型的“准入门槛”。许多用户推测，Anthropic 为了满足美国政府监管机构的要求，有意削弱了该模型的网络安全能力。批评者认为，这造成了一个双重体系：面向公众的“安全”阉割版模型，以及供政府使用的更强大的受限版本。从实际角度来看，开发人员认为该模型仅适用于低至中等难度的“子代理”任务，并建议对于复杂的工作流程，较大的 Opus 模型在成本效益和可靠性方面仍然更胜一筹。许多人表达了“基准测试疲劳”，指出围绕 AI 发布的讨论已变得重复，往往只关注微小的增量收益，而忽视了实质性进展的停滞。总体而言，用户认为这是一次乏善可陈的发布，凸显了 AI 能力与公众可访问性之间日益扩大的鸿沟。

原文

Claude Sonnet 5 is built to be the most agentic Sonnet model yet. It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models.

For many developers, the agentic AI era began with Sonnet-class models: Claude Sonnet 3.5, 3.6, and 3.7 were the first models that showed impressive skills in coding and tool use. More recently, though, the clearest gains in agentic capabilities have been in our Opus-class models.

Sonnet 5 narrows the gap: its performance is close to that of Opus 4.8, but at lower prices. It’s a substantial improvement over its predecessor, Sonnet 4.6, on important aspects of agentic performance like reasoning, tool use, coding, and knowledge work:

Our safety assessments found that Sonnet 5 shows an overall lower rate of undesirable behaviors than Sonnet 4.6, and is generally safer to use in agentic contexts. Evaluations also show that it has a much lower ability to perform cybersecurity tasks than our current Opus models.

From today, Claude Sonnet 5 is available across all plans: it is the default model for Free and Pro plans, and is available to Max, Team, and Enterprise users. It’s also available in Claude Code and on the Claude Platform, where it launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which it will be priced at $3 per million input tokens and $15 per million output tokens. Developers can use claude-sonnet-5 via the Claude API.

Working with Claude Sonnet 5

The charts below compare the performance of Sonnet 5 with Sonnet 4.6 and Opus 4.8 at different effort levels on the agentic search evaluation BrowseComp and the computer use evaluation OSWorld-Verified. Sonnet 5 (orange line) is a strict improvement over Sonnet 4.6 (gray line). Opus 4.8 (yellow line) is still the model of choice for higher accuracy on these tasks, but Sonnet 5 provides developers with lower-priced options that are of much higher quality than what was previously available. Between Sonnet 5 and Opus 4.8, users can adjust the effort level to find the right balance of cost and performance.

Feedback from our early access partners has been consistent: Sonnet 5 is much more agentic than its predecessors. Testers described how it finishes complex tasks where previous Sonnet models would stop short, how it checks its own output without explicitly being asked, and how it does all this agentic work at an attractive price point:

Claude Sonnet 5 gives our agents a strong execution layer for multi-step software engineering work. It handles sustained coding, tool use, and debugging well across messy technical contexts, and has been especially useful for workflows where follow-through and technical grounding matter.

We handed Claude Sonnet 5 a two-part job—update Salesforce account tiers, send a launch announcement to enterprise contacts—and it finished end to end. That used to stall halfway. For day-to-day automation, it’s a no-brainer

Claude Sonnet 5 gets more done with less. Same output quality, fewer steps to get there. It refuses unsafe requests cleanly and consistently, too. At Lovable, we’re putting powerful tools in the hands of millions of builders. A model that knows when to say no is just as important as one that knows how to build.

We ran Claude Sonnet 5 against dozens of our most challenging real pull requests, and it carried each one through to a tested, verified result on its own — freeing our engineers to focus on the judgment, the decision, and the final sign-off.

I asked Claude Sonnet 5 to investigate a bug. Unprompted, it wrote a reproducing test, implemented the fix, then stashed it to confirm the bug came back without the change. All in a single pass.

With Claude Sonnet 5, agents stay on plan, follow our conventions, and ship clean multi-step changes, all at an efficient cost.

Claude Sonnet 5 is at its best on brownfield code—race conditions, hidden tests, the parts nobody wants to touch. It traces a failure to its actual root cause and ships a durable fix instead of patching the symptom.

Claude Sonnet 5 sits on the Pareto frontier for Eve’s plaintiff-law tasks. We see the clearest gains in legal research and analysis, at a price-to-performance ratio that made the choice to migrate easy.

ClickHouse agents explore live data and produce insights on the fly, so time-to-insight matters when testing new models. Claude Sonnet 5 reasons in tighter steps and gets our users to answers noticeably faster. That speed is a difference our customers feel.

At Pace, our computer-use agents run insurance workflows—submission intake, FNOL, loss runs—on the systems our operations teams already use. Claude Sonnet 5 consistently takes the right action and does it quickly, which is what real insurance work demands.

Safety evaluations

Our pre-deployment safety evaluations found that Sonnet 5 was overall an improvement on Sonnet 4.6. On agentic safety, the model is better at refusing malicious requests and resisting hijack attempts in prompt injection attacks. The model shows lower rates of hallucination and sycophancy than Sonnet 4.6. On our automated behavioral audit, which tests a wide range of misaligned behaviors such as cooperation with misuse and deception, Sonnet 5 scored lower (that is, safer) overall. However, it did show somewhat higher rates of misaligned behavior on this assessment compared to the more capable Opus 4.8 and Claude Mythos Preview.

We did not deliberately train Sonnet 5 on cybersecurity tasks. It can perform some routine, non-harmful cyber tasks, but on evaluations testing potentially dangerous cyber skills, such as developing software exploits, it shows substantially poorer performance than models such as Opus 4.8 and Mythos 5. Scores from one evaluation, which tested models’ ability to develop exploits for vulnerabilities in the Firefox browser, are shown in the chart below. Sonnet 5 was never able to develop a full working exploit, but it does show a slightly higher rate of partial success than Sonnet 4.6. This latter change is likely due to improvements in general intelligence rather than specific training.

Since Sonnet 5 is somewhat stronger than its predecessor on these tasks, we’ve launched it with cyber safeguards enabled by default. These safeguards—which detect and block dangerous cyber usage in real time—are the same as those present in Claude Opus 4.7 and 4.8 (because we judged that the overall level of cybersecurity risk from Sonnet 5 was low, the safeguards are less strict than those launched with Fable 5, which block a much wider range of cybersecurity tasks).¹

Our full assessment of Sonnet 5 across many safety and capability evaluations is reported in the Claude Sonnet 5 System Card.

Availability and pricing

Claude Sonnet 5 is available everywhere today at an introductory price of $2 per million input tokens and $10 per million output tokens through August 31, 2026. It then moves to standard pricing at $3 per million input tokens and $15 per million output tokens.² We’ve increased rate limits across Chat, Cowork, Claude Code, and the Claude Platform³ to accommodate the higher token usage of higher effort levels; users can select whichever level makes sense for their particular project.