Anthropic: Claude Code users hitting usage limits 'way faster than expected'

Original link: https://www.theregister.com/2026/03/31/anthropic_claude_code_limits/

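Users of Claude Code, Anthropic's AI coding assistant, are seeing unexpectedly high token usage and rapidly exhausted quotas, which interrupts their workflows. Anthropic has acknowledged the problem and says it is the team's top priority. User reports indicate a marked drop in the number of usable days per subscription period.

Several factors contribute: a recent quota reduction during peak hours, the end of a promotion that doubled usage limits, and possible bugs that inflate costs. One user reported finding bugs that inflate costs by 10-20x. The prompt cache's short lifetime (5 minutes) also raises costs when usage is not continuous, although the lifetime can be extended at a higher token price.

Adding to the frustration, Anthropic defines its usage limits only vaguely, which makes it hard for developers to predict and manage their spending. The situation resembles recent problems with Google Antigravity and highlights a broader tension between users' cost control and AI providers' profitability, especially as vendors encourage integrating AI into automated workflows that can drain quotas quickly.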

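Anthropic: Claude Code users hitting usage limits 'way faster than expected' (theregister.com)
26 points by samizdis 53 minutes ago | 2 comments

elephanlemon 12 minutes ago:
Yesterday (on Pro) I had one small conversation in which Claude did a set of three web searches, one very small conversation with no web search, and I added one prompt to an existing long conversation. I was shocked to find that after that last prompt I had somehow hit my limit until 5 p.m. This account is not connected to an IDE or Code, so it is very confusing.

master_crab 2 minutes ago (reply to elephanlemon):
Tool calls (especially retrieval to fetch context) eat up a lot of the context window. I explicitly send MCP calls to sub-agents because they are so "verbose".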

Original article

Users of Claude Code, Anthropic's AI-powered coding assistant, are experiencing high token usage and early quota exhaustion, disrupting their work.

Anthropic has acknowledged the issue, stating that "people are hitting usage limits in Claude Code way faster than expected. We're actively investigating... it's the top priority for the team."

A user on the Claude Pro subscription ($200 annually) said on the company's Discord forum that "it's maxed out every Monday and resets at Saturday and it's been like that for a couple of weeks... out of 30 days I get to use Claude 12."

The Anthropic forum on Reddit is buzzing with complaints. "I used up Max 5 in 1 hour of working, before I could work 8 hours," said one developer today. The Max 5 plan costs $100 per month.

There are several possible factors in the change. Last week, Anthropic said it was reducing quotas during peak hours, a change that engineer Thariq Shihipar said would affect around 7 percent of users, while also claiming that "we've landed a lot of efficiency wins to offset this."

March 28 was also the last day of a Claude promotion that doubled usage limits outside a six-hour peak window.

A third factor is that Claude Code may have bugs that increase token usage. A user claimed that after reverse engineering the Claude Code binary, they "found two independent bugs that cause prompt cache to break, silently inflating costs by 10-20x." Some users confirmed that downgrading to an older version helped. "Downgrading to 2.1.34 made a very noticeable difference," said one.

The documentation on prompt caching says that the cache "significantly reduces processing time and costs for repetitive tasks or prompts with consistent elements." That said, the cache has only a five-minute lifetime, which means stopping for a short break, or not using Claude Code for a few minutes, results in higher costs on resumption.

Developers can upgrade the cache lifetime to one hour but "1-hour cache write tokens are 2 times the base input tokens price," the documentation states. A cache read token is 0.1 times the base price, so this is a key area for optimization.
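
To make the trade-off concrete, here is a rough Python sketch of the arithmetic. It is an illustrative model, not Anthropic's billing logic. The 0.1x read and 2x one-hour write multipliers are the ones quoted above. The 1.25x multiplier for a five-minute cache write is an assumption not stated in this article, as is the simplification that each request resets the cache timer. The function name `session_cost` and its parameters are hypothetical.

```python
# Price multipliers relative to the base input-token price.
CACHE_READ = 0.1   # quoted in the article
WRITE_1H = 2.0     # quoted in the article
WRITE_5M = 1.25    # ASSUMPTION: not quoted in the article


def session_cost(gaps_minutes, context_tokens, ttl_minutes, write_multiplier):
    """Cost, in base-input-token units, of resending one fixed context.

    gaps_minutes[i] is the idle time before follow-up request i.
    ASSUMPTION: every request resets the cache timer, so only the gap
    since the previous request decides between a hit and a miss.
    """
    # The first request always pays to write the cache.
    cost = context_tokens * write_multiplier
    for gap in gaps_minutes:
        if gap < ttl_minutes:
            # Cache still alive: pay the cheap read price.
            cost += context_tokens * CACHE_READ
        else:
            # Cache expired: the whole context is written again.
            cost += context_tokens * write_multiplier
    return cost


if __name__ == "__main__":
    gaps = [2, 10, 3, 20, 1]  # minutes of idle time between requests
    context = 100_000         # tokens of repeated context
    print("5-minute cache:", session_cost(gaps, context, 5, WRITE_5M))
    print("1-hour cache:  ", session_cost(gaps, context, 60, WRITE_1H))
```

Under these assumptions, a session with two breaks longer than five minutes costs about 405,000 base-token units with the default cache and about 250,000 with the one-hour cache, so the pricier write pays off only when breaks are frequent enough.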

Anthropic does not state the exact usage limits for its plans. For example, the Pro plan promises only "at least five times the usage per session compared to our free service." The Standard Team plan promises "1.25x more usage per session than the Pro plan." This makes it hard for developers to know what their usage limits are, other than by examining their dashboard showing how much quota they have consumed.

Problems like this are not unusual. Earlier this month, users of Google Antigravity were protesting about similar issues.

Bugs aside, what we are seeing is an implicit negotiation between users and providers over what is an acceptable pricing and usage model for AI development. Users want to control costs and providers need to make a profit. There is also a disconnect between vendor marketing that urges developers to insert AI into every process, including in some cases automated workflows, and a quota system that can cause AI tools to stop responding.

"For folks running Claude Code in automated workflows: rate-limit errors need to be caught explicitly – they look like generic failures and will silently trigger retries. One session in a loop can drain your daily budget in minutes," observed one user. ®

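The advice in that last quote can be sketched in Python as follows. This is a minimal illustration, not Claude Code's or any SDK's actual API: `RateLimitError`, `TransientError` and `run_with_retries` are hypothetical names, and a real workflow would need to map its own tool's failure signals onto them. The point is only that a rate-limit failure is separated from other failures and is never retried.

```python
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's rate-limit failure."""


class TransientError(Exception):
    """Hypothetical stand-in for a failure that is safe to retry."""


def run_with_retries(task, max_retries=3, base_delay=1.0, sleep=time.sleep):
    """Run task(), retrying transient failures but never rate limits."""
    for attempt in range(max_retries + 1):
        try:
            return task()
        except RateLimitError:
            # Stop immediately: retrying would keep hitting the quota.
            raise
        except TransientError:
            if attempt == max_retries:
                raise
            # Exponential backoff before the next attempt.
            sleep(base_delay * 2 ** attempt)
```

A caller would wrap each automated step in `run_with_retries` and treat a propagated `RateLimitError` as a signal to pause the whole job until the quota resets.
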