Universal CLAUDE.md – cut Claude output tokens by ~63%

Original link: https://github.com/drona23/claude-token-efficient

## CLAUDE.md: reducing Claude's output and cost

**CLAUDE.md** is a single file designed to significantly reduce Claude's verbosity (roughly 63% in benchmarks) with no code changes. While it primarily targets output-token reduction, which affects cost, it also addresses annoying tendencies such as excessive politeness, restating questions, and unnecessary formatting. The file modifies Claude's behavior through context, banning things like sycophantic openers, hollow closings, and unsolicited suggestions. It is model-agnostic but has mainly been tested on Claude.

**Best for:** automation, structured tasks, and teams that need consistent output.

**Not for:** single queries, casual use, or fixing fundamental model problems such as hallucination.

**Key trade-off:** the file *adds* input tokens, so it only saves money when output volume is large enough to offset that cost.

**Usage:** simply drop the `CLAUDE.md` file into your project root. Pre-built profiles are available for coding, agents, and analysis, or you can customize the file with your own rules. Community contributions welcome!

**Find it here:** [https://github.com/drona23/claude-token-efficient](https://github.com/drona23/claude-token-efficient)

A new GitHub project aims to cut Claude's output tokens by up to 63%, potentially lowering costs, especially when working with large volumes. The Hacker News discussion, however, questions whether this brevity hurts performance on more complex, "agentic" coding tasks. Some users argue that Claude's natural verbosity actually *improves* coherence and decision-making in iterative coding projects, keeping it from losing the thread and ultimately saving tokens through better results. Others shared strategies for managing context length, including writing session summaries ("handoff" skills) and using tools such as Headroom, RTK, and MemStack to compress data and provide persistent memory. A key point raised: Claude's costs are driven mainly by *input* tokens, not output, so the gains may be limited. Some also worry that constraining output could hurt the model's overall intelligence and reasoning. The project looks most useful when output-token cost is a significant concern and the task does not depend heavily on complex iterative reasoning.

Original article

One file. Drop it in your project. Cuts Claude output verbosity by ~63%. No code changes required. Note: most Claude costs come from input tokens, not output. This file targets output behavior - sycophancy, verbosity, formatting noise. It won't fix your biggest bill but it will fix your most annoying responses. Model support: benchmarks were run on Claude only. The rules are model-agnostic and should work on any model that reads context - but results on local models like llama.cpp, Mistral, or others are untested. Community results welcome.


When you use Claude Code, every word Claude generates costs tokens. Most people never control how Claude responds - they just get whatever the model decides to output.

By default, Claude:

  • Opens every response with "Sure!", "Great question!", "Absolutely!"
  • Ends with "I hope this helps! Let me know if you need anything!"
  • Uses em dashes (--), smart quotes, and other Unicode characters that break parsers
  • Restates your question before answering it
  • Adds unsolicited suggestions beyond what you asked
  • Over-engineers code with abstractions you never requested
  • Agrees with incorrect statements ("You're absolutely right!")

All of this wastes tokens. None of it adds value.


Drop CLAUDE.md into your project root. Claude Code reads it automatically. Behavior changes immediately.

your-project/
└── CLAUDE.md    <- one file, zero setup, no code changes

When This Helps vs When It Doesn't

This file works best for:

  • Automation pipelines with high output volume (resume bots, agent loops, code generation)
  • Repeated structured tasks where Claude's default verbosity compounds across hundreds of calls
  • Teams who need consistent, parseable output format across sessions

This file is not worth it for:

  • Single short queries - the file loads into context on every message, so on low-output exchanges it is a net token increase
  • Casual one-off use - the overhead doesn't pay off at low volume
  • Fixing deep failure modes like hallucinated implementations or architectural drift - those require hooks, gates, and mechanical enforcement
  • Pipelines using multiple fresh sessions per task - fresh sessions don't carry the CLAUDE.md overhead benefit the same way persistent sessions do
  • Parser reliability at scale - if you need guaranteed parseable output, use structured outputs (JSON mode, tool use with schemas) built into the API - that is a more robust solution than prompt-based formatting rules
  • Exploratory or architectural work where debate, pushback, and alternatives are the point - the override rule lets you ask for that any time, but if that's your primary workflow this file will feel restrictive
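For the parser-reliability case, the structured-output route means constraining the response with a tool schema rather than prompt rules. A minimal sketch of what that request could look like against the Anthropic Messages API, based on its tool-use documentation as I understand it; the tool name, fields, and model string here are hypothetical placeholders:

```python
# Hypothetical tool definition: the schema, not prompt wording, guarantees
# the shape of the response.
code_review_tool = {
    "name": "report_code_review",
    "description": "Report code review findings as structured data.",
    "input_schema": {
        "type": "object",
        "properties": {
            "bug": {"type": "string"},
            "fix": {"type": "string"},
            "severity": {"type": "string", "enum": ["low", "medium", "high"]},
        },
        "required": ["bug", "fix"],
    },
}

request_payload = {
    "model": "claude-sonnet-4-5",  # placeholder model name
    "max_tokens": 256,
    "tools": [code_review_tool],
    # Forcing the tool means the reply is schema-conforming JSON,
    # with no conversational filler to strip.
    "tool_choice": {"type": "tool", "name": "report_code_review"},
    "messages": [
        {"role": "user", "content": "Review: for (i = 0; i <= arr.length; i++)"}
    ],
}
```

Sending it would be something like `anthropic.Anthropic().messages.create(**request_payload)` with the official SDK; for guaranteed parseability at scale, this beats any prompt-level formatting rule.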

The honest trade-off: The CLAUDE.md file itself consumes input tokens on every message. The savings come from reduced output tokens. The net is only positive when output volume is high enough to offset the persistent input cost. At low usage it costs more than it saves.
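A back-of-envelope way to reason about that trade-off: since input and output tokens are priced differently, the per-message output savings must outweigh the file's per-message input overhead at those prices. The file size and per-million-token prices below are illustrative assumptions, not numbers from the repo:

```python
def break_even_output_savings(file_input_tokens: int,
                              input_price_mtok: float = 3.0,
                              output_price_mtok: float = 15.0) -> float:
    """Output tokens that must be saved per message before the file's
    own input-token overhead is paid for (assumed $/MTok prices)."""
    return file_input_tokens * input_price_mtok / output_price_mtok

# Under these assumptions, a hypothetical 800-token CLAUDE.md must trim
# at least 160 output tokens from every message just to break even.
print(break_even_output_savings(800))  # 160.0
```

Prompt caching, if enabled, shrinks the effective input overhead and moves the break-even point; the sketch ignores it.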


Same 5 prompts. Run without CLAUDE.md (baseline) then with CLAUDE.md (optimized).

| Test | Baseline | Optimized | Reduction |
|---|---|---|---|
| Explain async/await | 180 words | 65 words | 64% |
| Code review | 120 words | 30 words | 75% |
| What is a REST API | 110 words | 55 words | 50% |
| Hallucination correction | 55 words | 20 words | 64% |
| **Total** | **465 words** | **170 words** | **63%** |

~384 output tokens saved across 4 prompts. Same information. Zero signal loss.

Methodology note: This is a 5-prompt directional indicator, not a statistically controlled study. Claude's output length varies naturally between identical prompts. No variance controls or repeated runs were applied. Treat the 63% as a directional signal for output-heavy use cases, not a precise universal measurement. The CLAUDE.md file itself adds input tokens on every message - net savings only apply when output volume is high enough to offset that persistent cost.
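The headline numbers above reproduce from the word counts. The ~384-token figure appears to come from a rough words-to-tokens conversion of about 1.3 tokens per word, which is my assumption here; real tokenization varies with content:

```python
words_baseline, words_optimized = 465, 170

words_saved = words_baseline - words_optimized
tokens_saved = words_saved * 1.3  # assumed tokens-per-word ratio
reduction = 1 - words_optimized / words_baseline

print(words_saved)          # 295
print(round(tokens_saved))  # 384
print(f"{reduction:.0%}")   # 63%
```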

| Usage | Tokens Saved/Day | Monthly Savings (Sonnet) |
|---|---|---|
| 100 prompts/day | ~9,600 tokens | ~$0.86 |
| 1,000 prompts/day | ~96,000 tokens | ~$8.64 |
| 3 projects combined | ~288,000 tokens | ~$25.92 |
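The dollar figures in the savings table imply a flat rate of $3 per million tokens over a 30-day month. That rate is inferred from the table's own arithmetic, not from official pricing, so check current Sonnet rates before relying on it:

```python
RATE_PER_MTOK = 3.00  # $/MTok, inferred from the table's numbers
DAYS = 30

def monthly_savings(tokens_saved_per_day: int) -> float:
    """Dollar savings per 30-day month at the assumed flat rate."""
    return tokens_saved_per_day * DAYS * RATE_PER_MTOK / 1_000_000

print(f"${monthly_savings(9_600):.2f}")    # $0.86
print(f"${monthly_savings(96_000):.2f}")   # $8.64
print(f"${monthly_savings(288_000):.2f}")  # $25.92
```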

Without CLAUDE.md - Code Review:

Great question! I can see you're working with a loop. Let me take a look at that.
That's a thoughtful piece of code! However, I notice there's actually a subtle bug
here that's worth pointing out. The condition i <= arr.length should be i < arr.length.
Using <= means the loop will run one extra iteration... You might also want to consider
using modern array methods like forEach, map, or for...of which are cleaner...
Great catch on reviewing this - it's always good practice to double-check loop
boundaries! I hope this review was helpful. Feel free to share more code anytime!

120 words.

With CLAUDE.md - Same prompt:

Bug: <= causes an off-by-one error. On the final iteration arr[arr.length] is undefined.

Fix:
for (let i = 0; i < arr.length; i++)

30 words. Same fix. 75% fewer tokens.


| # | Problem | Fix |
|---|---|---|
| 1 | Sycophantic openers | Banned - answer is always line 1 |
| 2 | Hollow closings | Banned - no "I hope this helps!" |
| 3 | Restating the prompt | Banned - execute immediately |
| 4 | Em dashes, smart quotes, Unicode | ASCII-only output enforced |
| 5 | "As an AI..." framing | Banned |
| 6 | Unnecessary disclaimers | Banned unless genuine safety risk |
| 7 | Unsolicited suggestions | Banned - exact scope only |
| 8 | Over-engineered code | Simplest working solution enforced |
| 9 | Hallucination on uncertain facts | Must say "I don't know" - no guessing |
| 10 | User correction ignored | Correction becomes session ground truth |
| 11 | Redundant file reads | Never read the same file twice |
| 12 | Scope creep | Do not touch code outside the request |

Pro Tips From the Community

Scope rules to your actual failure modes, not generic ones. Generic rules like "be concise" help but the real wins come from targeting specific failures you've actually hit. For example if Claude silently swallows errors in your pipeline, add a rule like: "when a step fails, stop immediately and report the full error with traceback before attempting any fix." Specific beats generic every time.
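Rules in CLAUDE.md are plain natural-language instructions under ordinary markdown headings, so a failure-mode rule is just a sentence in the right place. A hypothetical project-level fragment targeting the error-swallowing failure described above might look like:

```markdown
## Error handling
- When a pipeline step fails, stop immediately. Report the full error and
  traceback before attempting any fix.
- Never catch an exception and continue silently.

## Scope
- Modify only the files named in the request. If another file needs changes,
  list it and wait for confirmation.
```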

CLAUDE.md files compose - use that. Claude reads multiple CLAUDE.md files at once - global (~/.claude/CLAUDE.md), project-level, and subdirectory-level. This means:

  • Keep general preferences (tone, format, ASCII rules) in your global file
  • Keep project-specific constraints ("never modify /config without confirmation") at the project level
  • Keep task-specific rules in subdirectory files

This avoids bloating any single file and keeps rules close to where they apply.
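Concretely, the three levels above might be laid out like this (the `pipelines/` directory name is a hypothetical example):

~/.claude/CLAUDE.md          <- global: tone, format, ASCII rules
your-project/
├── CLAUDE.md                <- project: repo-wide constraints
└── pipelines/
    └── CLAUDE.md            <- subdirectory: task-specific rules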


Different project types need different levels of compression. Pick the base file + a profile, or use the base alone.

| Profile | Best For |
|---|---|
| CLAUDE.md | Universal - works for any project |
| profiles/CLAUDE.coding.md | Dev projects, code review, debugging |
| profiles/CLAUDE.agents.md | Automation pipelines, multi-agent systems |
| profiles/CLAUDE.analysis.md | Data analysis, research, reporting |

Option 1 - Universal (any project):

curl -o CLAUDE.md https://raw.githubusercontent.com/drona23/claude-token-efficient/main/CLAUDE.md

Option 2 - Clone and pick a profile:

git clone https://github.com/drona23/claude-token-efficient
cp claude-token-efficient/profiles/CLAUDE.coding.md your-project/CLAUDE.md

Option 3 - Manual: Copy the contents of CLAUDE.md from this repo into your project root.


User instructions always win. If you explicitly ask for a detailed explanation or verbose output, Claude will follow your instruction - the file never fights you.


Found a behavior that CLAUDE.md can fix? Open an issue with:

  1. The annoying behavior (what Claude does by default)
  2. The prompt that triggers it
  3. The fix rule you propose

Community submissions become part of the next version with full credit.


Full benchmark results with before/after word counts: See BENCHMARK.md


This project was built on real complaints from the Claude community. Full credit to every source that contributed a fix:


MIT - free to use, modify, and distribute.


Built by Drona Gangarapu - open to PRs, issues, and profile contributions.
