# It's time to move your docs in the repo

Original link: https://www.dein.fr/posts/2026-03-13-its-time-to-move-your-docs-in-the-repo

## Documentation in the AI era

Core idea: **treat documentation as a first-class citizen in the code repository.** Just as code needs version control, so does documentation, especially with the rise of AI. The proverb "the palest ink is more reliable than the most powerful memory" is more relevant than ever.

AI agents are *dramatically increasing* the volume of documentation (often as markdown, via rules files), which highlights the need for clear, human-readable specifications. These "rules" often capture previously undocumented best practices, blurring the line between AI-generated and human-written content. Going forward, we may shift our focus from reviewing code to reviewing well-defined specs and guidelines *more*.

Moving docs into the repo addresses the stale-documentation problem, since AI can help keep code and docs consistent. It also gives AI agents critical context, saving time and tokens by placing knowledge (e.g., infrastructure lessons) directly in the codebase. While tools like Google Docs remain valuable for *collaborative drafting*, final, stable documentation should live alongside the code, where it benefits from version control, easy updates, and standard editing tools. Ultimately, documentation should be written for human review, to preserve clarity and maintainability in an increasingly AI-driven development environment.

## Docs in the repo: a Hacker News discussion

A recent Hacker News thread discussed the growing practice of storing documentation *inside* the project repository ("docs in the repo"). While this has been considered a best practice since tools like GitHub Pages appeared, the discussion highlighted renewed interest driven by advances in AI.

Users reported benefits such as streamlined updates: AI tools can now update docs automatically as code changes and quickly spot discrepancies. A monorepo structure (all code and docs in one repository) is particularly advantageous for AI "agents" that need a centralized source of knowledge.

However, challenges were raised around versioning and publishing components from a monorepo, with some advocating multiple repos with proper versioning and packaging. Security concerns about AI access to documentation also came up, along with skepticism about adopting changes *merely* because of AI trends. Ultimately, the discussion reinforced the core principle: documentation, like code, should be easy for humans to read and easy to maintain alongside the project itself.

## Original article

> The palest ink is more reliable than the most powerful memory. – Chinese proverb

AI changes the game when it comes to having all your docs in your repository: it's never been that easy to keep them up to date!

I've always been a fan of having documentation living alongside the code:

  • Version control: just like code, documentation evolves. Why use a different version control system when you're already using git? Especially when multiple people are changing docs at the same time, with potentially conflicting changes.
  • Proximity to code: e.g., rg or grep will return both code and documentation results, making it much easier to keep the two in sync.
  • Formal approval: in the spirit of documentation-driven development, starting with a review of documentation updates helps everyone understand the final product/API. (For active collaboration, other tools, e.g. Google Docs, still provide a superior UX.)
  • Automatic generation: when using a different system for hosting the docs (Google Docs, Confluence, Notion, etc.), it's quite laborious to copy-paste APIs and example code. There are many tools (e.g., Sphinx's autodoc, jsdoc, javadoc, docusaurus) that can generate API docs directly from the code.
  • Testing: static code examples in documentation are a good start, but it's even better when they're tested, which you can do when running code examples in docs is part of your continuous integration process. See Python's doctest for example. In a way, the documentation is the spec.
  • Efficient editing: you benefit from all your text editor tools, and can script mass-changes.
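To illustrate the testing point above: Python's doctest executes the examples embedded in docstrings, so documentation examples double as tests. A minimal sketch (`slugify` is a hypothetical helper, not from the article):

```python
import doctest


def slugify(title):
    """Convert an article title into a URL slug.

    The example below is executable documentation: doctest runs it
    in CI and fails the build if the output ever drifts from reality.

    >>> slugify("Move Your Docs")
    'move-your-docs'
    """
    return title.lower().replace(" ", "-")


if __name__ == "__main__":
    # Exits nonzero if any docstring example fails, which makes it
    # trivial to wire into a continuous integration step.
    doctest.testmod()
```

In a setup like this, the documentation really is the spec: a stale example is a failing test.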

## We will be spending more time writing docs

First observation: AI agents have considerably increased the proportion of markdown files in commits. That's usually because folks check out the agent's implementation, which is a very good idea. It's also because you save a lot of agentic iteration time when you write rules files (.mdc files) to guide agents' execution. So, whether or not you agree with the thesis, it is happening.
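For illustration, such a rules file often reads like a style-guide fragment. A hypothetical `.mdc` file (the frontmatter fields follow the common rules-file convention; the rules themselves are invented for this example):

```markdown
---
description: Error-handling conventions for the service layer (hypothetical)
globs: ["src/**/*.py"]
---

- Raise domain-specific exceptions; never return error codes.
- Log at the call site that handles the error, not where it is raised.
- Every public function documents the exceptions it can raise.
```

Notice that nothing here is agent-specific: it is exactly the kind of content a team style guide would contain.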

I would submit that 80% of rules-file content could have been documentation instead, or is potentially already documented elsewhere. Just like code should be primarily written for humans to read, all files in a repository are written primarily for humans to review. This also applies to rules files created to guide agentic execution. Rules files read more and more like the style guides and best practices that we never bothered writing but probably should have solidified.

The frontier between AI-only markdown and human-only markdown is so blurry that I could see rules files disappearing completely, replaced by documentation.

This is also consistent with engineers shifting their focus left. Engineering tooling has trended towards higher and higher abstractions: from machine code to C, to dynamic languages, to SDKs, and now to not writing code at all and focusing only on the spec and guidelines. Just like we don't review the machine code produced by the compiler, there might come a day when we don't review the code generated by an LLM, provided it respects the harness, the specs, and the guidelines (security will be a key concern there). In that world, we'll spend most of our energy reviewing the specs, the harness, and the guidelines. Conclusion: those docs need to be written first and foremost for human review.

## Why AI makes it even more meaningful to move docs into your repo

AI agents solve stale docs. A common objection to writing docs is: "why bother? read the code - the code is always up to date". (The same line of argument would apply to brushing your teeth.) AI agents solve this problem. They take away the laborious work of ensuring code & documentation alignment (either in PR, or with specialized review agents that look for documentation inconsistencies). Quite game-changing.
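One concrete way to wire such an alignment check into CI, as a minimal GitHub Actions sketch (the job simply runs the interactive examples embedded in the Markdown docs through Python's doctest, so stale examples fail the pull request; file paths are assumptions):

```yaml
name: docs-consistency
on: [pull_request]

jobs:
  test-doc-examples:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      # `python -m doctest` can execute the ">>>" examples
      # stored in plain-text/markdown files.
      - run: python -m doctest docs/*.md
```

A specialized review agent could be slotted into the same workflow to flag prose that contradicts the diff.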

AI agents benefit from higher level context. Moving your docs into the repo (including, perhaps, your architecture proposals - RFCs - and your product specs - PRDs) will provide that extra context.

Materialized plans with findings will save tokens & iteration time. Imagine researching "the best way to do X" in a massive codebase. You will spend a lot of tokens finding the answer to that question. Documenting the answer and materializing it in the repository will enable your colleagues to skip that research step later (and to keep it up to date with extra learnings, best practices, etc.!). This is especially true for things that agents can't infer from the code, or can infer only with difficulty: typically, infra-related things you learned by deploying your code to production. For instance, I spent about two weeks researching and iterating on structured logging best practices - I materialized that into a "metaplan" that other teams can use directly, saving everyone (including agents!) a ton of time!

## Answers to objections

You could use MCP and other approaches (skills) to give agents access to your documentation. But the same arguments I laid out in the beginning still apply, especially the version control piece. Most documentation systems are not designed for fast iteration with strong concurrency control.

Waiting for a code review for docs will deter people from updating docs. (1) What if you weren't the one writing those docs? (2) Who says all repo content changes need to go through review? (3) As we shift more and more left, won't the documentation change or implementation plan be the most important thing to review?

AI agents write long, convoluted docs. First response: well, most humans do as well :). Just like code, you should (1) review the agent's work, (2) fix the agent's work, and (3) write your own docs (like this article: none of this content has been generated by AI!). Putting it into version control makes it MUCH easier and safer (reviews! history!) to iterate.

Do I really need to move all my documentation? I'd say yes, at this point. Not the fleeting docs, but everything that provides useful context about the codebase, including RFCs.

[your_preferred_tool] is better at [tables/schemas/links]. AI is getting incredibly good at generating mermaid diagrams (supported by GitHub), tables, etc.
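For what it's worth, GitHub renders mermaid fences natively, so a diagram lives in the doc as reviewable plain text. A minimal sketch of the workflow this article advocates (labels are illustrative):

```mermaid
flowchart LR
    Draft[Google Docs draft] --> Review[PR review]
    Review --> Repo[(docs/ in repo)]
    Repo --> CI[CI runs doc examples]
```

Because it is text, the diagram diffs, merges, and version-controls exactly like the surrounding prose.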

[your_preferred_tool] is better for human collaboration. Yes, Google Docs is still much better for active collaboration, so it's fair to continue using it for that use case. But once the documentation is in a good place, I would move it into the repo (Google Docs has a useful "Copy as Markdown" feature that I use all the time).

Non-engineers usually don't have repo access. (1) You can deploy your docs on an internal-only website. (2) There is a clear trend toward non-engineers getting code access (which poses some interesting security challenges).

## References and articles

As always, there are more resources in my repo charlax/professional-programming.
