我们自动化了所有事情,除了知道发生了什么。
We Automated Everything Except Knowing What's Going On

原始链接: https://eversole.dev/blog/we-automated-everything/

## 迫在眉睫的软件理解危机 软件开发正在经历一场激进的民主化:人工智能工具现在赋予个人前所未有的速度和便捷来构建和部署产品,从而大幅降低成本和传统团队规模。然而,这种加速正在暴露一个关键缺陷——对日益复杂的系统缺乏*理解*。 多年的扩展、云采用和工具分层导致了碎片化、不透明的基础设施,工程开发更多地变成了管理机器而不是真正构建系统。至关重要的是,负责维护稳定的团队往往被低估,而速度和产出被优先考虑。 人工智能正在加剧这个问题。随着人工智能代理快速生成代码和变更,开发速度与人类理解之间的差距正在呈指数级增长。传统的监控工具可以检测到*哪里*出了问题,但无法检测到*为什么*,而工程师队伍日益萎缩,难以跟上节奏。 解决方案不是更多的工具,而是重新关注基础工程原则——深入的系统理解、明确的所有权和主动的风险评估。 如果没有将激励机制转移到优先考虑理解而非单纯的产出,整个行业将面临一场迫在眉睫的危机,系统将变得越来越脆弱,容易发生不可预测的故障。

## 黑客新闻讨论:自动化与理解系统 最近一篇黑客新闻帖子引发了关于自动化快速发展,特别是人工智能影响的争论。核心论点总结为“未来属于那些理解他们刚刚发布的内容的人”,突显了一个日益增长的担忧:我们正在构建极其复杂的系统——通常*借助*人工智能——却未能完全理解它们如何运作。 许多评论者同意,人工智能工具本质上是在生成“没人真正理解的遗留代码库”,加剧了现有问题,即所有生产代码最终都会变成遗留代码。 还有人指出其中存在的激励结构;人工智能允许单人开发者快速构建功能性应用程序,但长期维持和维护这些项目具有挑战性。 虽然有些人对未来表示悲观,认为思考将被外包给人工智能,但也有人分享了使用人工智能克服技能差距并构建以前无法触及的项目方面的积极经验。 然而,一个共同的主题是缺乏可见的影响——这些新软件中的很多都在服务于利基受众,而不是推动广泛的繁荣。 讨论还涉及了长期维护复杂、数十年前的系统的长期问题,认为人工智能只是增加了一层不透明性。
相关文章

原文

Software is being democratized. The cost of building just collapsed in a way most people haven't fully processed yet. Anyone with an idea and access to an AI agent can ship a product. What used to take a team of twenty and six months now takes one person and a weekend. That's not hype. It's happening right now, everywhere, all at once.

But here's what nobody's talking about. The systems underneath all of this are buckling. And the people responsible for keeping them running are either overwhelmed, undervalued, or being replaced entirely. We're entering an era where everyone can build, and almost nobody understands what they've built.

That's not an AI problem. It's an identity crisis that's been growing inside software engineering for years. AI just ripped the cover off.

Software used to be something you could wrap your head around. You knew what server it ran on, what database it talked to, what data center it lived in. That era has been gone for a while now. We scaled, broke things into pieces, adopted every cloud service and managed platform we could find, and when the complexity got unbearable we started stacking layers on top. Infrastructure as code, GitOps, CI workflows. The pitch was always the same: encode the complexity, make it manageable, sleep at night. It happened so gradually that nobody noticed how deep they were until they were already underwater.

What came out the other side wasn't engineering. It was bureaucracy wearing an engineering costume. Configuration scattered across dozens of tools, each owned by a different vendor, each showing a narrow slice of reality. The daily work shifted from shaping systems to keeping machinery lined up. And we just accepted it, because the way we value work in this industry never caught up either. The teams keeping systems alive today are visible and rewarded. The teams doing the quiet work of making sure systems don't need saving tomorrow get ignored. Nobody promotes the team that prevented the outage. Leadership retells the 4 a.m. hero story. We say we value reliability but promote throughput. Prevention matters, right up until the budget conversation. Then we fund response.

None of this is new. The complexity, the configuration bloat, the broken incentives, all of it predates AI by years. What AI did was pour gasoline on it. Teams that shipped weekly now ship continuously. Architecture decisions that took weeks happen in minutes, all because the cost of writing code is collapsing overnight. But the cost of understanding that code, what it actually does to a running system, hasn't moved at all. If anything it's gotten dramatically worse, because now the author doesn't even know why they made their decisions. And while all of this is happening, the number of people responsible for keeping things working is shrinking, not growing. We've created a supply chain crisis where we're producing far more than we can actually handle.

Change is up 30x while understanding is dropping, and the gap is widening every quarter. It's the same mess we've always had. It's just moving fast enough now that all the little tricks we used to play on ourselves have stopped working.

Great engineering is not deployments. It's not monitoring, not dashboards. It's understanding. Knowing how the pieces connect, who owns what, how changes spread, and where risk has quietly been building for months until it suddenly matters.

What should make everyone very hopeful is the foundation of great engineering isn't missing. It's scattered across every tool your organization already pays for. What's missing is anyone putting it together. Engineers jump between dozens of tools and rebuild the picture in their heads every time something goes wrong. That picture decays the day they leave, breaks apart across teams, and is always slightly wrong in exactly the ways that surface under pressure.

And this is about to get so much worse. We're heading into a world where AI agents outnumber engineers 50 to 1, each one shipping code and deploying changes faster than any human can track. When one engineer could barely keep up with what ten humans were changing, what happens when that same engineer is responsible for understanding what fifty agents are doing to a live system at the same time? The old ways of keeping up don't just get harder. They become physically impossible. Monitoring can tell you something went wrong. It can't tell you why, or what changed, or who owns the thing that broke. Understanding isn't a nice-to-have in that world. It's the only thing standing between a running system and chaos.

The world we're entering doesn't look like the one we're leaving.

Enterprises with thousands of engineers are going to be the exception, not the rule. A team of five people with fifty agents is going to build and operate what used to take five hundred. That's not a prediction. It's already starting to happen. Software is being democratized in a way that changes who builds, how fast they build, and how much of the world runs on what they've built.

But democratizing creation without solving understanding is a disaster waiting to happen. Acceleration without understanding just makes things more dangerous, and right now the entire industry is accelerating while understanding falls further behind.

SRE isn't dead. DevOps isn't dead. Platform engineering isn't dead. What's dying is the shallow version of all of them. The belief that shipping faster is always better, that more tools solve more problems, that you can outrun your own complexity if you just automate hard enough.

You can't.

The ability to understand a system as fast as it changes is the foundational layer for the next era of technology. Not another tool on top of the pile. The thing everything else gets built on.

But here's the uncomfortable part. Most of you reading this already know everything I just said. You've felt it. You've lived it. You've complained about it in retros and Slack threads and late-night incident calls. And then Monday comes and you go back to the same dashboards, the same alert fatigue, the same quarterly planning rituals that measure output instead of understanding. Nothing changes because the incentives haven't changed. And the incentives haven't changed because nobody with authority has been forced to confront what's actually breaking.

That's about to end. The supply chain crisis I described isn't theoretical. It's arriving. And when a team of five with fifty agents is shipping faster than your entire engineering org, the question won't be whether you have enough dashboards. It'll be whether anyone in the building can explain what your system is actually doing right now.

Most of you won't be able to answer that. And you already know it.

That's the conversation this industry needs to have. Not tomorrow. Now.

联系我们 contact @ memedata.com