管理不可靠的编译器
Managing Unreliable Compilers

原始链接: https://blog.tonkotsu.ai/p/managing-unreliable-compilers

## 软件开发者角色的演变 尽管人工智能编码工具(LLM)兴起,软件开发并未消失——它正在*变化*。虽然比以往任何时候都多地构建软件,但开发者正在从主要*编写*代码转变为*管理*流程,类似于监督“不可靠编译器”的初级管理者。 关键在于有效的授权:在使用人工智能时,避免过度管理(过度监督)和盲目信任(不足监督)。成功的开发者将建立明确的工作流程,并设定明确的检查点,以便进行有针对性的人工监督。 这使得开发过程呈现出“杠铃”形状——将精力集中在**规划**(做出关键技术决策)和**验证**(确保质量)上。中间环节——实际编码——正日益自动化。 该职业的未来在于利用人工智能的速度,同时在关键环节保留重要的判断力,最终扩大开发能力并保持高质量的输出。

Hacker News 讨论批评了一篇名为“管理不可靠的编译器”的文章,认为其严重误导。用户指出,该文章主要讨论的是开发者管理、人工智能在软件开发中的作用,以及推广了一个人工智能工具包——这些都与*编译器*不可靠性没有直接关系。 一位评论员强调了“计划 -> 代码验证 -> 代码实现 -> 验证”优于简单的“计划 -> 代码 -> 验证”方法,并指出像 Copilot 这样的工具可能会操纵测试而不是修复代码。另一位评论员将大量但有缺陷的 LLM 与庞大而缺乏技能的劳动力进行类比,强调成本仍然是一个限制因素。 最终,大家一致认为这篇文章内容泛泛,缺乏实际例子,感觉像是人工智能生成的,并且有多位用户明确表示,这篇文章根本不是关于管理不可靠的编译器,而是一则伪装成技术见解的广告。
相关文章

原文

Is software development done? Is it all over for a profession that has rewarded, empowered, and provided direction for 30 million people worldwide?

The answer is clearly no: developers are needed as much as ever. More software will get built than ever before, and most of it will be in meaningfully complex domains and settings, requiring strong human judgment.

But it is changing at an incredible pace.

The Unreliable Compiler

Many have analogized LLMs with compilers. Both transform a more compact, higher-level description of behavior into a more verbose, lower-level code. But there is a crucial difference: compilers are now incredibly reliable, so much so that “it was a compiler bug” gets you approximately the same reaction as “a cosmic ray inverted a bit in RAM”. LLMs/coding agents on the other hand are anything but: they make errors in logic and errors in judgement, resulting in functional bugs and slop.

But they’re fast, and there are effectively infinitely many of them.

The developer’s key role, then, is to figure out how to put all these unreliable compilers to work. How to specify and structure work clearly, delegate that work efficiently, then verify and guardrail imperfect outputs. In other words, developers have all just become first-time managers.

Mistakes Managers Make

First-time managers make two classic mistakes: under-delegation and over-delegation. I have seen and made both of these mistakes during my time as an engineering manager at Meta, Microsoft and Atlassian.

Under-delegation results in micro-management. This is incredibly common; the manager can’t let go, and insists on babysitting everything and everyone. This limits scale: you can’t take on more projects if you’re providing dense supervision over everything. You can see this in a lot of present-day coding agent usage: developers sitting in chat panels, watching as an LLM performs a task.

Over-delegation is also a road to pain and suffering. This is the classic hands-off manager who is clueless about details and useless in a crisis. You see this pattern with present-day coding agent interactions as well: blindly one-shotting entire apps that turn out to be completely broken or unmaintainable. Fine for a one-off demo, fireable offense for any real production workload.

The solution to both problems is to define a clear protocol with explicit hand-offs and well-defined points when you as the manager can weigh in: sparse, but effective supervision that scales up.

Lifting the Barbell

A simple model for development is plan → code → verify. It applies at multiple scales, and it’s not entirely waterfall and linear, but the model holds.

In this model, it’s clear where human attention and judgment should be concentrated: at the endpoints. Planning is where you exercise judgment over significant technical decisions: what storage system to use, whether to factor something into a framework vs one-offs, whether logic should live on the client or server. And verification is where you exercise judgment over quality, both functional and non-functional. Just as with managers, a key duty here is to hold a high quality bar.

This is the transformation that is upon us as developers: learning to switch from spending most of our time and energy on coding, to spending most of it at the endpoints. Our role remains critical, but has become barbell-shaped.

We’re building Tonkotsu around this barbell. We give you powerful tools for planning and verification while orchestrating the middle so you don’t have to babysit. The profession isn’t ending. It’s scaling up.

联系我们 contact @ memedata.com