LLMs as Language Compilers: Lessons from Fortran for the Future of Coding

原始链接: https://cyber-omelette.com/posts/the-abstraction-rises.html

## The Programming Automation Loop

The rise of large language models (LLMs) and "coding agents" is profoundly reshaping software development, echoing a historic shift in the field. Just as LLMs can now autonomously build complex applications, a feat that once required an entire engineering team, the early days of computing treated programming as a highly specialized "black art" practiced by a small "priesthood." The situation resembles the 1950s, when languages such as FORTRAN and COBOL simplified coding and dramatically shortened program length and development time. While early skepticism centered on performance and on the potential unemployment of skilled programmers, these languages ultimately *increased* demand for computing and expanded the field, even though they never reached the general public as originally predicted. The core challenge remains: reducing *accidental* complexity does not eliminate *essential* complexity; understanding what you *want* the computer to do is still paramount. History suggests that automation does not necessarily lead to unemployment, but rather to an ever-larger workforce tackling more complex problems. Today's coding agents, like FORTRAN before them, are lowering the barrier to entry, but the need for skilled problem-solvers persists, and the focus shifts to higher-level challenges that may drive further innovation, a phenomenon similar to Jevons Paradox, in which efficiency gains lead to increased demand.

A Hacker News discussion of the "LLMs as Language Compilers" article explored the impact of large language models (LLMs) on software development. Some argued that LLMs can match the output of an entire engineering team, while others considered that unrealistic. One key point was the growth of "accidental complexity," unnecessary layers built on top of already-flawed systems, even as LLMs may make core coding tasks ("essential complexity") easier. Commenters debated whether AI can resolve essential complexity on its own and foresaw a future full of online AI agents. A major concern was maintaining and securing LLM-generated code, which can be unpredictable and error-prone, requiring skilled humans to review and repair "AI garbage." The discussion stressed the importance of *understanding* code, even without writing it, as AI-generated code becomes increasingly common.

Original Article

In the time it takes to get an undergraduate degree, Large Language Models (LLMs) have evolved from delivering realistic chat responses, to autonomously coordinating and completing tasks at the scale of full engineering teams.

In programming circles, Stack Overflow used to be where you landed when you got stuck. A simple search typically led you to discover another programmer who had suffered through the same problem, and if lucky, the solution too (relevant xkcd). Since 2022, however, the number of new Stack Overflow posts has fallen by 77%. Instead, developers have begun turning to tools like ChatGPT to get help, and now, even entire fleets of Coding Agents.

In my experience, coding agents can do amazing things. I've built numerous prototypes in a few hours that my own incompetence would once have prevented. Most recently it was a full-stack iOS app for photographing, indexing, and searching my personal storage bins. The back-end was familiar territory for me; the iOS front-end was not. So I let agents do 100% of the front-end work, and by the afternoon I had a fully functional prototype. I still run into the limits of these agents, though. In a recent case, I was adding a configurable field for an SSL certificate path. The task was mundane and well-defined, yet the agent fixated on adding an unrelated parameter. No amount of pleading could convince the agent that it was, in fact, wrong.

Steven Yegge offers a compelling vision of coding agents with his provocatively named orchestration tool, Gas Town:

Gas Town is an industrialized coding factory manned by superintelligent robot chimps, and when they feel like it, they can wreck your shit in an instant.

Relatable. Yet while it's clear that something major has changed, I remain unsure of how this will play out over the next decade or two. Some believe AI Super-intelligence is just around the corner (for good or evil). Others believe we're mistaking philosophical zombies for true intelligence, and speedrunning our own brainrot.

When facing an uncertain future, I try to anchor my predictions in historical outcomes. In the history of computing, it turns out that Vibe Coding, or more specifically Automatic Programming, has been invented before. Reading this history reshaped my own view of the future and my place in it. Your mileage may vary.

The Priesthood

Let's go back to the early 1950s. Computing machines had progressed from mechanical, abacus-like calculators to room-scale systems capable of ending wars. The ENIAC and UNIVAC were state of the art. The field was small, with fewer than 1,000 computers in existence.

Programming these machines was labour-intensive. It was done on punch cards, with the programmer defining exact mechanical steps from an operator's manual. Performing a simple two-number addition involved explicitly choreographing where the values would be stored, when the computation step would fire, and wiring the outputs back to meaningful addresses. There was complexity even in the most basic requirements.
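
To make that bookkeeping concrete, here is a minimal Python sketch of a hypothetical single-accumulator machine (an illustrative toy, not any real instruction set of the era): even one addition means hand-picking which memory cells hold the operands, loading one into the accumulator, firing the add, and writing the result back to an address the programmer must remember.

```python
# Illustrative toy machine: one accumulator plus numbered memory cells,
# with every address chosen and tracked by hand. Not a real 1950s ISA.
MEMORY = [0] * 16   # word-addressable storage
ACC = 0             # the machine's single accumulator

def load(addr):     # copy a memory word into the accumulator
    global ACC
    ACC = MEMORY[addr]

def add(addr):      # add a memory word into the accumulator
    global ACC
    ACC += MEMORY[addr]

def store(addr):    # write the accumulator back to memory
    MEMORY[addr] = ACC

# "Programming" a single addition: the programmer, not the language,
# decides the operands live at cells 3 and 4 and the result at cell 5.
MEMORY[3], MEMORY[4] = 2, 40
load(3)
add(4)
store(5)
print(MEMORY[5])    # -> 42
```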

The most notable computation of the time was forecasting the winner of the presidential election, when a UNIVAC-1 system correctly predicted Eisenhower's landslide from a sample of just 5.5% of voters. Another notable application was the SAGE air defence system, which consumed 250,000 to 500,000 lines of assembly code and employed 7,000 engineers, roughly 20% of the world's programmers at the time.

Leading minds of the day considered programming to be a black art, so advanced it could never be generalized for the masses. While efforts to simplify existed, like the Laning and Zierler system at MIT, they slowed machines down by a factor of five or ten. With the price of computation so high, that inefficiency was like lighting money on fire. The small group of contributors capable of producing efficient and correct code considered themselves exceedingly clever, and scoffed at the idea that they could be replaced. A man named John Backus later referred to this group of black art programmers as "The Priesthood," and he was one of them. But he had his own plans to disrupt the status quo.

Enter Automatic Programming

As head of IBM's Programming Research Group, John Backus was seeking to vastly simplify the effective use of the IBM 704. His motivation was blunt: "I didn't like writing programs," he later admitted. His team set out to build an abstraction on top of machine code, seeking to simplify logic without sacrificing speed. The stakes were high: computers cost nearly $1M, and more than 90% of elapsed project time went to planning, writing, and debugging while the machine sat idle.

In the 1956 Programmer's Reference Manual, Backus made a bold claim:

Since FORTRAN should virtually eliminate coding and debugging, it should be possible to solve problems for less than half the cost that would be required without such a system.

In hindsight, this seems absurd. There's a saying that "You can write FORTRAN in any language," a reminder that any language can produce buggy, illegible code. But compared to hand-coded assembly, FORTRAN delivered. Programs that previously required 1000 machine instructions could be written in 47 FORTRAN statements. GM's productivity studies showed FORTRAN reduced programming effort by a factor of five to ten. This was significant progress.

FORTRAN wasn't the only language dramatically simplifying computer coding. Grace Hopper, a computing pioneer who had helped build the UNIVAC-1, was also convinced that coding didn't need to be so opaque. She drove the creation of the FLOW-MATIC language, a predecessor to COBOL, and the first compiled language to adopt English-like syntax. Hopper's motivation was simple: you can't force a businessman to learn math notation.

Programming languages were moving from esoteric holes in punch cards toward portable, readable text.

What They Expected

Computer code was getting easier to write, and not everyone was impressed. In his own reflections, John Backus describes the reaction:

The Priesthood wanted and got simple mechanical aids for the clerical drudgery which burdened them, but they regarded with hostility and derision more ambitious plans to make programming accessible to a larger population.

The skeptics had economics on their side as well. Slowdown meant lost money, so Automatic Programming would have to match hand-coded efficiency to be adopted. There was a human component as well: programming was a hard-won skill, and implying it could be automated felt like an attack on the craft itself.

Meanwhile, analysts in the early 1960s predicted everyone would have to become a programmer to meet expected demand. Computers were proliferating. Software was essential. The prediction was that programmers would become as common as typists, and technology would be democratized.

What Actually Happened

Despite criticism grounded in the economics of the day, the efficiency problem was eventually solved. FORTRAN's optimizing compiler produced code that ran nearly as fast as hand-coded assembly. The IEEE Computer Society later noted it was "the best overall optimizer for not 5 years, not 10 years, but 20 years". Coupled with the falling cost of computation, the performance objection evaporated.

The democratization vision, however, only partially came true. COBOL became the most widely used language in the world, but not in the hands of the general masses. Businessmen still needed trained programmers. Reading and writing code proved to be very different skills. As Turing Award winner Fred Brooks put it, simpler programming languages reduce the accidental complexity of a task, but the essential complexity remains: you still have to know what you want the computer to do, and that can be very hard. While not everyone wrote computer programs, the number of computers in the world exploded, from under 1,000 in 1960 to more than 7 billion smartphones alone today. Computers became daily tools for a majority of the world, and not just for business.

With easier programming, teams didn't shrink either. Paradoxically, they grew. The U.S. went from 200,000 computer workers in 1970 to 1.6 million by 2015, with estimates of 26-28 million globally today. As a percentage of the S&P 500, technology went from around 3% when the index launched in 1957 to over 32% in 2024, now more than double the next largest sector.

The Priesthood lost its grip, and the black art of telling computers what to do made it to the masses.

Looking Back, Looking Forward

This post began as an exploration of history. When I read John Backus' reflections on being a programmer in the 1950s, they matched my own experience from my days optimizing GPU kernels as an HPC developer.

He states:

The programmer had to be a resourceful inventor to adapt his problem to the idiosyncrasies of the computer... he had to employ every trick he could think of to make a program run at a speed that would justify the large cost of running it. And he had to do all of this by his own ingenuity.

When I first read this quote, I realized that I am part of the modern Priesthood. Seventy years later, it still fits how I and others think of our craft. But history shows that even the sharpest skills will be made obsolete, while the need for niche expertise to push computation to its limits will remain. What changes are the problems those experts will be solving.

There's a saying in cycling - "It never gets easier, you just go faster." Perhaps with knowledge work, it doesn't get easier; the systems just get more complex. Before FORTRAN, crude election forecasts and tracking a handful of aircraft were state of the art. The next 50 years brought weather forecasting, universe-scale simulations, and the codification of the human genome. This wildly exceeded anyone's imagination at the dawn of the compiler era.

This lesson predates FORTRAN as well. When James Watt optimized the steam engine, the expectation was that coal use would plummet. Instead, it ballooned. This is known as Jevons Paradox, and it continues today. In 2016, Nobel Laureate Geoffrey Hinton predicted that radiologists would be obsolete within 5 years. Ten years later, radiologist Dana Smetherman, MD, notes that not only is demand strong, but "AI might even increase the workload by identifying additional findings".
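
As a hedged back-of-the-envelope illustration of that rebound effect, the Python sketch below uses a constant-elasticity demand curve with made-up numbers (the `elasticity` and `efficiency_gain` values are assumptions, not Watt-era coal data): when each unit of output needs half the resource but demand is elastic enough, total resource use still rises.

```python
# Toy constant-elasticity demand model: quantity demanded scales as
# price^(-elasticity). All numbers are illustrative assumptions,
# not historical coal (or radiology) data.
def total_resource_use(efficiency_gain, elasticity, base_quantity=100.0):
    """Resource consumed after the machine becomes `efficiency_gain` times
    more efficient, so each unit of output needs 1/efficiency_gain of the
    resource and its effective price falls by the same factor."""
    price_factor = 1.0 / efficiency_gain                        # cost per unit of output drops
    quantity = base_quantity * price_factor ** (-elasticity)    # demand responds to the lower price
    return quantity / efficiency_gain                           # resource = units * resource per unit

before = total_resource_use(efficiency_gain=1.0, elasticity=1.5)
after = total_resource_use(efficiency_gain=2.0, elasticity=1.5)
print(f"relative resource use: {after / before:.2f}x")  # ~1.41x: efficiency doubled, use rose
```

With elasticity below 1 the same arithmetic shows total use falling, which is the usual caveat: the paradox only bites when demand for the now-cheaper capability is elastic.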

When primed with lessons from history, I find my own technological arrogance fading. My concerns about obsolescence have shifted toward curiosity about what remains to be built. The accidental complexity of coding is plummeting, but the essential complexity remains. The abstraction is rising again, to tame problems we haven't yet named.
