Codex 应用很酷,它说明了 IDE 和编码 GUI 的左移趋势。
The Codex app illustrates the shift left of IDEs and coding GUIs

原始链接: https://www.benshoemaker.us/writing/codex-app-launch/

## Codex 与软件开发的未来 OpenAI 的新款 Codex 桌面应用,虽然并非革命性,但预示着软件构建方式的重大转变。作者将 Codex 与 Claude Code 结合使用,轻松管理“工作树”——允许并行开发较小功能,同时专注于主项目。这使得更高效的多代理工作流程成为可能。 然而,该应用真正的意义在于它所代表的更广泛趋势:远离直接与代码交互。作者认为,现代 AI 驱动的开发正在将重点从调试代码本身,转移到管理*生成*代码的*系统*。 这可以被视为一个光谱:从传统的 IDE 编码,到 AI 辅助编码,再到代理 IDE,最后到像 Codex 这样的多代理编排——界面中心在于管理 AI 代理,而非阅读代码行。作者认为,最终方向是“规格说明”——优先考虑需求和设计,代码成为次要输出。他们目前正在构建一个专注于这种“规格说明优先”方法的工具。

## AI 编程与代码阅读的未来 - Hacker News 摘要 Hacker News 上关于 Codex 应用的讨论引发了担忧:AI 驱动的代码生成导致开发者*不*阅读他们生成的代码。有人称,AI 编程领域的领军人物专注于管理代码生成系统,而非理解代码本身,依赖于黑盒测试和迭代调整,而不检查输出结果。 这种做法受到质疑,评论者质疑其效率以及可能积累的技术债务。人们对以这种方式构建的软件的长期可维护性表示担忧,并将其与存在问题的遗留系统相提并论。 然而,也有人认为 AI 可以比人类更快地*阅读*代码,并且重点正在转向定义健壮的规范和系统架构,使代码成为一个实现细节。还有人指出,模型的改进正在克服最初的、在无人监督项目中的问题。这场争论的中心在于,这是否代表对不透明系统的危险依赖,或者只是软件开发的一种自然演变。
相关文章

原文

No, it doesn’t. The Codex desktop app dropped yesterday. You’ll see breathless Twitter posts and YouTube videos about how it changes everything. It doesn’t. But it is pretty cool, and it’s part of a larger trend worth paying attention to. I’m going to talk briefly about how it’s changing my workflow, and then zoom out to what it means that this app exists at all.

My Workflow (For Now)

I’ll write a longer post on this, but the quick version:

  • My primary driver is Claude Code in the terminal. I think it has the best features, the most hooks, and the most ability to create a clean development workflow with all the checks I want.
  • The Codex app is my parallelization layer. The thing that’s cool about it (and Conductor, which is similar) is that it makes Git worktrees easy to use. That means real parallelization.

Here’s how I’m experimenting with it:

  1. I have my main feature or project running in Claude Code in a terminal window
  2. Whenever I come up with changes, bug fixes, or investigations outside the scope of that feature, I spin up a worktree in the Codex app. I can chat with it separately. It lets me know when it needs input. It’s totally isolated, and I can merge it back whenever I want.

The TLDR: Codex app is OpenAI’s supported UI for multi-agent parallelized development. In my workflow, I use it to develop small features in parallel while I’m working on the main thing in Claude Code.

The Bigger Picture

The reason I find this interesting isn’t the app itself, but what it says about where things are headed. I think about IDEs a lot because they’re a lens into where software development is going. I’ve said this before: software development will be unrecognizable in two to three years. And what’s happening with IDEs is proof.

“IDE” stands for integrated development environment. The name doesn’t imply it has to be about reading and writing code - but that’s what it’s always been. That’s changing.

Here’s the thing: I don’t read code anymore. I used to write code and read code. Now when something isn’t working, I don’t go look at the code. I don’t question the code. I either ask one of my coding agents, or - more often - I ask myself: what happened with my system? What can I improve about the inputs that led to that code being generated?

The code isn’t the thing I’m debugging. The system that produced the code is. The people really leading AI coding right now (and I’d put myself near the front, though not all the way there) don’t read code. They manage the things that produce code.

The Continuum

The image above illustrates how I think about this landscape. There’s a spectrum with three major zones: Code, Agents, and Specs. The further left you move, the higher up the stack you get.

Code (right side): Traditional IDEs. VS Code, JetBrains. You read code, you write code.

Code + AI: AI-assisted features. Autocomplete, inline suggestions. GitHub Copilot lives here. The human is still driving.

Agentic IDEs: Cursor, Windsurf. Code and agents combined. The AI makes autonomous multi-file edits, runs terminal commands, iterates on its own work. But you’re still looking at code.

Multi-Agent Orchestration: Claude Code, Codex CLI, Codex app, Conductor. The whole interface is about managing agents. You’re not staring at code - you’re dispatching tasks, watching progress, reviewing PRs. Agent inbox.

Specs (left side): Kiro, GitHub Spec Kit, Vibe Scaffold. The spec is the primary artifact. Requirements → design → tasks → implementation. Code is an output, not the thing you manage.

Where This Is Going

I think the industry is moving left. Toward specs. The code is becoming an implementation detail. What matters is the system that produces it - the requirements, the constraints, the architecture. Get those right, and the code follows. I’m actually building something in this area, focused on specs (not Vibe Scaffold 🙂). Hopefully I have some details in the next few weeks.

联系我们 contact @ memedata.com