个人人工智能的无头化方案

个人人工智能的无头化方案
Headless Everything for Personal AI

原始链接: https://interconnected.org/home/2026/04/18/headless

## 无头服务崛起与设计的未来应用程序和服务交互的未来正朝着“无头”架构转变——为人工智能代理提供访问，而无需传统的用户界面。这得益于人工智能辅助带来的卓越用户体验，以及与屏幕抓取 GUI 相比，人工智能任务的更高可靠性。这种转变已经发生，这体现在像 MCP 这样的 AI 专用 API 的出现以及命令行界面 (CLI) 的普及。像 Google Workspace CLI (gws) 和 Salesforce CLI 这样的工具允许通过文本命令直接与服务交互，从而为人工智能实现强大且可组合的工作流程——并且越来越多地，为用户提供服务。 CLI 在安全性方面具有优势，因为最大限度地减少复杂性可以降低漏洞，这对于人工智能发现漏洞的能力而言至关重要。前端设计不会消失，但会演变为专注于建立强大的“氛围”和品牌标识，因为用户很可能主要通过其人工智能代理与服务交互。重点将从优化用户旅程转移到为人工智能提供安全、简化的访问，从而促使银行和政府部门等企业优先开发强大的 CLI 工具。这代表着我们与技术互动方式的根本性变化，优先考虑人工智能时代的功能性和安全性。

这个Hacker News讨论围绕“个人AI”的未来，以及AI代理与日常应用程序之间互联互通的可能性。用户‘markusw’分享了一个关于“为个人AI构建无界面应用”的链接，引发了‘stephenlf’对采用时间线的预测。 ‘Stephenlf’认为广泛集成比许多人预期的要遥远，预测需要10年时间。最初，像谷歌和微软这样的科技巨头将在其现有生态系统中扩展AI（微软的尝试可能因闭源而失败）。随着时间的推移，接受度会提高，最终苹果可能会建立一个主导标准（“AILP”），然后最终与其他协议融合。评论者强调了潜在的影响，指出即使严重依赖命令行工具（如Salesforce CLI）的开发者，也可能会频繁地与这些AI助手互动。

原文

It’s pretty clear that apps and services are all going to have to go headless: that is, they will have to provide access and tools for personal AI agents without any of the visual UI that us humans use today.

By services I mean things like: getting a new passport; finding and booking a hotel or a flight; managing your bank account; shopping for t-shirts with a minimum cotton weight from brands similar to ones that you’ve bought from before.

Why? Because using personal AIs is a better experience for users than using services directly (honestly); and headless services are quicker and more dependable for the personal AIs than having them click round a GUI with a bot-controlled mouse.

Where this leaves design for the services… well, I have some thoughts on that.

Headless services? They’re happening already.

Already there is MCP, as previously discussed (2025), a kind of web API specifically for AI. For instance, best-in-class call transcriber app Granola recently released their MCP and now you can ask your Claude to pull out the meeting actions and go trawling in your personal docs to find answers to all the question. A good integration.

But command-line tools are growing in popularity, although they used to be just for developers. So you can now create a spreadsheet by typing in your terminal:

gws sheets spreadsheets create --json '{"properties": {"title": "Q1 Budget"}}'

Here are some recently launched CLI tools:

gws: Google Workspace CLI, Drive, Gmail, Calendar, and every Workspace API. Zero boilerplate. Structured JSON output. 40+ agent skills included.
Obsidian CLI to do anything you can do with the (very popular) note-taking app - like keep daily notes, track and cross off tasks, search - from the CLI
Salesforce CLI and, look, I don’t really understand what the Salesforce business operating system does, but the fact it has a CLI too is significant.

And here is CLI-Anything (31k stars on GitHub) which auto-generates CLIs for ANY codebase.

Why CLIs?

It turns out that the best place for personal AIs to run is on a computer. Maybe a virtual computer in the cloud, but ideally your computer. That way they can see the docs that you can see, and use the tools that you can use, and so what they want is not APIs (which connect webservers) but little apps they can use directly. CLI tools are the perfect little apps.

CLIs are composable so they are a better fit for what users actually do.

By composable I mean you can: query your notes then jump to a spreadsheet then research the web then jump back to the spreadsheet then text the user a clarifying question then double-check your notes, all by bouncing between CLIs in the same session.

A while back app design got obsessed with “user journeys.” Like the journey of a user finding a hotel then booking a hotel then staying in it and leaving a review.

But users don’t live in “journeys.” They multitask; they talk to people and come back to things; they’re idiosyncratic Try grabbing the search results from the Airbnb app and popping them in a message to chat on the family WhatsApp, then coming back to it two days later. It’s a pain and you have to use screenshots because apps and their user journeys are not composable.

CLIs are composable because they came originally from Unix and that is the Unix tools philosophy: tools were designed so that they could operate together.

Personal AIs like Openclaw or [Poke](https://poke.com do what users want and don’t follow “designed” user journeys, and as a result the composed experience is more personal and way better. CLIs are a great underlying technology for that.

CLIs are smaller than regular apps and so they are easier to secure.

The alternative to providing special tools for AIs is that the AIs use the same browser-based apps that we do, and that’s a terrifying prospect.

For one, AIs are really good at finding security holes. The new Mythos model from Anthropic is so good at discovering security flaws that it has been held back from the public and governments are convening emergency meetings of the biggest banks.

And including a UI increasing complexity and makes security holes more likely.

Here’s a recent shocking example. Companies House is the national register of all companies, directors, and accounts for England and Wales. Users could view and edit any other user’s account:

a logged-in company director could exploit the flaw by starting from their own dashboard and then trying to log into another company’s account.

Once they reach the 2FA block, which they would not be able to pass, all that was required was to click the browser’s back button a few times. Typically, the user would be taken back to their own dashboard, but the bug instead returned them to the company they had tried to log into but couldn’t.

– The Register, Flaw in UK’s corporate registry let directors rummage through rival records

This bug had been present since October 2025.

Imagine a future where personal AIs are filing company records, and one of them notices this security hole overnight and posts about it on moltbook or whatever agent-only social network is most popular. The other agents would exploit the system up, down and sideways before the engineering team woke up.

The only viable solution is that services need to security hardened, and to do that they need to be simplifying and minimised. Again, CLI tools are a great fit.

What does this mean for front-end design?

Design won’t go anywhere.

Sure, the front-end should be driving the same CLI tools that agents use.

Arguably it’s more important than ever: human users will encounter services, figure out what they can do, and pick up their vibe from using the app, just as now.

Then they’ll tell their personal AI about the service and never see the front-end again, or re-mix it into bespoke personal software.

So from a usability perspective I see front-end as somewhat sacrificial. AI agents will drive straight through it; users will encounter it only once or twice; it will be customised or personalised; all that work on optimising user journeys doesn’t matter any more.

But from a vibe perspective, services are not fungible. e.g. if you’re finding a restaurant then Yelp, Google Maps, Resy, and The Infatuation are all more-or-less equivalent for answering that question but clearly they are completely different and you’ll use different services at different times.

Understanding that a service is for you is 50% an unconscious process - we call it brand - and I look forward to front-end design for apps and services optimising for brand rather than ease of use.

If I were a bank, I would be releasing a hardened CLI tool like yesterday.

There is so much to figure out:

How do permissions work? Should the user get a notification from their phone app when the agent strays outside normal behaviour? How do I give it credentials to act on my behalf, and how long do they last?
How does adjacency work? My bank gives me a current account in exchange for putting a “hey, get a loan!” button on the app home screen. How do you make offers to an agent?

Headless banks.

Headless government?

I’d love to show you a worked example here. I vibed up a suite of four CLI tools that wrap four different services from UK government departments.

If I were renting a house, I would set my agent to learning about the neighbourhoods using one of these tools. Another tool will be helpful next time I’m buying a used car. (There’s a Companies House command-line tool too.)

But I won’t show you the tools because I don’t want to be responsible for maintaining and supporting them.

I wish Monzo came with an official CLI. I wish Booking.com came with a CLI. I reckon, give it a year, they will.

个人人工智能的无头化方案 Headless Everything for Personal AI

个人人工智能的无头化方案
Headless Everything for Personal AI