管道（互连）

管道（互连）
Context plumbing

原始链接: https://interconnected.org/home/2025/11/28/plumbing

## 人工智能的未来：理解意图与语境最近构建人工智能系统的工作强调了“语境管道”的关键重要性——高效地向人工智能代理传递相关信息。核心思想是，成功的人工智能取决于理解*意图*——用户*想要*实现的目标，即使没有明确说明。能够最大程度减少“管理负担”的界面（例如直接触摸手机屏幕而不是使用鼠标）将占据主导地位。然而，理解意图需要丰富的*语境*。这超越了人工智能的一般知识，包括实时数据，例如用户历史记录、当前任务、共享知识和可用工具。这导致了“语境工程”——动态地向人工智能提供正确的信息。挑战在于语境的流动性和分散性。关键在于主动地从其来源“连接”语境到人工智能，预测需求而不是依赖缓慢的查找。这使得技术架构从传统的以数据库为中心的应用程序转向专注于连续数据流的系统，从而反映了用户对人工智能可以访问哪些信息的期望。最终目标是创建能够预测需求的人工智能——“理解我的意思”界面——这需要我们在构建和架构人工智能系统方面进行根本性的转变。

黑客新闻新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 Context plumbing (interconnected.org) 13 分，gmays 5小时前 | 隐藏 | 过去 | 收藏 | 2 评论 iammjm 2小时前 [–] 用户鼠标追踪很酷。但令人分心。我被它们分心了，停止了阅读回复 redhale 1小时前 | 父评论 [–] 同意，但看起来“静音模式”切换可以隐藏它们。指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请YC | 联系搜索：

原文

These past few weeks I’ve been deep in code and doing what I think about as context plumbing.

I’ve been building an AI system and that’s what it feels like.

Let me unpack.

Intent

Loosely AI interfaces are about intent and context.

Intent is the user’s goal, big or small, explicit or implicit.

Uniquely for computers, AI can understand intent and respond in a really human way. This is a new capability! Like the user can type I want to buy a camera or point at a keylight and subvocalise I’ve got a call in 20 minutes or hit a button labeled remove clouds and job done.

Companies care about this because computers that are closer to intent tend to win.

e.g. the smartphone displaced the desktop. On a phone, you see something and then you touch it directly. With a desktop that intent is mediated through a pointer – you see something on-screen but to interact you tell your arm to move the mouse that moves the pointer. Although it doesn’t seem like much your monkey brain doesn’t like it.

So the same applies to user interfaces in general: picking commands from menus or navigating and collating web pages to plan a holiday or remembering how the control panel on your HVAC works. All of that is bureaucracy. Figuring out the sequence for yourself is administrative burden between intent and result.

Now as an AI company, you can overcome that burden. And you want to be present at the very millisecond and in the very location where the user’s intent - desire - arises. You don’t want the user to have the burden of even taking a phone out of their pocket, or having to formulate an unconscious intent into words. Being closest to the origin of intent will crowd out their competitor companies.

That explains the push for devices like AI-enabled glasses or lanyards or mics or cameras that read your body language.

This is why I think the future of interfaces is Do What I Mean: it’s not just a new capability enabled by AI, there’s a whole attentional economics imperative to it.

Context

What makes an AI able to handle intent really, really well is context.

Sure there’s the world knowledge in the large language model itself, which it gets from vast amounts of training data.

But let’s say an AI agent is taking some user intent and hill-climbing towards that goal using a sequence of tool calls (which is how agents work) then it’s going to do way better when the prompt is filled with all kinds of useful context:

For example:

Background knowledge from sources like Wikipedia or Google about what others have done in this situation.
Documentation about the tools the agent will use to satisfy the intent.
The user’s context such as what they’ve done before, the time of day, etc.
Tacit knowledge and common ground shared between the user and the AI, i.e. what we’re all assuming we’re here to do.
The shared “whiteboard”: the document we’re working on.
For the agent itself, session context: whether this task is a subtask of a larger goal, what’s worked before and what hasn’t, and so on.

This has given rise to the idea of context engineering (LangChain blog):

Context engineering is building dynamic systems to provide the right information and tools in the right format such that the LLM can plausibly accomplish the task.

btw access to context also explains some behaviour of the big AI companies:

If you want to best answer user intent, then you need to be where the user context is, and that’s why being on a lanyard with an always-on camera is preferred over a regular on-demand camera, and why an AI agent that lives in your email archive is going to be more effective than one that doesn’t. So they really wanna get in there, really cosy up.

(And what’s context at inference time is valuable training data if it’s recorded, so there’s that too.)

Plumbing?

What’s missing in the idea of context engineering is that context is dynamic. It changes, it is timely.

Context appears at disparate sources, by user activity or changes in the user’s environment: what they’re working on changes, emails appear, documents are edited, it’s no longer sunny outside, the available tools have been updated.

This context is not always where the AI runs (and the AI runs as close as possible to the point of user intent).

So the job of making an agent run really well is to move the context to where it needs to be.

Essentially copying data out of one database and putting it into another one – but as a continuous process.

You often don’t want your AI agent to have to look up context every single time it answers intent. That’s slow. If you want an agent to act quickly then you have to plan ahead: build pipes that flow potential context from where it is created to where it’s going to be used.

How can that happen continuously behind the scenes without wasting bandwidth or cycles or the data going stale?

So I’ve been thinking of AI system technical architecture as plumbing the sources and sinks of context.

In the old days of Web 2.0 the go-to technical architecture was a “CRUD” app: a web app wrapping a database where you would have entities and operations to create, read, update, and delete (these are also the HTTP verbs).

This was also the user experience, so the user entity would have a webpage (a profile) and the object entity, say a photo, would have a webpage, and then dynamic webpages would index the entities in different ways (a stream or a feed). And you could decompose webapps like this; the technology and the user understanding aligned.

With AI systems, you want the user to have an intuition about what context is available to it. The plumbing of context flow isn’t just what is technically possible or efficient, but what matches user expectation.

Anyway.

I am aware this is getting - for you, dear reader - impossibly abstract.

But for me, I’m building the platform I’ve been trying to build for the last 2 years only this time it’s working.

I’m building on Cloudflare and I have context flowing between all kinds of entities and AI agents and sub-agents running where they need to run, and none of it feels tangled or confusing because it is plumbed just right.

And I wanted to make a note about that even if I can’t talk specifically, yet, about what it is.

管道（互连） Context plumbing

管道（互连）
Context plumbing