Mistral AI 发布 Forge

Mistral AI 发布 Forge
Mistral AI Releases Forge

## Forge：植根于您知识的企业级人工智能 Mistral AI推出Forge，一个系统，使企业能够构建基于*自身*专有数据训练的高级人工智能模型，摆脱对通用公开信息的依赖。Forge通过允许组织将独特的知识——政策、代码、流程——直接融入人工智能模型，弥合了广泛人工智能能力与特定业务需求之间的差距。通过预训练、后训练和强化学习，Forge支持构建能够理解内部术语和工作流程的模型，从而带来更可靠、更准确的企业级智能体。至关重要的是，Forge优先考虑控制和战略自主性；模型始终受组织管理，这对受监管行业至关重要。 Forge支持多种模型架构和多模态输入，并设计用于通过反馈和评估持续改进。其应用范围涵盖政府、金融、软件开发和制造业，为能够执行复杂任务（如政策分析、代码辅助和运营诊断）的智能体提供支持。最终，Forge将人工智能从外部工具转变为战略资产，并*随着*组织专业知识的积累而不断发展。

## Mistral AI 发布 Forge Mistral AI 发布了“Forge”，一项专注于为客户定制 AI 模型开发的新服务。与优先考虑最大、最先进模型的竞争对手不同，Mistral 战略性地瞄准定制工程并服务于欧洲客户。这一消息在 Hacker News 上分享，引发了关于 Mistral 独特方法的讨论。评论员指出，小型公司进行模型训练的难度，并强调像 Forge 以及最近发布的“unsloth”等工具正在使其更容易实现。 Forge 似乎超越了简单的微调，Mistral 与客户合作进行*预*训练，甚至整合了强化学习 (RL)，这是 AI 开发中一个以复杂著称的领域。社区对 Mistral 的发展方向以及其扩大强大 AI 能力访问潜力的可能性表示乐观。

原文

Today, we’re introducing Forge, a system for enterprises to build frontier-grade AI models grounded in their proprietary knowledge.

Most AI models available today are trained primarily on publicly available data. They are designed to perform well across a broad range of tasks. But enterprises operate using internal knowledge: engineering standards, compliance policies, codebases, operational processes, and years of institutional decisions.

Forge bridges the gap between generic AI and enterprise-specific needs. Instead of relying on broad, public data, organizations can train models that understand their internal context embedded within systems, workflows, and policies, aligning AI with their unique operations.

Mistral AI has already partnered with world-leading organizations, like ASML, DSO National Laboratories Singapore, Ericsson, European Space Agency, Home Team Science and Technology Agency (HTX) Singapore, and Reply to train models on the proprietary data that powers their most complex systems and future-defining technologies.

Forge Users

Training models on institutional knowledge.

Forge enables enterprises to build models that internalize their domain knowledge. Organizations can train models on large volumes of internal documentation, codebases, structured data, and operational records. During training, the model learns the vocabulary, reasoning patterns, and constraints that define that environment.

This allows teams to develop models and agents that reason using internal terminology and understand enterprise workflows. Forge supports modern training approaches across several stages of the model lifecycle:

Pre-training allows organizations to build domain-aware models by learning from large internal datasets.
Post-training methods allow teams to refine model behavior for specific tasks and environments.
Reinforcement learning helps organizations align models and agents with internal policies, evaluation criteria, and operational objectives while improving agentic performance in real environments, like complex orchestration, tool use, and decision-making.

Together, these capabilities allow enterprises to move beyond generic AI behavior and develop models that reflect institutional intelligence.

Control and strategic autonomy.

For many organizations, AI adoption raises questions about control over models, data, and long-term intellectual property. Forge allows enterprises to build models that remain under their control. Models can be trained using proprietary datasets and governed using internal policies, evaluation standards, and operational requirements.

This allows organizations to retain control over how their knowledge is encoded and used by AI systems. In regulated environments, this level of control is critical. Enterprises must ensure that models reflect compliance requirements, operational constraints, and internal governance frameworks.

By allowing organizations to build models grounded in their own knowledge and operated within their own infrastructure environments, Forge enables a higher degree of strategic autonomy as AI becomes part of core enterprise systems.

Custom models make enterprise agents reliable.

Enterprise agents must do more than generate answers. They need to navigate internal systems, use tools correctly, and make decisions within the constraints of the organization.

Custom models make this possible by providing agents with a deeper understanding of the environment in which they operate. Instead of relying on generic reasoning, agents powered by domain-trained models can interpret internal terminology, follow operational procedures, and understand how different systems and data sources relate to one another.

This changes how agents behave in practice. Tool selection becomes more precise. Multi-step workflows become more reliable. Decisions can reflect internal policies and business logic rather than generic assumptions.

The result is agents that move beyond simple assistance and begin to function as operational components of enterprise systems capable of executing tasks, coordinating across tools, and supporting complex processes with greater accuracy and speed.

Marketecture Forge

Support for multiple model architectures.

Forge offers flexibility with support for both dense and mixture-of-experts (MoE) architectures. This lets organizations optimize for performance, cost, and operational constraints. Dense models provide strong general capability across a wide range of enterprise tasks, while MoE enables very large models to run more efficiently; delivering comparable capability with lower latency and compute cost than a dense model of similar scale. Forge also supports multimodal inputs where required, allowing models to learn from text, images, and other data formats.

Agent-first by design

Code agents are becoming the primary users of developer tools, so we built Forge for them first, not as an afterthought. An autonomous agent like Mistral Vibe can use it to fine-tune models, find optimal hyperparameters, schedule jobs, and generate synthetic data to hill-climb evals. Throughout the process, Forge monitors metrics to make sure the model isn't regressing on the benchmarks you care about. Because Forge handles infrastructure and includes battle-tested recipes for data pipelines and Mistral AI's own training methods, anyone, including agents, can customize a model just by writing plain English.

Continuous improvement through reinforcement learning and evaluation.

Enterprise environments evolve constantly. Regulations change. Systems are updated. New data becomes available. Forge is designed for continuous adaptation rather than one-time training. Organizations can use reinforcement learning pipelines to refine model behavior using feedback derived from internal evaluations and operational workflows.

This allows teams to improve models over time and align outputs with organizational objectives. Evaluation frameworks allow enterprises to test models against internal benchmarks, compliance rules, and domain-specific tasks before deploying them into production environments.

The result is a model lifecycle that supports ongoing improvement rather than static deployment.

Examples of enterprise applications.

Organizations can apply Forge across many types of enterprise workflows.

Government agencies can build models trained for different languages and dialects, policy frameworks, regulatory texts, and administrative procedures. This allows AI agents to be reliable when working on policy analysis, public service delivery, and operational planning while reflecting institutional mandates and governance requirements.

Financial institutions can train models on compliance frameworks, risk procedures, and regulatory documentation. This allows AI systems to produce outputs consistent with internal governance policies.

Software teams can train models on proprietary codebases and development standards. The real value comes from shaping models to perform exceptionally well on the specific engineering tasks that drive productivity and quality inside the company. A model trained on proprietary repositories and development standards can better understand internal abstractions, patterns, and architectural choices. When post-trained for priority workflows like implementation, debugging, migration, review, or system design support, it can provide outputs that are more context-aware, more consistent with internal practices, and more useful across the software development lifecycle.

Manufacturers can train models on engineering specifications, operational data, and maintenance records. These models can support diagnostics, design analysis, and operational decision-making.

Large enterprises can deploy agents built on models trained on internal knowledge systems. These agents can use company documentation, operational records, and historical decisions to assist employees across complex workflows. Because the underlying custom models understand the organization’s terminology and knowledge structures, agents can retrieve information and execute tasks with greater accuracy and speed.

In each case, the objective is the same: enabling models and the agents built on top of them to operate within the organization’s domain context.

Build your own frontier models with Forge.

AI models are becoming a foundational layer of enterprise infrastructure. As organizations integrate AI agents into core operations, the ability to encode institutional knowledge into model behavior will become increasingly important.

Forge enables enterprises to build and continuously improve models trained on their own data and aligned with their operational context. These models can power AI systems and agents that operate using the organization’s terminology, processes, and constraints. Over time, this approach allows organizations to treat AI models not simply as external tools, but as strategic assets that evolve alongside their knowledge, processes, and expertise.

If your organization is ready to explore what it means to build AI around its own knowledge, sign up to learn more about Forge.