Cohere 面向开发者的首个模型

Cohere 面向开发者的首个模型
Cohere's First Model for Developers

原始链接: https://cohere.com/blog/north-mini-code

Cohere 发布了其首个开源智能体编程模型 **North Mini Code**。该模型专为软件开发而设计，采用混合专家（MoE）架构，拥有 30B 总参数及 3B 激活参数，在强大的性能与硬件效率之间实现了平衡。主要亮点包括： * **性能与效率：** 该模型针对代码生成、架构映射和代码审查进行了优化。内部测试显示，其输出吞吐量比 Devstral Small 2 等同类模型高出 2.8 倍，Token 间延迟优化了 30%。 * **智能体能力：** 专为处理复杂的智能体工作流而构建，擅长编排子智能体和终端任务。 * **易用性：** 模型采用 Apache 2.0 许可证发布，旨在推动“主权 AI”，允许开发者在本地或私有环境下部署，从而避免厂商锁定。 * **可用性：** 模型权重已在 Hugging Face 上发布，可通过 Cohere 的 Model Vault 进行托管部署，并支持通过 OpenCode 使用各种编程智能体。 Cohere 希望 North Mini Code 能够成为新一代开源模型的开端，并邀请社区通过 X、Discord 和 Reddit 提供反馈，以指导未来的开发方向。

Hacker News 最新 | 过往 | 评论 | 提问 | 展示 | 招聘 | 提交登录 Cohere 面向开发者的首个模型 (cohere.com) 6 分，由 hmokiguess 发布于 1 小时前 | 隐藏 | 过往 | 收藏 | 1 条评论帮助 cyanydeez 15 分钟前 [–] 看起来就是 Qwen 3.6 coder。回复指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系搜索：

Today we're launching North Mini Code open-source. A mixture-of-experts (MoE) model, North Mini Code is Cohere's first agentic coding model, and the inaugural member of our next generation of powerful models.

At 30B total parameters with just 3B active, North Mini Code delivers strong software development performance without demanding extensive hardware to match. Efficient by design, it's built to run where you need it.

Freely available under an Apache 2.0 license, North Mini Code advances Cohere’s mission to make sovereign AI a practical reality, giving developers direct access to agentic coding capabilities. We're building in the open, because the future of AI should be shaped by the people running, testing, and improving it.

Download the weights on Hugging Face, or deploy in a dedicated, managed inference environment on Model Vault. Alternatively, try it for free in your harness of choice on OpenCode or with a Cohere API key. Share what you build and tag @ Cohere on X or Discord, or engage with us on Reddit.

Snapshot

Model	North-Mini-Code-1.0
License	Apache 2.0
Model size	30B total; 3B active
Context length	256K total context; 64K max generation
Optimized for	Code generation, agentic software engineering, and terminal tasks
Availability	Hugging Face (Weights), Cohere API, Cohere Model Vault, OpenRouter
Hardware (minimum)	1× H100 @ FP8

Agentic coding capabilities

North Mini Code achieves competitive scores across benchmarks against models of this size class, demonstrating strong performance in real-world software engineering tasks.

Benchmarks comparing North Mini Code to leading open-source models of small sizes. — Image 1: North Mini Code’s performance in agentic software engineering and terminal tasks, along with complex code generation benchmarks, compared to leading open-source models of a similar size. ¹ ²

North Mini Code’s benchmark scores translate to a 33.4 on the Artificial Analysis Coding Index, a competitive position among similarly sized models.

The speed advantage for developer tasks

North Mini Code is designed for speed and efficiency, with a strong focus on minimizing total cost of ownership as we continue to refine and scale the model.

In our testing, North Mini Code achieved up to 2.8x higher output throughput than Devstral Small 2 under identical concurrency levels and hardware configurations. In practical terms, that translates to nearly three times the work rate, enabling faster iteration while reducing computational overhead.

North Mini Code also demonstrated a 30% advantage in inter-token latency, a metric that reflects the consistency and pacing of token generation. Time-to-first-token (TTFT) performance was more closely matched between the two models, with Devstral Small 2 maintaining a slight edge across the tested conditions.

Image 2: North Mini Code’s output speed and latency compared to Devstral Small 2, across high and low concurrencies, in internal tests using coding prompts.

Sovereign open models for developers

North Mini Code is our first open-source model for developers. As coding agents transform software engineering, developers need control and flexibility over their agentic coding infrastructure.

North Mini Code represents a step forward in small agentic coding models that can accomplish tasks that matter to developers. Specifically, it is built for agentic workflows, including understanding and orchestrating sub-agents, mapping systems architecture, and running code reviews. Deploy on-prem or locally, on your own terms.

Community feedback will directly shape our roadmap as we expand the ecosystem toward more open and sovereign developer models. Try North Mini Code when you need freedom from vendor constraints, and help us build what's next.

What’s next?

North Mini Code launches as the first—but certainly not the last—of Cohere's new generation of powerful models, designed for a more sovereign open-source ecosystem.

We're committed to increasing our capabilities, with community input informing what comes next.

Getting started

Help us build a complete sovereign AI ecosystem for software development by trying North Mini Code. North Mini Code is available for free on Hugging Face and Model Vault—our fully managed inference platform. We've specifically trained it for compatibility with OpenCode, but it works with most coding agents.

Share what you build and tag @ Cohere on X or Discord, or engage with us on Reddit to help shape the future of sovereign models.

Visit our documentation for detailed model specs, deployment guides, and cookbooks to get started.

Footnotes

¹ We used publicly reported scores for competitor models either from original reports or Artificial Analysis Intelligence Index where available. Additionally, Gemma 4’s scores for agentic coding tasks were reported by Qwen team. For the benchmark results that any public report is missing denoted by (*) in Image 1, we run internally with recommended model configuration.

² We evaluated North Mini Code using “SWE-agent” harness for SWE-Bench Verified and SWE-Bench Pro, and a simple ReAct harness employing a single terminal-use tool for Terminal Bench v2. For Terminal Bench Hard, we used Terminus-2 harness for both North Mini Code and the other models that are evaluated internally.