开源权重模型的难以承受的廉价感

开源权重模型的难以承受的廉价感
The Unbearable Cheapness of Open Weight Models

原始链接: https://jamesoclaire.com/2026/06/25/the-unbearable-cheapness-of-open-weight-models/

DeepSeek V4 等高性能、低成本模型的出现，凸显了经济型选项与 OpenAI 及 Anthropic 等“前沿”模型之间巨大的价格差距。这种差异引发了质疑：行业领头羊是在通过制造人为稀缺、将自身定位为奢侈品牌来刻意维持高价，还是因为其昂贵的架构导致它们根本无法进行价格竞争。作者担忧，在位者面对这种竞争压力时，可能不会选择通过创新来提升效率，而是会以“中国恐惧”为幌子，游说政府对开源权重模型进行限制。这种转变将威胁到开源生态系统——而开源历来是美国技术领先地位的基石。为了应对这一问题，作者提倡“真正”的开源人工智能，即像艾伦人工智能研究所（Allen AI）的 OLMO 项目那样，实现训练数据流水线的完全透明。随着竞争加剧，人工智能的未来可能取决于行业是优先选择普及型的开放式创新，还是继续受困于当前前沿实验室那些高墙林立、定价高昂的生态系统中。

Hacker News 最新 | 往期 | 评论 | 提问 | 展示 | 招聘 | 提交登录开源权重模型的难以承受之廉价 (jamesoclaire.com) 14 分，ddxv 发布于 3 小时前 | 隐藏 | 往期 | 收藏 | 1 条评论 | 帮助 linzhangrun 1 分钟前 [–] 随着推理成本的降低，GPT 和 Claude 变得更便宜也不足为奇。两年前，o1 是最强的模型，其成本远高于 Fable，但其智能程度远不及现在可以在 DGX Spark 上轻松运行的 Qwen 3.6 35B。回复指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系搜索：

原文

Today I was setting up Hermes to see how it does with web research. I chose DeepSeek V4 because I know it is cheap, but seeing it’s pricing next to Anthropic and OpenAI ‘frontier’ models is crazy. Nearly a 50x price increase based on tokens alone, let alone how much pondering any of their models might fall into (using more tokens for the same task).

What worries me about this is that Anthropic and OpenAI seem to have backed themselves into a corner of high costs. Can they reasonably decrease their prices by 20-50x to compete with DeepSeek or Xiaomi’s Mimo?

Open Weight vs Low Cost

Are these models cheap because they are open weight and having hundreds or people stress test running them on different hardware helped to lower the cost? Or is it that they are being provided as loss leaders to drive the prices down?

How do you keep prices high for commodity products?

You manufacture scarcity. You sell luxury and premium branding. This is what OpenAI and Anthropic seem to be doing by gating ‘frontier’ model usage behind higher walls.

This is how luxury brands have sold cars and hand bags forever. They are clubs and status symbols for the rich and not meant to be widely distributed.

Will Anthropic & OpenAI lean on China fears to push bans on open weight models?

This has been my fear for a few months now and each week that goes by seems to support this. How do you manufacture scarcity? One easy way is to fear monger and get the government to help restrict access to competition.

Why not compete?

The US used to be such a champion of open source, and I would hope that serious open source competition can come out of the US to prove that open weight and open source models are ultimately the future.

Google Gemma 4 was released in April 2026
Meta had llama which hasn’t had a release
OpenAI last released open weight gpt models in 2025
Anthropic to my knowledge has never released any open weight model

True Open Source vs Open Weight

I think the leap frog scenario for Open Source will be the true Open Source models where the data pipeline for training is also open sourced.

https://allenai.org/olmo -> You can download these models now and they’re seeing increasing popularity. That being said, they are a bit out of date, with data cutoffs in Dec 2024

Looking to the future, the US NSF partnered with Nvidia to enable Allen AI to develop a true fully open AI:
https://www.nsf.gov/news/nsf-nvidia-partnership-enables-ai2-develop-fully-open-ai

Bonus:

Curious to dig more into Claude / ChatGPT tech stacks? Check out the tools they used to build their iOS and Android apps:

Claude Android
ChatGPT Android

You can navigate to SDKs to view even more detailed breakdowns of specific parts as well as unmapped SDK paths.