The Unbearable Cheapness of Open Weight Models
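Source: jamesoclaire.com. DeepSeek V4 is priced extremely low: on token fees alone it is nearly 50 times cheaper than the frontier models from Anthropic and OpenAI, before counting the extra tokens that thinking models consume. The article argues that those companies are stuck with high costs and would struggle to cut prices 20-50x to compete. Whether open weight models are cheap because of broad hardware stress testing or because they are offered as loss leaders remains an open question. The author worries that the US may use fear of China to push restrictions on open weight models. Google released Gemma 4 in April 2026, Meta's Llama has had no new release, OpenAI's last open weight release was its GPT models in 2025, and Anthropic has never released one. Truly open source models (including the training data pipeline) are gaining popularity, but their data cutoff is December 2024. The US NSF has partnered with Nvidia to support Allen AI in developing fully open source AI.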
Today I was setting up Hermes to see how it does with web research. I chose DeepSeek V4 because I know it is cheap, but seeing its pricing next to Anthropic and OpenAI ‘frontier’ models is crazy. The frontier models cost nearly 50x more based on token prices alone, let alone how much pondering any of their models might fall into (using more tokens for the same task).
What worries me about this is that Anthropic and OpenAI seem to have backed themselves into a corner of high costs. Can they reasonably decrease their prices by 20-50x to compete with DeepSeek or Xiaomi’s Mimo?
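To make the arithmetic behind that gap concrete, here is a minimal sketch of how a per-token price ratio combines with extra token usage. The function name and every number in it are hypothetical placeholders chosen for illustration, not real price quotes from any provider.

```python
def effective_cost_ratio(price_a, price_b, tokens_a=1.0, tokens_b=1.0):
    """Return how many times more model A costs than model B for the same task.

    price_a, price_b: price per million tokens (any currency, same unit for both).
    tokens_a, tokens_b: relative number of tokens each model uses on the task.
    """
    if price_b <= 0 or tokens_b <= 0:
        raise ValueError("price_b and tokens_b must be positive")
    return (price_a * tokens_a) / (price_b * tokens_b)


# Hypothetical placeholder prices per million tokens, NOT real quotes.
frontier_price = 25.0
cheap_price = 0.5

# Same token usage: the gap is just the price ratio.
print(effective_cost_ratio(frontier_price, cheap_price))  # 50.0

# Assumed case: the pricier model also "ponders" and uses 2x the tokens.
print(effective_cost_ratio(frontier_price, cheap_price, tokens_a=2.0))  # 100.0
```

The point of the sketch is only that the two effects multiply: a model that is both pricier per token and more talkative ends up further away than the headline price ratio suggests.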
Open Weight vs Low Cost
Are these models cheap because they are open weight, and having hundreds of people stress test them on different hardware helped lower the cost of running them? Or are they being offered as loss leaders to drive prices down?
How do you keep prices high for commodity products?
You manufacture scarcity. You sell luxury and premium branding. This is what OpenAI and Anthropic seem to be doing by gating ‘frontier’ model usage behind higher walls.
This is how luxury brands have sold cars and handbags forever. They are clubs and status symbols for the rich and not meant to be widely distributed.
Will Anthropic & OpenAI lean on China fears to push bans on open weight models?
This has been my fear for a few months now, and each week that goes by seems to support it. How do you manufacture scarcity? One easy way is to fearmonger and get the government to help restrict access to the competition.
Why not compete?
The US used to be such a champion of open source, and I would hope that serious open source competition can come out of the US to prove that open weight and open source models are ultimately the future.
- Google Gemma 4 was released in April 2026
- Meta has Llama, which hasn’t had a new release
- OpenAI last released open weight GPT models in 2025
- Anthropic to my knowledge has never released any open weight model
True Open Source vs Open Weight
I think the leapfrog scenario for Open Source will be the true Open Source models, where the data pipeline for training is also open sourced.
https://allenai.org/olmo -> You can download these models now and they’re seeing increasing popularity. That being said, they are a bit out of date, with data cutoffs in Dec 2024.
Looking to the future, the US NSF partnered with Nvidia to enable Allen AI to develop a true fully open AI:
https://www.nsf.gov/news/nsf-nvidia-partnership-enables-ai2-develop-fully-open-ai
Bonus:
Curious to dig more into Claude / ChatGPT tech stacks? Check out the tools they used to build their iOS and Android apps:
Claude Android
ChatGPT Android
You can navigate to SDKs to view even more detailed breakdowns of specific parts as well as unmapped SDK paths.
