OpenRouter 上美国模型 token 使用份额在一年内从约 70% 降至约 30%。UBS 调查显示,60% 关注 AI 预算的公司正转向更便宜模型和开源中国模型,主因是极端账单:用户月花费高达 3.5 万美元、团队超配额 200%、公司从 5 个内部 AI 工具削减至 2 个。企业采用模型路由策略,将简单任务交给低成本模型,保留高级模型用于复杂推理、代码和长上下文任务。中国开源模型 Qwen、DeepSeek、MiniMax、GLM、Kimi 因可本地运行或通过云目录使用,契合企业成本曲线。
"the share of tokens used for US models on OpenRouter has collapsed" Bloomberg
On OpenRouter, US model token share fell from around 70% to around 30% in a year.
A clean warning signal.
open-weight Chinese models are becoming capable enough for routine work, cheaper to run, easier to customize, and less dependent on permission from a frontier lab.