AI 替代浪潮：三大力量重塑成本结构

2026-06-07 08:00·26天前

精选理由

Tunguz 用 Coinbase、Lindy 等真实案例，把「用开源/便宜模型替代昂贵前沿模型」的趋势讲透了，做 AI 应用的人该重新算一下单位经济账。

AI 摘要

三大力量重塑 AI 成本：前沿闭源模型持续涨价，开源模型在多数场景已足够好，买家开始替代。Coinbase 将提示词路由至更便宜模型，成本持平但 token 用量指数增长。Lindy 全切至 DeepSeek v4，节省数百万美元且多项核心性能提升。Harvey 在 Legal Agent Benchmark 上通过 SFT 使 Kimi 2.6 all-pass 率达 15%，超越 Opus 的 14%，同一 100 任务成本 $84 vs $954（约 11 倍价差）。Cursor 后训练 Kimi K2.5 得到 Composer 2.5，称其“性能优异且效率高达同类模型 10 倍”。闭源越来越贵，开源平价且性能接近，选择决定企业单位经济学的斜率。

原文 · 未翻译

Three forces are reshaping the AI cost structure :

Foundation labs are moving up the stack into applications,¹ ²
Frontier model prices keep rising for the smartest models,³
Open-source models have crossed the good enough threshold for most use cases.⁴ ⁵

The natural response from AI buyers is substitution.

Coinbase⁶ :

At Coinbase we’re working hot on routing prompts to cheaper models where appropriate, & in some cases have been able to keep costs roughly flat, while token usage continues to grow exponentially.

Lindy⁷ :

Pulled the trigger today & switched 100% of Lindy traffic to DeepSeek v4, churning from Anthropic models. Saves us millions of $ & we’re actually seeing an increase in performance on many core use cases. Transformative for the business.

Harvey⁸ :

On a 100-task slice of our Legal Agent Benchmark (LAB), SFT moved Kimi 2.6’s all-pass rate from 11% to 15%, beating Opus’ 14%. But the cost gap was even more striking : $84 vs $954 across the same 100 tasks, or ~11x cheaper.

Cursor went further. They post-trained Kimi K2.5 into their own production model, Composer.⁹

Composer 2.5 is exceptionally intelligent & up to 10x more efficient than similarly capable models.

Coinbase’s quote shows where the savings go : costs flat, tokens exponential. Buyers don’t pocket the discount — they spend it on more intelligence.

Closed models are getting more expensive at the frontier; open models are getting cheaper at parity. The choice is which slope you want under your unit economics.

Ramp cost curve framing for AI buyers and app purveyors

Tomer Tunguz 博客（VC 分析）

精选56导出 Markdown