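Running LLMs locally on Apple Silicon costs more than cloud APIs
An analysis on williamangel.net shows that running large language models such as Llama 3.1 405B locally on Apple Silicon chips (such as the M2 Ultra) costs more than using cloud API services such as OpenRouter. Specifically, running locally costs about $0.73 per million tokens, versus only $0.59 through OpenRouter, roughly 24% more. This suggests that for large-model inference, cloud services may currently be more economical than high-end local hardware.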
Tags: LLMs, Local LLMs, Apple Silicon, OpenRouter, AI Inference, Cost Analysis
Offline Agentic Coding part 3: Apple Silicon costs more than OpenRouter.
Published 2026-05-17
Apple silicon costs more than OpenRouter.
At ~50-100 watts under load and ~$0.20 per kWh, my M5 MacBook Pro will cost a few cents per hour in electricity. Accelerated depreciation (if any) from shortening the lifespan of the device will be more expensive than the electricity. At a few tens of tokens per second, this works out to amortized costs of ~$1.50 per million tokens. OpenRouter for comparable models is about one-third the price and ~2x the speed.
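The electricity part of that estimate can be checked with a few lines of arithmetic. This is a minimal sketch, not the calculation behind the ~$1.50 figure. The 50-100 W and $0.20 per kWh inputs come from the paragraph above. The 30 tokens per second rate is an assumed stand-in for "a few tens", and the function names are made up for illustration. Depreciation is left out because the post gives no number for it.

```python
def electricity_cost_per_hour(watts: float, usd_per_kwh: float) -> float:
    """Cost in USD of running a device at `watts` for one hour."""
    return watts / 1000.0 * usd_per_kwh


def cost_per_million_tokens(usd_per_hour: float, tokens_per_second: float) -> float:
    """Convert an hourly running cost into USD per million generated tokens."""
    tokens_per_hour = tokens_per_second * 3600.0
    return usd_per_hour / tokens_per_hour * 1_000_000


USD_PER_KWH = 0.20        # from the text
TOKENS_PER_SECOND = 30.0  # assumption: a stand-in for "a few tens of tokens per second"

for watts in (50.0, 100.0):  # load range from the text
    hourly = electricity_cost_per_hour(watts, USD_PER_KWH)
    per_million = cost_per_million_tokens(hourly, TOKENS_PER_SECOND)
    print(f"{watts:.0f} W: ${hourly:.3f}/hour, ${per_million:.2f} per million tokens (electricity only)")
```

Under these assumptions, electricity alone comes to roughly $0.09 to $0.19 per million tokens. That is well under the amortized total, which fits the point that depreciation, not power, dominates the cost.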
Electricity