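Qualcomm CEO Cristiano Amon says that because agentic AI carries out autonomous tasks such as planning, tool calling, and verification, a single request may consume 10 to 50 times or more the tokens of an ordinary answer, so AI will need "gazillions" of tokens. Goldman Sachs projects that token usage by AI agents will grow 24-fold by 2030, possibly reaching 120 quadrillion tokens per month. Meanwhile, inference costs are expected to fall by 60% to 70% per year. This suggests that software metering may shift from per-seat or per-click pricing toward pricing based mainly on machine inference and token consumption; companies such as Uber and Microsoft are already reassessing the high cost of agent usage.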
New video of Qualcomm CEO Cristiano Amon: AI will require "gazillions" of tokens.
The reason: agentic AI will consume dramatically more tokens because it performs autonomous tasks, uses multiple systems, and interacts with tools.
AI demand will grow hugely when software starts letting agents act, not just answer.
A chatbot spends tokens on language; an agent spends tokens on deciding, checking, calling tools, reading outputs, revising plans, and coordinating with other software.
Today a single human-AI exchange may be large, and a reasoning task may be much larger, but we are already entering the agentic era, where an autonomous workflow can be many times larger still because the model is no longer producing just one response.
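To make the chatbot-versus-agent difference concrete, here is a rough Python sketch of the token accounting. It assumes a simple agent loop in which every step re-reads the whole accumulated context before producing output, and it ignores prompt caching and any price difference between input and output tokens. The names `Step`, `chat_tokens`, and `agent_tokens`, and every number in the workload, are made up for illustration and do not come from Amon or Goldman Sachs.

```python
from dataclasses import dataclass


@dataclass
class Step:
    name: str
    output_tokens: int            # tokens the model generates at this step
    tool_result_tokens: int = 0   # tool output appended to the context afterwards


def chat_tokens(prompt_tokens: int, answer_tokens: int) -> int:
    """One request, one response: the prompt is read once."""
    return prompt_tokens + answer_tokens


def agent_tokens(prompt_tokens: int, steps: list[Step]) -> int:
    """Each step re-reads the accumulated context, then appends to it."""
    context = prompt_tokens
    total = 0
    for step in steps:
        # Tokens processed this step: the full context in, the new output out.
        total += context + step.output_tokens
        # The output and any tool result become context for the next step.
        context += step.output_tokens + step.tool_result_tokens
    return total


# Hypothetical workload: all figures are illustrative assumptions.
steps = [
    Step("plan", 200),
    Step("call tool", 100, tool_result_tokens=1500),
    Step("read output and revise plan", 200),
    Step("call second tool", 100, tool_result_tokens=1500),
    Step("verify result", 200),
    Step("final answer", 300),
]

chat = chat_tokens(prompt_tokens=500, answer_tokens=300)
agent = agent_tokens(prompt_tokens=500, steps=steps)
print(chat, agent, round(agent / chat, 1))
```

With these made-up numbers the chat exchange processes 800 tokens and the six-step agent run processes 15,500, roughly 19 times as many. Most of the gap comes from re-reading a growing context at every step, not from the length of the final answer.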