OpenAI与Broadcom合作推出首款自研AI芯片Jalapeño(ASIC),专为ChatGPT、Codex、API及未来AI智能体产品的LLM工作负载设计。在已知工作负载下,Jalapeño比NVIDIA GPU更便宜、更快,通过减少数据移动、均衡计算/内存/网络资源实现更接近理论峰值的实际利用率,能效更优。该芯片从设计到流片仅用9个月,OpenAI自己的模型加速了部分设计工作。这标志着OpenAI从购买算力转向构建完整堆栈(模型、软件、服务器、网络、芯片)的战略转变。
OpenAI rolls out its 1st chip through a Broadcom tie-up as part of its "build the full stack" push.
Jalapeño is an ASIC, so it is less flexible than an Nvidia GPU, but can be cheaper and faster when the workload is known very well.
They say "the architecture reduces data movement and balances compute, memory, and networking resources to achieve realized utilization much closer to theoretical peak performance."
Overall better performance per watt.
Jalapeño also signals OpenAI's shift from buying compute to shaping the whole stack: models, software, servers, networks, and now silicon.