# OpenAI Unveils Jalapeño, Its First Custom AI Chip, Designed for LLM Inference

- Source: Chubby♨️ (@kimmonismus)
- Published: 2026-06-24 21:19
- AIHOT score: 60
- AIHOT link: https://aihot.virxact.com/items/cmqs42it7000kslv64ag3jta5
- Original link: https://x.com/kimmonismus/status/2069772454591934778

## AI Summary

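OpenAI has introduced Jalapeño, its first custom AI chip, built with Broadcom and Celestica and optimized for the workloads of ChatGPT, Codex, the API, and future agentic products. Early samples are already running ML workloads in the lab at target frequency and power, including GPT-5.3-Codex-Spark. OpenAI says performance per watt should be substantially better than the current state of the art, with detailed benchmarks to be published later. Deployment is planned to start by the end of 2026. The move aims to reduce dependence on external GPUs, increase control over compute economics, and strengthen the flywheel between models, products, revenue, and infrastructure.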

## Full Text

OpenAI just unveiled Jalapeño, its first custom AI chip designed from scratch for LLM inference.

It is OpenAI moving deeper into the full stack: chips, kernels, memory, networking, racks, scheduling, deployment and product experience.

OpenAI has learned from its Cerebras deal what is valuable in specialized inference hardware and is now attempting to translate that lesson into its own controllable platform.

Built with Broadcom and Celestica, Jalapeño is optimized around the workloads OpenAI actually runs across ChatGPT, Codex, the API and future agentic products.

Early samples are already running ML workloads in the lab at target frequency and power, including GPT-5.3-Codex-Spark. OpenAI says performance per watt should be substantially better than the current state of the art, with detailed benchmarks coming later!

The strategic angle is obvious: less dependence on external GPUs, more control over compute economics, and a stronger flywheel between models, products, revenue and infrastructure.

Deployment is planned to start by the end of 2026.

### Quoted Tweet

> OpenAI: https://openai.com/index/openai-broadcom-jalapeno-inference-chip/
