On June 24, 2026, OpenAI and chipmaker Broadcom announced Jalapeño, OpenAI's first custom AI inference chip. The chip is designed specifically to run large language models like GPT-4 and future models. This makes OpenAI less dependent on chips from NVIDIA.
Jalapeño was developed in just nine months, which experts say is the fastest any ASIC chip of this complexity has ever been built. ASIC stands for Application-Specific Integrated Circuit, meaning the chip does one job extremely well instead of being general-purpose.
The chip focuses on inference, which is the step where an AI answers your question. Training, which is how an AI learns, requires different hardware. By building an inference chip, OpenAI can serve users more cheaply and with less electricity.
OpenAI expects to deploy Jalapeño in its data centers by late 2026. The chip could allow OpenAI to reduce its expensive NVIDIA GPU orders and lower its operating costs significantly in the years ahead.
OpenAI and semiconductor giant Broadcom jointly announced Jalapeño on June 24, 2026 -- OpenAI's inaugural custom silicon designed exclusively for large language model inference. The partnership marks OpenAI's strategic move to reduce its overwhelming dependence on NVIDIA hardware, which has constrained both supply and cost as AI demand has surged.
The chip's nine-month development cycle is extraordinary by semiconductor industry standards; comparable ASIC designs typically require 18 to 36 months. Broadcom's expertise in custom silicon -- it already designs chips for Google's TPU line and Apple's neural engines -- was critical in compressing the timeline. Jalapeño is optimized for transformer-based inference workloads, the mathematical backbone of models like GPT-4o and its successors.
Industry analysts note that Jalapeño's design reflects a structural split in AI computing: training remains the domain of power-hungry GPU clusters, while inference -- the real-time serving of responses to end users -- benefits from narrower, more efficient ASICs. By targeting inference, OpenAI can reduce the cost per query and lower its enormous electricity consumption without sacrificing model capability.
The financial implications are significant. OpenAI reportedly spends billions annually on NVIDIA GPUs. If Jalapeño performs as claimed -- delivering substantially better performance per watt for inference tasks -- the company could redirect that capital toward model research, potentially accelerating its competitive position against Google DeepMind and Anthropic.
The June 24, 2026 unveiling of Jalapeño -- OpenAI's inaugural custom ASIC, co-developed with Broadcom -- represents the most consequential silicon bet in the company's history. The chip is purpose-built for large language model inference, a workload that differs fundamentally from training in its latency sensitivity, memory access patterns, and power envelope requirements. By offloading inference from general-purpose NVIDIA H100/H200 GPUs to a domain-specific architecture, OpenAI is betting it can serve its growing user base more economically than any rival dependent on commodity GPU supply chains.
What makes Jalapeño technically notable is its development velocity: nine months from architecture sign-off to tape-out, a timeline that rivals even Apple's tightly controlled A-series silicon cycles. Broadcom's CoWoS packaging expertise, combined with its established TSMC shuttle relationships, allowed OpenAI to skip the typical foundry queue that plagues first-time chip designers. The chip is understood to employ a high-bandwidth memory architecture tuned for the key-value cache operations that dominate transformer inference, rather than the dense matrix multiplications that favor NVIDIA's tensor cores during training.
The strategic logic extends beyond cost. Designing proprietary silicon gives OpenAI control over the instruction set and memory hierarchy, enabling compiler-level optimizations specific to its own model architecture -- an advantage Google has leveraged with its TPU v5e and Meta is building toward with MTIA. Jalapeño also insulates OpenAI from the geopolitical risk embedded in NVIDIA's supply chain, where US export controls and TSMC capacity constraints have repeatedly created bottlenecks. Owning the inference stack end-to-end is as much a supply-chain resilience play as a unit-economics play.
The competitive implications ripple outward. NVIDIA's inference revenue -- a fast-growing segment that now accounts for a meaningful share of its data-center division -- faces structural pressure as hyperscalers and frontier AI labs alike pursue vertical integration. If Jalapeño's performance-per-watt claims hold at scale, it will accelerate the bifurcation of the AI compute market into a training tier still dominated by GPU clusters and an inference tier increasingly contested by bespoke ASICs from Google, Amazon, Microsoft, and now OpenAI.
OpenAI and chipmaker Broadcom announced Jalapeño on June 24, 2026 -- OpenAI's first custom silicon chip, built to run large language models more efficiently than general-purpose NVIDIA GPUs. Developed in just nine months, the chip targets AI inference rather than training and is set for deployment in OpenAI's data centers by late 2026, potentially saving the company billions in annual GPU costs.
OpenAI and Broadcom made a big announcement on June 24, 2026. They built a brand-new computer chip together. The chip is called Jalapeño. It is the first chip that OpenAI has ever made.
Jalapeño was built to run AI programs called language models. Language models are programs that read and write text. ChatGPT is one example of a language model. The chip helps these programs work faster and use less power.
It took only nine months to build Jalapeño. That is very fast for a chip. Most chips like this take two or three years. OpenAI plans to start using Jalapeño at the end of 2026.
Right now, OpenAI uses chips made by NVIDIA. Jalapeño would let OpenAI need fewer of those chips. Having its own chip could save OpenAI a lot of money in the future.
1What is the name of the new chip?
2Who built Jalapeño together?
3What kind of programs is Jalapeño designed to run?
4How long did it take to build Jalapeño?
5When does OpenAI plan to start using Jalapeño?
6Jalapeño was made by NVIDIA.
7Jalapeño is OpenAI's first custom chip.
8Language models write and read text.
9Building Jalapeño took five years.
10OpenAI currently uses NVIDIA chips.
11The new chip is called ___.
12Jalapeño was built to run AI ___ models.
13OpenAI partnered with ___ to make the chip.