OpenAI and Broadcom unveil Jalapeño chip for scalable LLM inference

SiliconFeed EditorialJune 25, 2026

AI chips OpenAI Broadcom LLM inference data centers semiconductors

Sections and tags — in the Topics menu Search the feed

At a glance:

OpenAI and Broadcom have developed Jalapeño, an ASIC optimized for large language model inference in data centers.
The nine-month project marks the first generation of a long-term roadmap for specialized AI chips.
Early testing suggests Jalapeño delivers substantially better performance per watt than current state-of-the-art systems.

A targeted approach to AI acceleration

The Jalapeño chip represents a shift toward application-specific hardware for AI workloads. Unlike general-purpose processors or even existing GPUs and TPUs, the ASIC was designed from scratch to handle the unique demands of large language model inference. This specialization stems from close collaboration between Broadcom and OpenAI, with the chip's architecture informed by OpenAI's internal roadmap for future models and products.

The nine-month development cycle underscores the urgency around optimizing AI infrastructure. As demand for LLM-powered applications continues to surge, data center operators are under pressure to balance performance with energy efficiency. Jalapeño aims to address both challenges by focusing on the specific computational patterns that emerge during inference—the phase where trained models generate outputs rather than learn.

Beyond the first generation

While Jalapeño is positioned as a first-generation product, both companies frame it as part of an ongoing effort. They emphasize that future iterations will continue refining the architecture based on evolving AI requirements. This approach mirrors how other tech giants develop custom silicon, iterating through multiple generations to improve efficiency and performance.

The emphasis on long-term development suggests OpenAI is preparing for increasingly complex models that may outstrip the capabilities of existing hardware. By partnering with Broadcom—a company with deep experience in semiconductor design—OpenAI gains access to specialized expertise while reducing reliance on third-party accelerators.

Performance promises and next steps

OpenAI's early testing indicates Jalapeño outperforms current state-of-the-art systems in performance per watt, a critical metric for data center operations. However, the company notes that comprehensive benchmarking is still underway and a detailed technical report will follow in coming months. This cautious approach reflects the complexity of evaluating AI hardware, where real-world performance can vary significantly across different workloads.

The chip's deployment in large data centers could reshape how LLMs are served at scale. If Jalapeño meets its performance targets, it may accelerate the adoption of specialized AI hardware across the industry, potentially influencing how other companies approach similar challenges.

Industry implications

The partnership between OpenAI and Broadcom highlights a broader trend: leading AI labs are moving beyond off-the-shelf hardware to develop custom solutions. Google's TPUs, Amazon's Trainium and Inferentia chips, and Meta's various AI accelerators all point to the same direction. Jalapeño enters this competitive landscape as a potential option for organizations seeking to optimize their AI infrastructure.

For Broadcom, the project reinforces its position in the semiconductor market beyond traditional networking and connectivity products. The company has been expanding into AI-focused silicon, and Jalapeño represents a significant milestone in that strategy.

Looking ahead

As AI models grow in size and complexity, the gap between specialized and general-purpose hardware is expected to widen. Jalapeño's development signals that OpenAI is preparing for this future by investing in the underlying infrastructure that powers its services. The success of this first generation will likely influence whether other AI companies pursue similar partnerships or develop their own custom chips.

The coming months will reveal whether Jalapeño can deliver on its performance promises and how quickly it can be integrated into production systems. For now, the announcement marks a notable step in the evolution of AI hardware, one that could have lasting implications for how large language models are deployed and scaled.

Editorial SiliconFeed is an automated feed: facts are checked against sources; copy is normalized and lightly edited for readers.

FAQ

What is the Jalapeño chip and who developed it?

Jalapeño is an Application-Specific Integrated Circuit (ASIC) designed for large language model inference, developed jointly by OpenAI and Broadcom. It was built from scratch based on insights from OpenAI's research teams and roadmap for future AI models.

How does Jalapeño compare to existing AI hardware?

According to early testing by OpenAI, Jalapeño delivers substantially better performance per watt than current state-of-the-art systems used in data centers. However, comprehensive benchmarking is still ongoing, and detailed results will be shared in coming months.

When will Jalapeño be available and what are the future plans?

Jalapeño is the first generation in a long-term project, with future chip generations planned to be refined over time. The initial design and production took nine months, and it is intended for deployment in large data centers, though specific availability dates have not been announced.

More in the feed

Prepared by the editorial stack from public data and external sources.

Original article