# FuriosaAI 與 Broadcom 合作打造搭載次世代 HBM4/E 記憶體的高效能 AI 加速晶片。

*semiconductor, genai · news · 2026-05-27 · Wccftech*

## Key points

- FuriosaAI 與 Broadcom 正合作開發支援 HBM4/E 記憶體的 2nm AI 加速器。
- 該晶片將配備 12 個 HBM4/E 記憶體模組與兩個運算晶片組，總記憶體容量達 432 GB。
- FuriosaAI 著重頻寬而非執行緒管理，宣稱其代幣處理量高於 GPU。
- 晶片採用先進封裝技術整合多顆晶片，目標為機架級 AI 叢集部署。
- 第三代加速器將於 2028 年上半年開始提供資料中心客戶取樣。

FuriosaAI and Broadcom have partnered to build a high-performance AI accelerator chip featuring next-gen HBM4/E memory. FuriosaAI's Next-Gen AI Accelerator Features 2nm Chiplet Architecture, HBM4/E Memory Support for Massive AI Compute Clusters FuriosaAI has announced its third-generation AI accelerator, which builds upon its 2nd Generation RNGD platform, which is currently in mass production on TSMC's 5nm process technology. The 2nd Gen RNGD AI platform comes in the form of a 180W PCIe-based design, which aims at LLM & Agentic AI workloads. The next-generation design is going to go all-in on the AI inference segment as Agentic AI continues to see huge demand. The third-generation AI accelerator from FuriosaAI has the following highlights: The platform pairs 2nm compute technology with HBM4/4E memory, designed to enable high-bandwidth, rack-scale networking across massive AI compute clusters. The architecture is optimized for demanding inference workloads with a focus on high-bandwidth data movement that delivers higher performance-per-watt and greater token density than even the most efficient GPUs. It builds on Furiosa’s current-generation RNGD chip, now in mass production. Customers include Samsung SDS and LG AI Research. Starting with some of the details shared by FuriosaAI, the chip platform will utilize an advanced 2nm compute die and HBM4/E memory standard. The firm is working with Broadcom to harness advanced packaging capabilities, allowing them to integrate multiple silicon dies into a singular & performant AI chip (System-on-chip). In the teaser shot, the company shows the 3rd Gen AI chip with 12 HBM4/E memory sites, two massive compute chiplets (2nm), and two IO controllers. That rounds up to 432 GB if Furiosa uses 12-Hi 36 GB per stack memory modules. Besides the compute architecture, FuriosaAI will also leverage Broadcom's Ethernet and PCIe IPs, allowing higher bandwidth, rack-scale networking across massive AI compute clusters. The AI chip is optimized for demanding real-world AI workloads such as post-training sampling, and high bandwidth is a key focus & that's why the company is going with the latest HBM4/E standards. The company claims that its focus on bandwidth rather than thread management (required by GPUs) will help it deliver higher efficiency and higher token throughput than modern GPU designs. Furthermore, the company is saying that its software stack allows developers to deploy new AI models quickly while meeting throughput and latency requirements. Furiosa’s SDK leverages a general compiler that automatically maps high-level PyTorch code to silicon. For developers requiring more granular control, Furiosa’s Virtual ISA offers a declarative programming model that provides hardware control without the nondeterministic complexity of traditional GPU programming. “Bringing together Broadcom’s infrastructure capabilities and Furiosa’s Tensor Contraction Processor architecture and its industry-defining software stack allows us to move beyond the chip level and deliver a comprehensive solution for the token factory era,” said Furiosa Cofounder and CEO June Paik. As for availability, the 3rd Gen FuriosaAI accelerator is expected to begin sampling by the first half of 2028 and will be ready to meet compute requirements for next-gen AI data centers.

**Companies:** Broadcom, FuriosaAI

[Read the full story on Wccftech](https://wccftech.com/furiosaai-ditches-gpu-playbook-for-2nm-broadcom-built-inference-chip-claims-hbm4-e-bandwidth-beats-gpus/)

---

Canonical: https://newsio.io/zh-TW/n/ea659751-2401-4910-933b-de3b4daf8bc1/furiosaai-broadcom-hbm4e-ai-agentic-ai-ai
Summarized by Newsio from Wccftech. https://newsio.io/how-it-works
