FuriosaAI and Broadcom Target the Next Layer of AI Inference Infrastructure

BySendTech Times Chips & Compute DeskNewsroom-edited, source-reviewed coverage|Source: Aitimes

Newsroom brief

FuriosaAI said it will work with Broadcom on a next-generation AI inference platform built around its TCP architecture and Broadcom networking and packaging technologies. The planned third-generation accelerator will use a 2-nanometer compute die, HBM4/HBM4E memory and multi-die packaging, with sampling planned for the first half of 2028. The deal points to AI infrastructure competition shifting from single-chip performance toward memory, networking, power efficiency and rack-level system design.

Verified against source materialEdited by SendTech Times Chips & Compute Desk

FuriosaAI and Broadcom Target the Next Layer of AI Inference Infrastructure

Image source: 인공지능신문

What happened

FuriosaAI said it has formed a strategic partnership with Broadcom to co-develop a next-generation AI inference platform.

The Korean AI chip company said the project will move its Tensor Contraction Processor architecture into a multi-die chiplet system for hyperscale AI environments where token-processing demand is rising.

The planned platform combines FuriosaAI architecture with Broadcom AI networking, high-bandwidth Ethernet switching and advanced packaging.

The source says the goal is an integrated AI computing, networking and software platform for large inference clusters, not only a standalone accelerator.

The work builds on FuriosaAI RNGD, or Renegade, accelerator.

The source describes RNGD as a 180W PCIe AI accelerator in mass production using TSMC 5-nanometer process and SK hynix HBM3, optimized for large language model and agentic AI workloads.

FuriosaAI said it has been validated in customer environments including Samsung SDS and LG AI Research.

Why it matters

The announcement points to AI infrastructure competition moving beyond single-chip performance.

If inference demand keeps expanding, buyers may place more weight on memory bandwidth, interconnect, rack-level networking and power efficiency.

That matters for Korean AI semiconductor companies because it puts system design and global infrastructure partnerships at the center of the market.

FuriosaAI is positioning its TCP architecture alongside Broadcom networking and packaging assets to address bottlenecks in large agentic AI deployments.

Who is affected

The most direct audience is hyperscale AI infrastructure buyers, cloud providers and enterprises planning larger inference workloads.

It also matters for AI chip startups trying to compete in markets shaped by GPU-based infrastructure.

For Korean technology readers, the signal is that a domestic AI semiconductor company is working with a global chip and networking supplier on a platform aimed at frontier model and agentic AI inference.

What to watch next

FuriosaAI said the third-generation accelerator will use a 2-nanometer compute die, HBM4 and HBM4E memory, and Broadcom packaging to combine multiple silicon dies into one high-performance chip.

The companies plan to begin sampling in the first half of 2028.

Readers should watch whether the partnership moves from architecture plans to working silicon, whether customer adoption follows current RNGD deployments, and whether rack-scale networking becomes a clearer differentiator in AI inference infrastructure.

#FuriosaAI #Broadcom #AI inference #AI chips

Chips & Semiconductors

Korean NPU Makers Target Inference Niches as Nvidia Dominance Deepens

Executives from Rebellions, FuriosaAI and Mobilint said Korean NPU vendors see openings in inference, power efficiency and total cost despite Nvidia technical advantages. The panel highlighted Nvidia’s Groq deal, software ecosystems, interconnects and packaging as the main competitive barriers for domestic AI chip firms. Rebellions and FuriosaAI are focused on data-center inference, while Mobilint is positioning around edge and on-device AI where power and cost limits are tighter.

Chips & Semiconductors

Arm and Supermicro Put Agentic AI Servers to a CPU Test

Supermicro has introduced new server platforms built around Arm’s AGI CPU for inference-heavy and agentic AI workloads across cloud, enterprise and edge deployments. Arm says the AGI CPU includes up to 136 Arm Neoverse V3 cores, 12 DDR5 memory channels running at up to 8800 MT/s and PCIe Gen6 connectivity within a 300W power envelope. The key test is whether operators can use these CPU-heavy designs to add inference capacity without creating new pressure on power and cooling.

Chips & Semiconductors

Samsung Sets HBM And Server SSD Efficiency Targets With Fab Water Gap Still Open

Samsung’s 2026 Sustainability Report links lower-power HBM and server SSD targets to AI infrastructure, while its semiconductor division still has a 2050 net-zero timetable and limited product-level commercial detail.

Chips & Semiconductors

Marvell Teralynx T100 Puts AI Data-Center Switching Into the Chip Race

Marvell announced planned availability of its Teralynx T100 switch chip for AI training and inference infrastructure. The 102.4 Tbps chip is built on a 3nm process, supports up to a 512-port radix and is claimed to use 25 percent lower power than competitive solutions. The practical question is whether data-center customers use lower-power, high-radix switching to ease latency and power constraints in larger AI clusters.

Chips & Semiconductors

SK hynix Expert Frames AI Chip Race Around Power, Water And Megafabs

An SK hynix Newsroom expert column argues that AI semiconductor competition is moving beyond faster chips toward megafab capital, grid delivery, industrial water and memory supply stability.

Chips & Semiconductors

FuriosaAI Starts RNGD Accelerator Deployment At Equinix Lisbon Datacenter

FuriosaAI has begun deploying its RNGD AI accelerators at Equinix’s LS2 datacenter in Lisbon as the South Korean chip startup looks for European sovereign AI demand. The company describes 48 GB of HBM3, 1.5 TB/s memory bandwidth and 512 teraFLOPS of dense FP8 performance per card, but its Broadcom-linked next-generation accelerator still depends on HBM4 and HBM4e timing.