News
CAPACITY TEST:

Qualcomm AI250 Stacks DRAM Over Compute But Leaves FLOPS Undisclosed

Newsroom brief

Qualcomm is pitching high-bandwidth compute for AI inference, with AI250 cards claiming 768 GB of memory and 133 TB/s of effective bandwidth, but the company has not disclosed peak FLOPS or named customers.

Verified against source materialEdited by SendTech Times Chips & Compute Desk
Qualcomm AI250 Stacks DRAM Over Compute But Leaves FLOPS Undisclosed
Image source: The Register

Qualcomm Moves AI250 Memory Closer To Compute

Qualcomm is using its AI250 accelerator roadmap to push a different answer to the AI inference memory bottleneck.

The company describes high-bandwidth compute, or HBC, as a 3D-stacked design that places DRAM above logic so some work can happen closer to memory.

The AI250 is due to follow the AI200 Dragonfly rack systems and is planned to begin shipping in 2027.

Qualcomm also outlined a second-generation HBC platform, the AI300, for 2028.

Qualcomm says the AI250 card will carry 768 GB of memory and up to 133 TB/s of effective memory bandwidth.

The company ties those claims to bandwidth-bound inference work, especially decode, where model weights are streamed from memory during token generation.

Effective Bandwidth Claims Need More Detail

The company is presenting HBC as a way to reduce data movement between memory and compute.

Qualcomm says the architecture uses LPDDR memory in a purpose-built near-memory design and differs from HBM because HBC does computing in the base logic die.

The bandwidth claims still depend on Qualcomm's definition of effective bandwidth.

For the AI200 generation, Qualcomm had cited 414 TB/s of effective memory bandwidth across 56 chips.

The AI250 marketing material says HBC gives 18x the AI200's effective bandwidth, while the AI300 would reach 54x.

Qualcomm says the AI250 can operate as a standalone AI accelerator.

It also says the part can sit in disaggregated inference systems, with GPUs or other Qualcomm parts handling prompt processing and AI250 accelerators handling memory-intensive decode.

The company declined to give peak FLOPS for AI250.

It also did not give the detailed physical bandwidth calculation behind the headline effective-bandwidth figures, even as the disclosed figures indicate that ordinary LPDDR5x bandwidth would not explain the claimed totals by itself.

Modular Deal Targets The Software Gap

Qualcomm's investor-day push also included its planned acquisition of Modular, the AI software startup behind Mojo and the Max serving platform.

Mojo is positioned as a low-level programming interface that can run across different hardware, while Max targets LLM model serving.

AI accelerator buyers are comparing more than silicon specifications.

They need serving tools, developer support and deployment paths that do not lock every workload to one vendor stack.

Qualcomm is using Modular to address that software gap while Nvidia and AMD remain the main comparison points for AI infrastructure buyers.

The plan also assumes Qualcomm can make a heterogeneous inference model attractive.

The article describes a possible split where other chips handle prompt processing and AI250 systems focus on memory-intensive decode, but it does not identify production deployments using that design.

Qualcomm has not disclosed peak FLOPS for AI250, the detailed method behind its effective bandwidth calculation, named AI250 customers, production deployment dates beyond the 2027 target or whether regulators will clear the Modular acquisition this year.

Share this article
inXf

Related articles

More
Qualcomm Wins Meta CPU Agreement, But Production Waits Until 2028
Chips & Semiconductors

Qualcomm Wins Meta CPU Agreement, But Production Waits Until 2028

Qualcomm Technologies will supply data center CPUs for Meta under a multi-generation agreement, with the first Dragonfly C1000 production scheduled for the second half of 2028 and capacity terms still undisclosed.

Qualcomm Names Meta As First Dragonfly Data Center CPU Customer
Chips & Semiconductors

Qualcomm Names Meta As First Dragonfly Data Center CPU Customer

Qualcomm said Meta will use its Dragonfly C1000 data center CPU when production starts in 2028, while the chipmaker raised its fiscal 2029 non-handset revenue projection to $40 billion.

SK hynix Uses HPE Discover to Push AI Memory Beyond HBM
Chips & Semiconductors

SK hynix Uses HPE Discover to Push AI Memory Beyond HBM

SK hynix used HPE Discover 2026 in Las Vegas to showcase HBM, CMM-DDR5, eSSD and server DRAM products for AI infrastructure buyers. The company said HPE-certified products already deployed in HPE servers include PS1010 E3.S eSSDs based on 176-layer 4D NAND and 64GB DDR5 RDIMM modules built on 1c process technology. The clearest commercial point is HPE certification and supply; the booth display does not by itself show broader customer adoption.

Tencent’s Canghai V2 Chip Pushes Video Encoding Into Its Cloud Infrastructure Stack
Chips & Semiconductors

Tencent’s Canghai V2 Chip Pushes Video Encoding Into Its Cloud Infrastructure Stack

Tencent Cloud says its self-developed Canghai V2 video encoding chip has entered mass production after leading MSU hardware encoding benchmarks. The company is positioning the chip as a way to cut bandwidth and compute costs for AI video, live streaming and cloud media workloads. The next test is whether benchmark leadership turns into wider deployment across Tencent Cloud services and external customers.

Keep Reading

More Stories

Latest
US Lifts Anthropic Model Export Controls After Safeguards DealCapital & PolicyJul 1, 2026US Lifts Anthropic Model Export Controls After Safeguards DealThe Commerce Department is removing licence requirements for Anthropic’s Mythos and Fable models after a safeguards agreement, reopening foreign access while leaving jailbreak controls as the unresolved policy test.Rocket Lab Sets $8bn Iridium Deal As Satellite Network Test Awaits RegulatorsCapital & PolicyJul 1, 2026Rocket Lab Sets $8bn Iridium Deal As Satellite Network Test Awaits RegulatorsRocket Lab has agreed to buy Iridium Communications for about $8bn, pairing launch and spacecraft manufacturing with a satellite communications network that serves more than 2.55 million active subscribers.SEC Chair Says Tokenized Deposits Could Get Approval Next YearFintech & Digital PaymentsJul 1, 2026SEC Chair Says Tokenized Deposits Could Get Approval Next YearSEC Chair Paul Atkins said regulators could approve tokenized deposits as soon as next year, while also tying crypto rulemaking to bank-capital talks with the Fed, FDIC and OCC.UAE AI Authority Consolidates Data And Digital Government MandatesCapital & PolicyJul 1, 2026UAE AI Authority Consolidates Data And Digital Government MandatesThe UAE will establish an Artificial Intelligence and Data Authority to consolidate federal AI, public data and digital government functions, but the government has not disclosed a budget, staffing plan or implementation timetable.Aarogya Setu 2.0 Adds Google Gemma For India Health RecordsCapital & PolicyJul 1, 2026Aarogya Setu 2.0 Adds Google Gemma For India Health RecordsIndia launched Aarogya Setu 2.0 on June 29, 2026 as an AI-enabled personal health-record app using Google Gemma and a medical data toolkit, but public materials have not named an independent privacy audit or model-risk assessment.Saudi PIF Profit Doubles As Revenue Reaches $119.7 BillionCapital & PolicyJul 1, 2026Saudi PIF Profit Doubles As Revenue Reaches $119.7 BillionPIF’s 2025 annual results said net profit rose to SAR65.1 billion and revenue reached SAR449.9 billion, giving Saudi Arabia’s sovereign fund more liquidity for Vision 2030 investments without naming project-level allocations.US Energy Chief Tells Data Center Builders To Answer Power CriticsCloud & Data CentersJul 1, 2026US Energy Chief Tells Data Center Builders To Answer Power CriticsUS Energy Secretary Chris Wright used an AWS Summit appearance to tell data center supporters to engage critics over electricity costs and local opposition as AI construction raises power and permitting pressure.Open USD Consortium Draws Visa, Mastercard And Stripe BackingFintech & Digital PaymentsJul 1, 2026Open USD Consortium Draws Visa, Mastercard And Stripe BackingOpen Standard says more than 140 banks, fintechs, payment companies and crypto firms are backing Open USD, a dollar-backed stablecoin planned for later this year, but the group has not disclosed transaction volumes or customer commitments.AMD Linux Patch Adds Low-Power CPU Core Support For Future ChipsChips & SemiconductorsJul 1, 2026AMD Linux Patch Adds Low-Power CPU Core Support For Future ChipsAMD has submitted Linux kernel patches that add a low-power CPU core classification beside performance and efficiency cores, but the company has not disclosed the first processor, launch date or product line that will use the new core type.AWS Creates $1 Billion FDE Unit For Customer AI DeploymentsAIJul 1, 2026AWS Creates $1 Billion FDE Unit For Customer AI DeploymentsAmazon Web Services is investing $1 billion in a Forward Deployed Engineering unit that will place engineers inside customer projects, but AWS has not disclosed revenue targets or named paid deployment contracts for the new group.Zerodha Seeks SEBI Merchant Banking Licence As Brokerage Pressure BuildsCapital & PolicyJun 30, 2026Zerodha Seeks SEBI Merchant Banking Licence As Brokerage Pressure BuildsZerodha Corporate Advisors filed for a SEBI merchant banking licence on April 27, as tighter derivatives rules and lower account-maintenance fees pressure the broker’s core trading business.H-1B Ruling Leaves Tech Workers Weighing UAE And Canada MovesCapital & PolicyJun 30, 2026H-1B Ruling Leaves Tech Workers Weighing UAE And Canada MovesA June 8 court ruling struck down a $100,000 H-1B fee, but recruiters and workers say policy uncertainty is still pushing U.S. tech talent toward Canada, the U.K. and the UAE.