SendTech Times
News
CAPACITY TEST:

Korean NPU Makers Target Inference Niches as Nvidia Dominance Deepens

Article summary

Executives from Rebellions, FuriosaAI and Mobilint said Korean NPU vendors see openings in inference, power efficiency and total cost despite Nvidia technical advantages. The panel highlighted Nvidia’s Groq deal, software ecosystems, interconnects and packaging as the main competitive barriers for domestic AI chip firms. Rebellions and FuriosaAI are focused on data-center inference, while Mobilint is positioning around edge and on-device AI where power and cost limits are tighter.

Korean NPU Makers Target Inference Niches as Nvidia Dominance Deepens
Image source: 더에이아이

The Inference Market Signal

Korean NPU companies are treating Nvidia dominance as a market constraint rather than a reason to exit.

At an SAC 2026 panel, executives from Rebellions, FuriosaAI and Mobilint acknowledged a real technology gap with Nvidia, but argued that inference workloads, power limits and total cost of ownership are opening niches for domestic AI accelerators.

The discussion came as AI chip purchasing criteria are shifting.

The source says early benchmarks focused on raw operations and throughput, then token generation speed, while buyers now increasingly look at power efficiency and total cost.

That shift matters because large GPUs can push data centers toward expensive power and cooling upgrades, while edge devices often cannot accommodate high-power accelerators at all.

Nvidia Raises the Bar

The panelists also pointed to Nvidia strategy in inference.

The source says Nvidia struck a roughly 29 trillion won technology and talent licensing deal with Groq in December 2025, a move described as close to an acquisition because it absorbed key people and intellectual property.

Rebellions executive Kim Kwang-jung said the deal proved the inference market is real, but also raised the software benchmark for Korean NPU companies.

Kim said Rebellions is focusing on open-source AI frameworks, a wider software stack and chiplet-based networking and interconnect solutions.

He also cited an NPU deployment in an SK Telecom call reservation service as a sign that Korean accelerators are moving beyond proof of concept.

Different Paths for Korean NPUs

FuriosaAI vice president Cho Young-jin framed the technology challenge around architecture.

He expects attention and feed-forward layers in AI inference to become more separated, potentially creating a role for specialized hardware and different memory structures.

He said Nvidia is more than three years ahead in interconnect, packaging and system-level capabilities, but argued FuriosaAI has built a more mature software stack and will pursue positioning rather than direct frontal competition.

Mobilint is taking a different route.

Chief strategy officer Yoon Sang-hyun said the company is focused on edge and on-device markets, where performance, power and cost must be balanced together.

Mobilint began volume production of its Aries NPU in the second half of last year and is preparing commercialization of an AI SoC in the first half of this year.

What to Watch

The Korean NPU opportunity depends less on beating Nvidia everywhere and more on proving specific deployment economics.

Data-center players such as Rebellions and FuriosaAI need production customers, software compatibility and interconnect progress.

Edge-focused firms such as Mobilint need lightweight algorithms that let constrained hardware run transformer-era models without unacceptable accuracy losses.

Government support may help early commercialization, but the harder test is international adoption.

If Korean NPU vendors can convert domestic deployments into repeatable inference services, they will have a clearer path to compete in parts of the AI accelerator market where efficiency and localization matter more than GPU scale.

Share this article
inXf

Related articles

More
Nvidia's RTX Spark Turns AI PCs Into the Next Chip Battleground
Chips & Semiconductors

Nvidia's RTX Spark Turns AI PCs Into the Next Chip Battleground

Nvidia is entering the AI PC market with RTX Spark, a MediaTek-linked SoC that combines Blackwell GPU technology with a CPU on a single chip. The move shifts Nvidia's AI strategy closer to edge devices, where agentic AI could run locally instead of relying only on cloud infrastructure. Analysts cited in the source said the PC opportunity is still small compared with Nvidia's data center and networking businesses.

Marvell Teralynx T100 Puts AI Data-Center Switching Into the Chip Race
Chips & Semiconductors

Marvell Teralynx T100 Puts AI Data-Center Switching Into the Chip Race

Marvell announced planned availability of its Teralynx T100 switch chip for AI training and inference infrastructure. The 102.4 Tbps chip is built on a 3nm process, supports up to a 512-port radix and is claimed to use 25 percent lower power than competitive solutions. The practical test is whether data-center customers use lower-power, high-radix switching to ease latency and power constraints in larger AI clusters.

Ciena's $50 Billion AI Network Target Puts Optical Capacity on the Hyperscaler Clock
Chips & Semiconductors

Ciena's $50 Billion AI Network Target Puts Optical Capacity on the Hyperscaler Clock

Ciena says AI demand could roughly double its addressable market to about $50 billion by 2029 as hyperscalers and service providers invest in optical networking. It cited RLS Hyper Rail, DCOM, coherent modules and 400G/800G pluggable optics as demand areas while planning $250 million to $275 million in capex this year. The practical test is whether AI compute buildouts convert into durable network orders.

Warren Hearing Request Puts Nvidia China Chip Sales Under Export-Control Scrutiny
Chips & Semiconductors

Warren Hearing Request Puts Nvidia China Chip Sales Under Export-Control Scrutiny

Sen. Elizabeth Warren invited Nvidia CEO Jensen Huang to testify before the Senate Banking Committee on June 11 over China chip sales and export controls. The request focuses on Nvidia's views on U.S. export control laws and its business in China as lawmakers scrutinize advanced AI chip flows. The next signal is whether Huang appears and gives senators enough detail on Nvidia's China strategy and national-security posture.

Keep Reading

More Stories

Latest
Apple AI Architecture Puts Google And Nvidia Inside Its Privacy TestAIJun 9, 2026Apple AI Architecture Puts Google And Nvidia Inside Its Privacy TestApple is using Google and Nvidia to support its most advanced cloud AI model while trying to keep Apple Intelligence centered on private orchestration, proprietary models and on-device context.Amazon-Corning Fiber Deal Puts Optics Inside The AI Data Center BottleneckCloud & Data CentersJun 9, 2026Amazon-Corning Fiber Deal Puts Optics Inside The AI Data Center BottleneckAmazon has reached a multi-year optical fiber and networking agreement with Corning, adding North Carolina manufacturing jobs and highlighting fiber capacity as a practical constraint in AI data center expansion.Check Point VPN Exploitation Puts Legacy IKEv1 Access In The Ransomware SpotlightCybersecurityJun 8, 2026Check Point VPN Exploitation Puts Legacy IKEv1 Access In The Ransomware SpotlightA critical Check Point VPN flaw, CVE-2026-50751, is being exploited against legacy IKEv1 remote-access configurations, with activity tied in one case to a Qilin ransomware affiliate and a second related VPN issue also disclosed.Silent Ransom Group Uses Fake IT Support Calls to Pressure Law FirmsCybersecurityJun 8, 2026Silent Ransom Group Uses Fake IT Support Calls to Pressure Law FirmsSilent Ransom Group is targeting U.S. law firms and professional services organizations with fake IT support calls, remote access tools and rapid data-theft extortion. Mandiant links the activity to UNC3753, Luna Moth and Chatty Spider, while the FBI has warned of related social engineering and in-person theft attempts.Alphabet’s $85 Billion AI Financing Push Tests Data Center Investor AppetiteCloud & Data CentersJun 8, 2026Alphabet’s $85 Billion AI Financing Push Tests Data Center Investor AppetiteAlphabet is seeking $85 billion in equity financing after raising its capex outlook to as high as $190 billion. The company is presenting Google Cloud growth, AI adoption and lower Gemini serving costs as evidence that its data center spending can support long-term AI demand.Apple WWDC 2026 Turns Siri Into the Test of Its AI CredibilityAIJun 8, 2026Apple WWDC 2026 Turns Siri Into the Test of Its AI CredibilityApple is expected to put Siri back at the center of WWDC 2026 after delays to its promised Apple Intelligence assistant. The event is likely to test whether Apple can turn contextual awareness, chatbot-style interaction and agentic voice tasks into reliable platform features.ChatGPT Lockdown Mode Narrows AI Data Exfiltration PathsCybersecurityJun 8, 2026ChatGPT Lockdown Mode Narrows AI Data Exfiltration PathsOpenAI is rolling out Lockdown Mode for eligible ChatGPT users to reduce data exfiltration risk from prompt injection. The optional setting limits outbound web and tool capabilities, trading some product flexibility for stronger containment around sensitive workflows.Smart TV Proxy SDKs Turn Free Apps Into a Hidden AI Scraping Supply ChainCybersecurityJun 7, 2026Smart TV Proxy SDKs Turn Free Apps Into a Hidden AI Scraping Supply ChainBright Data's SDK has been reverse-engineered in research showing how free apps can turn consumer devices, including smart TVs, into residential proxy nodes for web-scraping traffic. The issue matters because AI data harvesting is increasing demand for residential IPs, while consent screens and background network behavior may not be clear to users or IT teams.Stratos Data Center Cuts Utah Plan as Water Backlash Tests AI Infrastructure GrowthAIJun 7, 2026Stratos Data Center Cuts Utah Plan as Water Backlash Tests AI Infrastructure GrowthA Kevin O'Leary-backed Utah data center plan has been cut back after water and transparency objections, showing how local resistance can reshape AI infrastructure projects.Dubai Hotels Turn to Residents as Tourism Shock Tests Luxury DemandEconomyJun 7, 2026Dubai Hotels Turn to Residents as Tourism Shock Tests Luxury DemandDubai luxury hotels are using resident staycation discounts to offset weaker international tourism, but the source shows weekend demand cannot fully replace longer foreign stays.liko.ai Funding Turns Edge AI Into a Smart-Home Hardware TestAIJun 7, 2026liko.ai Funding Turns Edge AI Into a Smart-Home Hardware Testliko.ai completed its first-round financing to fund edge-side vision-language models, AI-native hardware and multi-modal home terminals. The investor group includes Shangtang Guoxiang Capital, Orient Fortune Capital, iFlytek Venture Capital, Hongtai Fund, Zhengxuan Investment and Mianbi Intelligence. The practical test is whether the startup can turn camera-based edge AI into a consumer smart-home hub without relying on cloud processing.Impact Circle Turns Impact Finance Into a Japan Fintech Measurement TestFintech & Digital PaymentsJun 7, 2026Impact Circle Turns Impact Finance Into a Japan Fintech Measurement TestTokyo-based Impact Circle is building a fintech model that measures social impact through its own lending and visualization businesses. The company won the Tokyo Financial Award 2025 financial innovation category and raised 335 million yen in a November 2024 Series A round. The next signal is whether Impact Cloud IC can turn impact measurement into a repeatable workflow for investors and Japanese corporations.