SendTech Times

SoftBank Launches Sovereign AI GPU Cloud

Article summary

SoftBank has introduced its AI Data Center GPU Cloud, a sovereign AI infrastructure service. The service aims to keep data within Japanese jurisdiction, filling a gap left by major cloud providers. A beta version is live now, with commercial availability set for October 2026.

Image source: RCR Wireless

Overview of AI Data Center GPU Cloud

SoftBank has announced its AI Data Center GPU Cloud, a sovereign AI infrastructure service that moves the company further into competition with global cloud giants.

This service is a key part of SoftBank's broader "Activate AI for Society" strategy.

A beta version went live immediately, initially limited to internal use across SoftBank group companies. Commercial availability is not scheduled until October 2026.

Unique Offerings and Technology

This initiative builds on a series of partnerships SoftBank has quietly formed over the past year, particularly with Nvidia.

Instead of offering a generic GPU cloud, SoftBank integrates its telecom assets, edge network, and AI compute into a single service designed for customers prioritizing data sovereignty within Japan.

The core of the service is SoftBank’s proprietary software stack, the Infrinia AI Cloud OS, which consolidates AI computing infrastructure with the necessary software layers to manage modern AI workloads at scale.

The platform offers two main delivery modes: Kubernetes as a Service (KaaS) for multi-tenant environments and Inference as a Service (Inf-aaS), which provides access to large language model inference through APIs.

This setup is intended to support a wide range of workloads, from model training to general data processing.

Hardware and Networking

On the hardware side, SoftBank relies heavily on Nvidia technologies.

The cloud is built on Nvidia GB200 NVL72 systems, which use the Grace Blackwell architecture, and is hosted in Japan-based data centers.

The Infrinia AI Cloud OS handles everything from BIOS configuration to Kubernetes cluster management on the GPU platforms.

SoftBank employs Nvidia BlueField-3 DPUs to accelerate both vRAN and generative AI workloads, with an integrated Nvidia Spectrum Ethernet switch providing the precise timing protocol that 5G requires.

This AI Data Center GPU Cloud is part of SoftBank's "Telco AI Cloud" vision, which aims to connect large-scale GPU data centers with multi-access edge computing across its telecom network.

The edge component operates on AITRAS, SoftBank’s fully software-defined AI-RAN solution, currently deployed at Nvidia’s Santa Clara headquarters.

The goal is to achieve low-latency distributed inference processing at the network edge, while central data centers manage training and heavy computational tasks.
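The split between edge and central sites can be pictured as a simple placement rule. The Python sketch below is only an illustration of that idea, not SoftBank's scheduler: the `Workload` type, the `choose_site` function, and the 50 ms threshold are all hypothetical names and values introduced here.

```python
from dataclasses import dataclass

# Hypothetical threshold: inference that must answer faster than this goes to the edge.
EDGE_LATENCY_BUDGET_MS = 50


@dataclass(frozen=True)
class Workload:
    name: str
    kind: str  # "inference" or "training"
    latency_budget_ms: float  # how long the caller can wait for a response


def choose_site(workload: Workload) -> str:
    """Return "edge" or "central" for a workload (illustrative rule only)."""
    # Training and other heavy jobs stay in the central GPU data center.
    if workload.kind != "inference":
        return "central"
    # Latency-sensitive inference is placed at the network edge.
    if workload.latency_budget_ms <= EDGE_LATENCY_BUDGET_MS:
        return "edge"
    return "central"


jobs = [
    Workload("chat-assistant", "inference", 30),
    Workload("batch-summaries", "inference", 5000),
    Workload("model-finetune", "training", 0),
]
placements = {job.name: choose_site(job) for job in jobs}
print(placements)
```

A real placement system would also weigh factors such as available edge capacity and data location, which this sketch leaves out.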

By sharing hardware between AI and telecom workloads, SoftBank claims to effectively provide "5G for free" from the same infrastructure.

This approach reportedly offers up to a fourfold improvement in ROI for vRAN workloads compared to dedicated 5G deployments.
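To see what a fourfold ROI improvement means arithmetically, the short Python sketch below compares a dedicated deployment with one that earns additional AI revenue on the same hardware. Every figure is a hypothetical value in arbitrary units, chosen only to make the arithmetic visible; none comes from SoftBank, and the `roi` helper is defined here for illustration.

```python
def roi(revenue: float, cost: float) -> float:
    """Return on investment: net gain divided by cost."""
    return (revenue - cost) / cost


# Hypothetical figures in arbitrary units (not SoftBank data).
hardware_cost = 100.0
ran_revenue = 120.0  # dedicated 5G deployment: RAN revenue only
ai_revenue = 60.0    # assumed extra revenue from AI workloads on the same hardware

dedicated_roi = roi(ran_revenue, hardware_cost)
shared_roi = roi(ran_revenue + ai_revenue, hardware_cost)
improvement = shared_roi / dedicated_roi

print(f"dedicated ROI: {dedicated_roi:.0%}")
print(f"shared ROI: {shared_roi:.0%}")
print(f"improvement factor: {improvement:.1f}x")
```

Because the hardware cost is unchanged in this example, any added AI revenue raises the net gain directly, which is why a modest revenue increase can multiply ROI.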

With Japanese enterprises increasingly focused on data sovereignty, SoftBank's offering fills a significant gap in the market, especially as major global cloud providers have limited sovereign options in Japan.
