SoftBank Launches Sovereign AI GPU Cloud

BySendTech Times Cloud & Infrastructure DeskNewsroom-edited, source-reviewed coverage|Source: RCR Wireless

Newsroom brief

SoftBank has introduced its AI Data Center GPU Cloud, a sovereign AI infrastructure service. The service aims to keep data within Japanese jurisdiction, filling a gap left by major cloud providers. A beta version is live now, with commercial availability set for October 2026.

Verified against source materialEdited by SendTech Times Cloud & Infrastructure Desk

SoftBank Launches Sovereign AI GPU Cloud

Image source: RCR Wireless

Overview of AI Data Center GPU Cloud

SoftBank has announced its AI Data Center GPU Cloud, a sovereign AI infrastructure service that moves the company further into competition with global cloud giants.

This service is a key part of SoftBank's broader "Activate AI for Society" strategy.

A beta version went live immediately, although commercial availability is not scheduled until October 2026, initially limited to internal use across SoftBank group companies.

Unique Offerings and Technology

This initiative builds on a series of partnerships SoftBank has quietly formed over the past year, particularly with NVIDIA.

Instead of offering a generic GPU cloud, SoftBank integrates its telecom assets, edge network, and AI compute into a single service designed for customers prioritizing data sovereignty within Japan.

The core of the service is SoftBank’s proprietary software stack, the Infrinia AI Cloud OS, which consolidates AI computing infrastructure with the necessary software layers to manage modern AI workloads at scale.

The platform offers two main delivery modes: Kubernetes as a Service (KaaS) for multi-tenant environments and Inference as a Service (Inf-aaS), which provides access to large language model inference through APIs.

This setup is intended to support a wide range of workloads, from model training to general data processing.

Hardware and Networking

On the hardware side, SoftBank relies heavily on Nvidia technologies.

The cloud is built on Nvidia GB200 NVL72 systems utilizing the Grace Hopper architecture and is hosted in Japan-based data centers.

The Infrinia AI Cloud OS manages everything from BIOS configuration to Kubernetes management on the GPU platforms.

SoftBank employs Nvidia BlueField-3 DPUs to enhance both vRAN and generative AI workloads, with an integrated Nvidia Spectrum Ethernet switch facilitating the 5G timing protocol.

This AI Data Center GPU Cloud is part of SoftBank's "Telco AI Cloud" vision, which aims to connect large-scale GPU data centers with multi-access edge computing across its telecom network.

The edge component operates on AITRAS, SoftBank’s fully software-defined AI-RAN solution, currently deployed at Nvidia’s Santa Clara headquarters.

The goal is to achieve low-latency distributed inference processing at the network edge, while central data centers manage training and heavy computational tasks.

By sharing hardware between AI and telecom workloads, SoftBank claims to effectively provide "5G for free" from the same infrastructure.

This approach reportedly offers up to a fourfold improvement in ROI for vRAN workloads compared to dedicated 5G deployments.

With Japanese enterprises increasingly focused on data sovereignty, SoftBank's offering fills a significant gap in the market, especially as major global cloud providers have limited sovereign options in Japan.

#AI infrastructure #cloud

Cloud & Data Centers

Google Compute Lease Turns SpaceX Data Centers Into an AI Capacity Test

SpaceX lined up a Google compute agreement that gives Google access to about 110,000 NVIDIA GPUs and related components. The filing-based terms call for $920 million a month from October 2026 through June 2029, with delivery protections if GPU access is not ready by September 30, 2026. The next signal is whether SpaceX can turn AI data-center capacity into reliable third-party infrastructure before Google's bridge-capacity need changes.

Cloud & Data Centers

Iren Plans 800MW Australia AI Data Center Campus as Power Becomes the Capacity Gate

Iren signed a transmission connection agreement for a planned 800MW data center campus in Bundey, South Australia. The project is Iren's first Australian foray and is expected to be energized in 2028 as the company shifts more cash flow toward AI cloud infrastructure. The practical question is whether Iren can turn grid-connected power, financing and GPU capacity into energized AI cloud campuses on the announced timelines.

Cloud & Data Centers

Infrastructure Captures 82% Of Generative AI Value As Applications Lag

AI Times Korea cited Exponential View data showing 82% of measured AI-economy value going to cloud, GPU and inference infrastructure in Q1 2026, while foundation-model companies accounted for 11% and applications 7%.

Cloud & Data Centers

NAVER’s 55-Megawatt NVIDIA Buildout Tests Sovereign AI Cloud Demand

NAVER and NVIDIA are expanding sovereign AI infrastructure from a 55-megawatt starting point toward gigawatt scale, tying Korea’s AI factory ambitions to DSX software, GAK Sejong capacity and localized model services.

Cloud & Data Centers

CAS Star’s Photonics Bet Turns Into an AI Infrastructure Test

CAS Star founder Mi Lei says the AI boom has validated a decade-long investment thesis around photonics and other hard-tech fields. The firm has more than 200 photonics-related companies among roughly 600 portfolio companies, spanning sensing, communications, computing, storage and display. The next test is whether optical links, laser chips and photonic computing companies can turn AI data-centre demand into durable commercial scale.

Cloud & Data Centers

Crusoe Adds Serverless Fine-Tuning To AI Infrastructure Platform

Crusoe is adding serverless fine-tuning and self-service inference deployments to Intelligence Foundry, Data Center Knowledge reported. The launch moves the pitch beyond raw GPU access, but Crusoe did not disclose customers, exact pricing, utilisation targets or customer-verified savings.