News
CAPACITY TEST:

Nvidia And AWS Add Blackwell G7 GPUs To Production AI Stack

Newsroom brief

AWS is adding EC2 G7 instances with Nvidia RTX PRO 4500 Blackwell GPUs, cuVS-backed OpenSearch vector indexing and GB300 Exemplar Cloud status for AI training workloads.

Verified against source materialEdited by SendTech Times Cloud & Infrastructure Desk
Nvidia And AWS Add Blackwell G7 GPUs To Production AI Stack
Image source: NVIDIA

AWS Adds Blackwell G7 Instances

Nvidia and Amazon Web Services are expanding the AWS AI infrastructure stack by putting Nvidia RTX PRO 4500 Blackwell Server Edition GPUs inside the EC2 G7 instance family.

The launch targets production workloads that need inference, graphics, spatial computing and GPU-accelerated analytics without customers managing their own GPU platform.

The hardware claim is specific.

At the largest configuration, a G7 instance can carry eight GPUs, 256GB of combined GPU memory, EFA networking at 700 Gbps and local NVMe SSD storage reaching 7.6TB.

AWS is offering one-, two-, four- and eight-GPU configurations, with bare metal coming soon.

Nvidia says the instances deliver up to 4.6x AI inference performance and up to 2.1x graphics performance compared with G6 instances.

The same platform is also positioned for Amazon EMR analytics workloads using the Nvidia cuDF library for Apache Spark.

Vector Search Moves Into OpenSearch

The update also changes the retrieval layer for AI applications.

Amazon OpenSearch Serverless now sets Nvidia cuVS GPU acceleration as the default path for vector indexing in vector collections.

For teams building retrieval-augmented generation, semantic search, recommendation systems and agentic AI applications, the managed OpenSearch path changes the deployment work.

Instead of treating GPU vector search as a separate optimization project, AWS is making it part of the managed OpenSearch Serverless path.

Nvidia says the customer impact is vector indexing that can run up to 10x faster while costing a quarter as much as CPU-only builds.

It also says billion-scale vector databases can be built in under an hour.

Those are vendor performance claims, but they identify the operating burden AWS is trying to reduce: moving raw enterprise data into searchable AI retrieval systems without running separate infrastructure.

The managed-service angle is as important as the speed claim.

Enterprises building AI retrieval systems often need vector search, serverless scaling and idle-time cost control in the same workflow.

AWS and Nvidia are packaging those pieces inside OpenSearch rather than asking each team to build a separate GPU indexing pipeline.

G7 also reaches more than one buyer group.

AI teams can use the instances for lower-latency inference, media teams can use the same family for high-resolution video and rendering, and data teams can apply the GPU memory, storage and networking to analytics pipelines.

That breadth is useful for procurement teams because one instance family can support several production workloads instead of a single AI pilot.

GB300 Status Targets Training Buyers

AWS has also achieved Nvidia Exemplar Cloud status on Nvidia GB300 for training workloads.

Nvidia describes the status as evidence that AWS meets the performance thresholds it uses to benchmark AI workloads against its reference architecture.

The designation is aimed at companies comparing cloud providers for large-scale training.

It does not name customer deployments or pricing, but it gives procurement and AI infrastructure teams another benchmark when they compare training performance, total cost of ownership and the move from pilots to production.

For AWS customers, the new stack now covers GPU instances, managed vector indexing and a GB300 training-performance benchmark.

Nvidia and AWS did not publish regional availability, customer adoption figures or pricing in the announcement, leaving buyers to test whether the claimed inference, search and training gains hold inside their own workloads.

Share this article
inXf

Related articles

More
Blackwell’s MLPerf Run Puts AI Training Bottlenecks At Rack Scale
Cloud & Data Centers

Blackwell’s MLPerf Run Puts AI Training Bottlenecks At Rack Scale

NVIDIA says Blackwell led MLPerf Training 6.0 across all seven benchmarks, with submissions scaling to 8,192 GPUs and GB300 NVL72 training up to 1.6x faster than GB200 NVL72 at the same scale.

Digi Power X Lands $19.6 Million Blackwell Capacity Deal
Cloud & Data Centers

Digi Power X Lands $19.6 Million Blackwell Capacity Deal

Digi Power X signed a $19.6 million, 24-month agreement with SubQ AI for bare metal Nvidia Blackwell GPU capacity, but did not identify which data center will host the deployment.

Together AI Taps Rumble for Dedicated Blackwell Cloud Capacity
Cloud & Data Centers

Together AI Taps Rumble for Dedicated Blackwell Cloud Capacity

Together AI signed a multi-year cloud capacity agreement with Rumble Inc. for dedicated Nvidia HGX B300 systems. Rumble did not disclose the deal value, GPU count or deployment date, while Northern Data assets add more than 22,000 GPUs and approximately 250MW of capacity context. The practical question is whether the agreement turns into delivered Blackwell-class capacity for Together AI customers.

Buzz HPC’s £220 Million Deal Puts Canada’s Sovereign AI Push on Nvidia Racks
Cloud & Data Centers

Buzz HPC’s £220 Million Deal Puts Canada’s Sovereign AI Push on Nvidia Racks

Buzz HPC signed a three-year sovereign AI contract with Bell Canada and Cohere, with 2,304 Nvidia Grace Blackwell GPUs planned for Bell Canada’s Merritt data center.

Keep Reading

More Stories

Latest
Nvidia Tops 400 Systems On TOP500 Supercomputer ListChips & SemiconductorsJun 25, 2026Nvidia Tops 400 Systems On TOP500 Supercomputer ListNvidia says its technology now powers more than 400 of the world’s 500 fastest supercomputers, with Grace CPUs, GPUs and networking expanding across AI and science systems.Anthropic Hiring Points To Australia And Japan AI Data Center PushCloud & Data CentersJun 25, 2026Anthropic Hiring Points To Australia And Japan AI Data Center PushAnthropic is hiring compute and data center staff in Australia and Japan as its AI growth strains infrastructure and pushes the company toward new international capacity.Anthropic Alleges Alibaba Used 25,000 Accounts In AI Distillation CampaignAIJun 25, 2026Anthropic Alleges Alibaba Used 25,000 Accounts In AI Distillation CampaignAnthropic told U.S. Senate banking leaders that operators affiliated with Alibaba carried out 28.8 million model exchanges using roughly 25,000 fraudulent accounts between April 22 and June 12.SK Hynix Plans Nasdaq ADR As AI Memory Demand Lifts FundraisingChips & SemiconductorsJun 25, 2026SK Hynix Plans Nasdaq ADR As AI Memory Demand Lifts FundraisingSK Hynix plans a Nasdaq ADR listing that could raise 45.45 trillion won, giving global investors a new route into AI memory demand while capacity plans remain tied to Korea and Indiana.Micron Locks AI Memory Buyers Into Long-Term Supply DealsChips & SemiconductorsJun 25, 2026Micron Locks AI Memory Buyers Into Long-Term Supply DealsMicron said fiscal third-quarter revenue reached $41.46 billion and outlined 16 long-term customer agreements as AI data center demand keeps memory supply tight into 2028.Chile Cable Dispute Turns AI Data Routes Into A Sovereignty FightCloud & Data CentersJun 25, 2026Chile Cable Dispute Turns AI Data Routes Into A Sovereignty FightChile’s review of a $500-million China Mobile subsea cable proposal collided with U.S. pressure, while Google’s 14,800-kilometer Humboldt route remains the country’s approved Asia-Pacific link.UAE Central Bank Penalty Tightens AML Pressure On Foreign BanksFintech & Digital PaymentsJun 25, 2026UAE Central Bank Penalty Tightens AML Pressure On Foreign BanksThe UAE Central Bank fined a foreign bank branch Dh20 million and its compliance head Dh300,000 after finding significant, repeated AML/CFT failures, extending the country’s financial-sector enforcement push.Qualcomm Names Meta As First Dragonfly Data Center CPU CustomerChips & SemiconductorsJun 25, 2026Qualcomm Names Meta As First Dragonfly Data Center CPU CustomerQualcomm said Meta will use its Dragonfly C1000 data center CPU when production starts in 2028, while the chipmaker raised its fiscal 2029 non-handset revenue projection to $40 billion.AMD And Rackspace Set 30 MW AI Compute Plan For Regulated WorkloadsCloud & Data CentersJun 25, 2026AMD And Rackspace Set 30 MW AI Compute Plan For Regulated WorkloadsAMD and Rackspace Technology signed a definitive agreement for an initial 30 MW of AMD-based compute across Rackspace global data centers, with deployments planned from late 2026 through 2028.DEWA Puts AI And Reserves Behind Dubai Utility ReadinessEconomyJun 25, 2026DEWA Puts AI And Reserves Behind Dubai Utility ReadinessDEWA chief Saeed Mohammed Al Tayer said Dubai has about 18,000 megawatts of electricity capacity, expanding clean-energy capacity and AI-backed utility operations as the city prepares for growth and crisis continuity.First Street Puts Climate Risk Into Data Center Site SelectionCloud & Data CentersJun 25, 2026First Street Puts Climate Risk Into Data Center Site SelectionFirst Street says 79% of global data center capacity is exposed to significant acute climate hazards, adding flood, wind, wildfire, heat and water stress to AI infrastructure underwriting.Agility Robotics SPAC Deal Puts Digit Orders Under Public-Market ReviewAIJun 25, 2026Agility Robotics SPAC Deal Puts Digit Orders Under Public-Market ReviewAgility Robotics agreed to merge with Churchill Capital Corp XI at a $2.5 billion pre-money equity value, putting contracted Digit v5 orders and unresolved closing conditions in front of public investors.