The race to build AI infrastructure is intensifying, and companies that can offer large-scale computing power are becoming critical to the market.
As demand for AI models, cloud services, and data processing rises, so does the need for faster, higher-capacity infrastructure.
NVIDIA’s latest move shows it wants to stay at the centre of that buildout. It is now placing a major bet on Nebius Group, announcing a strategic partnership and a $2 billion investment to help build the next generation of hyperscale AI cloud infrastructure.
NVIDIA deepens partnership with Nebius
The two companies said they will work together to develop and deploy infrastructure for the growing AI market, serving both AI-native businesses and enterprise customers.
Nebius said the partnership will help it accelerate the buildout of its full-stack AI cloud platform. NVIDIA will support that expansion by providing Nebius with early access to its latest accelerated computing technologies.
The partnership comes as demand for high-performance AI compute continues to surge globally. To meet that need, Nebius plans to scale aggressively in the coming years.
The company said the collaboration builds on its existing rollout of NVIDIA infrastructure across its global platform, including multiple gigawatt-scale AI factories in the US. Nebius aims to deploy more than 5 gigawatts of capacity by the end of 2030, with NVIDIA backing its early adoption of next-generation computing systems.
What the two companies will work on
Under the partnership, NVIDIA and Nebius will collaborate across several core parts of the AI infrastructure stack.
AI factory design and support: Including access to partner design material, design review processes and acceptance, early samples and system software support, bring-up support, and regular system partner business and technical reviews.
Inference: Creating a best-in-class inference and agentic AI stack for developers and enterprises with NVIDIA’s latest software technologies, optimised models and libraries.
AI infrastructure deployment: Deploying multiple generations of NVIDIA infrastructure across Nebius’s platform through early adoption of NVIDIA computing architectures, including the NVIDIA Rubin platform, NVIDIA Vera CPUs and NVIDIA BlueField® storage systems.
Fleet management: Improving the health of Nebius’s fleet by deploying NVIDIA’s latest GPU health monitoring and software recommendations.
AI is entering a new phase
NVIDIA founder and CEO Jensen Huang said the market is entering another major shift, this time driven by agentic AI.
“AI is at another inflexion point — agentic AI, driving incredible compute demand and accelerating infrastructure buildout,” said Huang. “Nebius is building an AI cloud designed for the agentic era, fully integrated from silicon to software and powered by NVIDIA’s next-generation accelerated compute. Together, we are scaling the cloud to meet the surging global demand for intelligence.”
“Nebius has been built for AI since day one — not adapted from a general-purpose cloud, but designed for what developers actually need,” said Arkady Volozh, CEO of Nebius. “Now with NVIDIA, we are extending that throughout the stack — from gigawatt-scale AI factories to inference and software — as we build one of the first and largest clouds for all AI builders everywhere.”