Google Cloud and Nvidia built an integrated AI superstack combining custom hardware, networking, and software to optimize large-model training and inference at scale.
Nvidia Corp. and Google LLC used the search giant's annual Cloud Next event to deepen their long-running partnership, creating a full-stack "artificial intelligence factory" that integrates Google's AI Hypercomputer infrastructure with Nvidia's latest solutions, including Blackwell, open models, and tooling for agentic and physical AI. The announcement expands Google's distribution of Nvidia's accelerated computing stack, while customers gain a faster, lower-risk path from AI experimentation to large-scale deployment. Nvidia and Google Cloud have been co-developing the accelerated cloud stack for about a decade, starting with early K80/P100 GPU instances and evolving into the AI Hypercomputer architecture.
Google has quietly built out one of the world's largest accelerated infrastructure deployments, with well over a million Nvidia GPUs deployed across its global fleet for internal products and Google Cloud services. This scale has two major implications. First, it shortens deployment times: because the backbone, supply chain, and data center footprint are already GPU-centric, each new GPU generation, such as Hopper, Blackwell, and Vera Rubin, can roll out faster, with those accelerators appearing quickly in customer-facing SKUs like A3/A5X and DGX Cloud. Second, there should be plenty of capacity for AI factories: the multitenant, massively scaled clusters that underpin the AI Hypercomputer concept allow enterprises to spin up large language model and agent workloads across tens of thousands of Nvidia GPUs without bespoke infrastructure engineering.
Nvidia's broad programmability maps cleanly onto Google's AI Hypercomputer strategy, as both companies focus on building dense, software-defined supercomputers rather than generic cloud infrastructure as a service. Specialized accelerators such as TPUs and other application-specific integrated circuits are powerful, but they are narrow by comparison. Nvidia's cross-industry, multicloud footprint makes it attractive to enterprises that need to ship software to any customer, anywhere. For Google Cloud, aligning with Nvidia broadens the appeal of its AI infrastructure to customers who want a neutral, portable accelerated platform rather than a proprietary stack that locks them into a single cloud or architecture.
For customers, this partnership is about reducing risk and shortening time-to-value by lowering the organizational friction of adopting AI, as infrastructure teams, data scientists and app teams share a common, battle-tested platform. The Google-Nvidia stack provides enterprises with a reference design for building AI factories and cloud-scale clusters for training, fine-tuning, inference and simulation that they can consume as a service or emulate on-premises with similar building blocks. Information technology leaders no longer have to guess which region or instance type will still be available at scale in eighteen months, because Google is standardizing on Nvidia as the default accelerator fabric, alongside its tensor processing units.
Google has spent a decade playing third fiddle to Amazon Web Services and Microsoft Azure, but its partnership with Nvidia gives it a first-fiddle story in AI: a co-designed AI Hypercomputer, tuned for agentic and physical AI, that turns Google's Nvidia-powered supercomputers into a product enterprises and startups can actually buy. With a decade-long partnership with Nvidia and the widest range of Blackwell instances available today, Google provides customers with choice at a critical moment in the AI era.