Compute & Cloud · Report

Nebius launches AI Cloud 3.1 with next-generation NVIDIA Blackwell Ultra chips and transparent capacity management for cloud customers.

Brings Blackwell Ultra to market with improved capacity visibility, competing directly with Nvidia cloud and other managed GPU platforms.

Trade pressSlicast · December 17, 2025 · Global · Source: finanznachrichten.de

importance 75

Nebius announced Nebius AI Cloud 3.1 on December 17, 2025, bringing next-generation NVIDIA Blackwell Ultra compute and enhanced operational capabilities to its full-stack AI cloud platform. Version 3.1 builds on the foundations of Nebius AI Cloud "Aether," adding transparent capacity management and expanded infrastructure to deliver the operational visibility and resource-planning capabilities that customers need as they scale AI in production. As customers move beyond experiments to large-scale adoption, they are identifying clear operational priorities, including real-time visibility into GPU capacity across regions, transparent resource allocation for multi-team environments, and access to next-generation compute as models grow in complexity.

Nebius is deploying NVIDIA Blackwell Ultra infrastructure globally, with NVIDIA GB300 NVL72 systems and NVIDIA HGX B300 systems already being used by customers. Nebius is now the first cloud in Europe to operate both platforms in production. Nebius AI Cloud is also the first provider globally to run production GB300 NVL72 systems on 800 Gbps NVIDIA Quantum-X800 InfiniBand interconnect—doubling throughput for distributed workloads. This capability, alongside hardware-accelerated networking and enhanced storage caching that eliminate infrastructure bottlenecks, complements leading results in MLPerf Training v5.1 benchmarks.

To meet enterprise demands for operational transparency, version 3.1 introduces Capacity Blocks and a real-time Capacity Dashboard that give customers complete visibility into their reserved GPU capacity and availability across all data center regions. Project-level quotas and new lifecycle object storage rules extend granular control over resource allocation and costs. The platform ecosystem continues to expand with a new native integration with Dstack and simplified deployment of NVIDIA BioNeMo NIM microservices, including Boltz2, Evo-2, GenMol, and MolMIM, without requiring NGC keys or NVIDIA AI Enterprise licenses.

Additional features include enhanced Slurm-based orchestration with Manager Soperator, FOCUS-compliant billing exports, and various console UX improvements. The latest release builds on Aether's enterprise-ready security foundation with object storage data-plane audit logs for HIPAA compliance, per-object access controls, and VPC security groups. Enhanced IAM with Microsoft Entra ID integration and granular service roles provide stronger governance with simplified administration, advancing the compliance capabilities that enable customers in highly regulated sectors and government to deploy production AI workloads. Nebius will hold a technical walkthrough webinar on Nebius AI Cloud 3.1 with a live Q&A session on January 29 at 6 p.m. CET / midday EST / 9 a.m. PST.

Read the original