GPU NODES

NVIDIA GB200 NVL72

The GB200 Grace Blackwell Superchip™ is designed for a new type of data center—one that processes mountains of data to produce intelligence with maximum energy efficiency. These data centers run diverse workloads like AI, data analytics, hyperscale cloud applications, and high-performance computing (HPC).

Key Facts

30X FASTER
Llama 3.0 Inference
GB200 NVL72 is 30X faster for Inference 
vs. NVIDIA H100 Tensor Core GPU.
4X FASTER
Massive-Scale Training
GB200 NVL72 is 4X faster training for LLMs at scale than the H100
18X FASTER
Data Processing
GB200 NVL72 is 18X faster at processing data than Intel Xeon 8480+.
25X EFFICIENCY
Energy Efficiency
GB200 NVL2 is 25X more energy efficient than the H100.

Our Nodes

Take advantage of NVIDIA GB200 NVL72 GPU, connecting 36 Grace CPUs and 72 Blackwell GPUs in a rack-scale design. The GB200 NVL72 is a rack-scale solution that boasts a 72-GPU NVLink domain that acts as a single massive GPU and delivers 30X faster real-time trillion-parameter LLM inference.
Nscale AI Cloud Stack
A render of Nscale's nvidia gb200 NVL72 liquid-cooled solution

Get access to a fully integrated suite of AI services and compute

Reduce costs, grow revenue, and run your AI workloads more efficiently on a fully integrated platform. Whether you're using Nscale's built-in AI/ML tools or your own, our platform is designed to simplify the journey from development to production.

Serverless
Marketplace
Training
Inference
GPU nodes
Nscale's Data centers
Powered by renewable energy
LLM Library
Pre-configured Software
Pre-configured Infrastructure
Job Management
Job Scheduling
Container Orchestration
Optimised Libraries
Optimised Compilers and Tools
Optimised Runtime

Access thousands of GPUs tailored to your needs

Reserve GPUs