GPU Server configurations

GPU Servers are provisioned by choosing a server plan, which defines the amount of CPU, memory and number of GPUs.

A GPU Server configuration includes CPU cores, memory and access to one or multiple GPU accelerators.

In addition, each server must have at least one block storage device for the operating system. Block storage devices can be of any size (1 GB - 4 TB each) and from any storage tier.

Public IPv4 addresses can be added at additional cost if needed.

Choosing the right GPU

| | NVIDIA L4 | NVIDIA L40S | NVIDIA H100 | NVIDIA B200 |
| --- | --- | --- | --- | --- |
| Use case | Image generation, speech-to-text, basic inference. | Video, graphics, 3D rendering, intensive inference. | High-traffic inference, massive batch processing and large-model training. | Trillion-parameter model inference, model training, running complex models in real-time. |
| GPU memory (VRAM) | 24 GB | 48 GB | 80 GB | 192 GB |
| Memory bandwidth | 300 GB/s | 864 GB/s | 3.35 TB/s | 8.0 TB/s |
| Performance (FP8) | 0.48 PFLOPS | 1.46 PFLOPS | 3.9 PFLOPS | 9.00 PFLOPS |
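As a rough illustration of using the comparison above, the following Python sketch picks the GPU with the least VRAM that still fits a workload. The VRAM figures come from the table; the `smallest_gpu_that_fits` helper and its 20% default headroom are assumptions made for this example, not an official sizing rule.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Gpu:
    name: str
    vram_gb: int


# VRAM figures taken from the comparison table.
GPUS = [
    Gpu("NVIDIA L4", 24),
    Gpu("NVIDIA L40S", 48),
    Gpu("NVIDIA H100", 80),
    Gpu("NVIDIA B200", 192),
]


def smallest_gpu_that_fits(required_vram_gb: float,
                           headroom: float = 0.2) -> Optional[Gpu]:
    """Return the GPU with the least VRAM that fits the workload.

    `headroom` is a hypothetical safety margin (20% by default) added on
    top of the estimated requirement. Returns None if no single GPU fits.
    """
    needed = required_vram_gb * (1 + headroom)
    candidates = [gpu for gpu in GPUS if gpu.vram_gb >= needed]
    if not candidates:
        return None
    return min(candidates, key=lambda gpu: gpu.vram_gb)
```

With the default headroom, a 16 GB workload maps to the L4, while a 70 GB workload maps to the B200, because 84 GB exceeds the 80 GB of a single H100. A workload that does not fit on one GPU returns `None`, which is the point at which a multi-GPU plan would be worth considering.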

GPU Servers with NVIDIA L4

Plans include

  • CPU & Memory & GPUs
  • 99.999% SLA
  • Billed only when server is started

Add-ons

  • Block storage (any tier)
  • Public IP addresses
  • Backup options

CPU model: AMD EPYC 9575F

| CPU cores | RAM | GPU | Identifier |
| --- | --- | --- | --- |
| 8 cores | 64 GB | 1 x NVIDIA L4 | GPU-8xCPU-64GB-1xL4 |
| 12 cores | 128 GB | 1 x NVIDIA L4 | GPU-12xCPU-128GB-1xL4 |
| 12 cores | 128 GB | 2 x NVIDIA L4 | GPU-12xCPU-128GB-2xL4 |
| 16 cores | 192 GB | 1 x NVIDIA L4 | GPU-16xCPU-192GB-1xL4 |
| 16 cores | 192 GB | 2 x NVIDIA L4 | GPU-16xCPU-192GB-2xL4 |
| 16 cores | 192 GB | 3 x NVIDIA L4 | GPU-16xCPU-192GB-3xL4 |
| 20 cores | 256 GB | 1 x NVIDIA L4 | GPU-20xCPU-256GB-1xL4 |
| 20 cores | 256 GB | 2 x NVIDIA L4 | GPU-20xCPU-256GB-2xL4 |
| 20 cores | 256 GB | 3 x NVIDIA L4 | GPU-20xCPU-256GB-3xL4 |
| 32 cores | 384 GB | 2 x NVIDIA L4 | GPU-32xCPU-384GB-2xL4 |
| 32 cores | 384 GB | 3 x NVIDIA L4 | GPU-32xCPU-384GB-3xL4 |
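The plan identifiers listed here appear to follow the pattern `GPU-<cores>xCPU-<RAM>GB-<count>x<GPU model>`. The sketch below splits an identifier into its parts on that assumption. The pattern is inferred from the listed identifiers rather than from a published specification, and `PlanSpec` and `parse_plan_identifier` are hypothetical names chosen for this example.

```python
import re
from typing import NamedTuple


class PlanSpec(NamedTuple):
    cpu_cores: int
    ram_gb: int
    gpu_count: int
    gpu_model: str


# Pattern inferred from the identifiers in the plan tables (an assumption).
_PLAN_RE = re.compile(r"^GPU-(\d+)xCPU-(\d+)GB-(\d+)x([A-Za-z0-9]+)$")


def parse_plan_identifier(identifier: str) -> PlanSpec:
    """Split a GPU Server plan identifier into its components."""
    match = _PLAN_RE.match(identifier)
    if match is None:
        raise ValueError(f"unrecognized plan identifier: {identifier!r}")
    cores, ram, count, model = match.groups()
    return PlanSpec(int(cores), int(ram), int(count), model)
```

For example, `parse_plan_identifier("GPU-16xCPU-192GB-2xL4")` yields 16 cores, 192 GB of RAM and two L4 GPUs. Anything that does not match the assumed pattern raises a `ValueError` instead of being guessed at.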

GPU Servers with NVIDIA L40S

Plans include

  • CPU & Memory & GPUs
  • 99.999% SLA
  • Billed only when server is started

Add-ons

  • Block storage (any tier)
  • Public IP addresses
  • Backup options

CPU model: AMD EPYC 9575F

| CPU cores | RAM | GPU | Identifier |
| --- | --- | --- | --- |
| 8 cores | 64 GB | 1 x NVIDIA L40S | GPU-8xCPU-64GB-1xL40S |
| 12 cores | 128 GB | 1 x NVIDIA L40S | GPU-12xCPU-128GB-1xL40S |
| 12 cores | 128 GB | 2 x NVIDIA L40S | GPU-12xCPU-128GB-2xL40S |
| 16 cores | 192 GB | 1 x NVIDIA L40S | GPU-16xCPU-192GB-1xL40S |
| 16 cores | 192 GB | 2 x NVIDIA L40S | GPU-16xCPU-192GB-2xL40S |
| 16 cores | 192 GB | 3 x NVIDIA L40S | GPU-16xCPU-192GB-3xL40S |
| 20 cores | 256 GB | 1 x NVIDIA L40S | GPU-20xCPU-256GB-1xL40S |
| 20 cores | 256 GB | 2 x NVIDIA L40S | GPU-20xCPU-256GB-2xL40S |
| 20 cores | 256 GB | 3 x NVIDIA L40S | GPU-20xCPU-256GB-3xL40S |
| 32 cores | 384 GB | 2 x NVIDIA L40S | GPU-32xCPU-384GB-2xL40S |
| 32 cores | 384 GB | 3 x NVIDIA L40S | GPU-32xCPU-384GB-3xL40S |

GPU Servers with NVIDIA H100

Plans include

  • CPU & Memory & GPUs
  • 99.999% SLA
  • Billed only when server is started

Add-ons

  • Block storage (any tier)
  • Public IP addresses
  • Backup options

CPU model: Intel Xeon Platinum 8462Y+

| CPU cores | RAM | GPU | Identifier |
| --- | --- | --- | --- |
| 12 cores | 240 GB | 1 x NVIDIA H100 | GPU-12xCPU-240GB-1xH100 |
| 24 cores | 480 GB | 2 x NVIDIA H100 | GPU-24xCPU-480GB-2xH100 |
| 48 cores | 960 GB | 4 x NVIDIA H100 | GPU-48xCPU-960GB-4xH100 |
| 96 cores | 1920 GB | 8 x NVIDIA H100 | GPU-96xCPU-1920GB-8xH100 |

NVLink included

H100 GPU Servers include NVIDIA NVLink technology for direct GPU-to-GPU communication. NVLink provides 900 GB/s of bidirectional bandwidth between GPUs, enabling highly efficient multi-GPU workloads, distributed training, and large-scale model inference without going through CPU memory.

GPU Servers with NVIDIA B200

Plans include

  • CPU & Memory & GPUs
  • 99.999% SLA
  • Billed only when server is started

Add-ons

  • Block storage (any tier)
  • Public IP addresses
  • Backup options

CPU model: Intel Xeon Platinum 8570

| CPU cores | RAM | GPU | Identifier |
| --- | --- | --- | --- |
| 12 cores | 240 GB | 1 x NVIDIA B200 | GPU-12xCPU-240GB-1xB200 |
| 24 cores | 480 GB | 2 x NVIDIA B200 | GPU-24xCPU-480GB-2xB200 |
| 48 cores | 960 GB | 4 x NVIDIA B200 | GPU-48xCPU-960GB-4xB200 |
| 96 cores | 1920 GB | 8 x NVIDIA B200 | GPU-96xCPU-1920GB-8xB200 |

NVLink included

B200 GPU Servers include NVIDIA NVLink technology for direct GPU-to-GPU communication. NVLink provides 1800 GB/s of bidirectional bandwidth between GPUs, enabling highly efficient multi-GPU workloads, distributed training, and large-scale model inference without going through CPU memory.
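To put the two bandwidth figures in perspective, the sketch below computes an idealized lower bound on GPU-to-GPU transfer time. It assumes the quoted bidirectional figure splits evenly between the two directions and ignores protocol overhead, topology and contention, so real transfers will take longer. The `ideal_transfer_seconds` function is a hypothetical helper written for this example.

```python
# Bidirectional bandwidth per GPU in GB/s, as quoted for each plan family.
NVLINK_BIDIRECTIONAL_GBPS = {"H100": 900, "B200": 1800}


def ideal_transfer_seconds(payload_gb: float, gpu: str,
                           one_way: bool = True) -> float:
    """Best-case time to move `payload_gb` gigabytes between two GPUs.

    Assumption: a one-way transfer can use half of the bidirectional
    figure. No overhead is modelled, so this is a lower bound only.
    """
    total = NVLINK_BIDIRECTIONAL_GBPS[gpu]
    rate = total / 2 if one_way else total
    return payload_gb / rate
```

Under these assumptions, sending 45 GB one way takes at least 0.1 s on H100 and 0.05 s on B200.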

How GPU Servers are billed when shut down
GPU Server plans are billed only while the server is powered on. Attached block storage devices and public IPv4 addresses remain reserved, however, and are therefore billed even while the server is shut down.
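The billing rule can be sketched as a small calculation: the plan is charged only for powered-on hours, while storage and IPv4 addresses are charged for the whole period. All rates in this Python example are hypothetical placeholders rather than actual prices, and `estimate_monthly_cost` and the 730-hour month are assumptions made for illustration.

```python
def estimate_monthly_cost(hours_powered_on: float,
                          plan_hourly: float,
                          storage_hourly: float,
                          ipv4_hourly: float,
                          hours_in_month: float = 730) -> float:
    """Estimate one month's cost for a single GPU Server.

    All hourly rates are hypothetical inputs supplied by the caller.
    """
    if not 0 <= hours_powered_on <= hours_in_month:
        raise ValueError("hours_powered_on must be within the month")
    # The plan (CPU, memory, GPUs) is billed only while powered on.
    plan_cost = hours_powered_on * plan_hourly
    # Storage and public IPv4 stay reserved, so they bill for every hour.
    reserved_cost = hours_in_month * (storage_hourly + ipv4_hourly)
    return plan_cost + reserved_cost
```

With made-up rates of 2.00 per hour for the plan, 0.01 for storage and 0.005 for an IPv4 address, a server that runs for 100 hours in a 730-hour month would cost 200.00 + 10.95 = 210.95. A server that stays shut down all month would still incur the 10.95 for reserved resources.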

Trial limitations
We offer new users a 7-day free trial for getting familiar with our services and testing Cloud Server deployments and managed services without commitment. GPU Servers are not included in the trial. If you have specific requirements for trying GPU Servers, please contact us.

NVIDIA is a registered trademark of NVIDIA Corporation.
