Starting from
$0.69/hour
Experience exceptional performance across a range of demanding applications with our GPU Servers, from generative AI, LLM inference, and AI model training to 3D rendering and video processing.
On-demand NVIDIA L4, L40S, H100 and B200 GPUs available now!
Mian, Product Manager
GPU resources with the flexibility and easy management of our Cloud Servers.
No shared hardware – each GPU is always dedicated to a single server.
Get started quickly with drivers & tooling pre-installed.
No signup required
Instant access to our live demo. No signup. No commitment.
With UpCloud’s usage-based GPU billing, you only pay for active compute time. This allows for precise scaling of resources, eliminating costs for idle infrastructure. It’s ideal for teams seeking high-end GPUs without long-term commitments.
Experience the power of NVIDIA L40S GPUs!
Location: Helsinki, Finland
Processor: 8 vCPUs AMD EPYC 9575F
Memory: 64 GB DDR5 RAM
Price: from $1.27 / hour
Cost-effective AI Inference, High-throughput Video Processing, and Edge AI deployments. Optimized for low-latency, energy-efficient operations.
The NVIDIA L4 Tensor Core GPU powered by the NVIDIA Ada Lovelace architecture delivers universal, energy-efficient acceleration for video, AI, visual computing, graphics, virtualization, and more.
Generative AI, Mid-to-Large Scale AI Model Training and Inference, Real-time 3D Rendering, Virtual Production, and High-Performance Computing (HPC) simulations.
The NVIDIA L40S GPU is the most powerful universal GPU for the cloud, delivering end-to-end acceleration for the next generation of AI-enabled applications.
The NVIDIA H100 GPU delivers exceptional performance, scalability, and security for every workload. H100 uses breakthrough innovations based on the NVIDIA Hopper™ architecture to deliver industry-leading conversational AI, speeding up large language models (LLMs) by 30X.
H100 also includes a dedicated Transformer Engine to solve trillion-parameter language models.
Generative AI, Large-Scale AI Model Training and Inference (LLMs), High-Performance Computing (HPC) simulations, Scientific Computing, and Data Analytics.
The NVIDIA B200 GPU, powered by the Blackwell architecture, is the world’s most powerful AI chip, designed to power a new era of computing with up to 4x faster training and 30x faster inference than previous generations.
UpCloud GPU Servers offer a range of GPU models with the unique ability to scale the server and GPUs as needed.
Start with a smaller card for development purposes, then migrate to use more powerful cards for production use, or vice-versa, all without needing to reinstalling the server or losing any data.
It’s as easy as shutting down the server, changing the plan and powering the server back up again!
| NVIDIA L4 | NVIDIA L40S | NVIDIA H100 | NVIDIA B200 | |
|---|---|---|---|---|
| Primary use case | Energy-efficient accelerator for AI inference, video transcoding, graphics/VDI and edge deployments |
Multi-workload “universal” GPU – GenAI, LLM training & inference, 3D graphics, rendering, video |
High-traffic inference, massive batch processing and large-model training |
Trillion-parameter model inference, model training, running complex models in real-time |
| Architecture | Ada Lovelace | Ada Lovelace | Hopper | Blackwell |
| GPU Memory | 24GB GDDR6 | 48GB GDDR6 | 80GB HBM3e | 192GB HBM3e |
| Memory Bandwidth | 300GB/s | 864GB/s | 3.35TB/s | 8.0TB/s |
| Peak compute | FP32 30.3 TFLOPS FP8 Tensor 0.48 PFLOPS |
FP32 91.6 TFLOPS FP8 Tensor 1.46 PFLOPS |
FP32 67 TFLOPS FP8 Tensor 3.90 PFLOPS |
FP32 74.45 TFLOPS FP8 Tensor 9.00 PFLOPS |
Skip the API fees and data privacy concerns. Our new tutorial shows you how to spin up an UpCloud GPU and run powerful open-weight models like Mistral-7B with Ollama. Go from deployment to inference on your own private, high-performance server.
With our GPU servers, you keep full control over your data, your models, and your infrastructure.
Privacy-first by design: Hosted in Finland, your AI workloads stay in GDPR-compliant data centers with strong jurisdictional protections.
Open-source aligned: Whether you’re fine-tuning LLaMA, deploying open-source LLMs, or building on PyTorch etc, our infrastructure doesn’t impose restrictions or hidden service layers.
No platform dependencies: Unlike hyperscalers, we don’t force you into proprietary ML platforms or opaque orchestration layers.
1 – 3 per server
8 – 32
64 – 384 GB
Starting from
$0.69/hour
| GPU | CPU cores | RAM | Price |
|---|---|---|---|
| 1 x NVIDIA L4 | 8 cores | 64 GB | $0.69/h $464/mo |
| 1 x NVIDIA L4 | 12 cores | 128 GB | $0.83/h $558/mo |
| 1 x NVIDIA L4 | 16 cores | 192 GB | $0.97/h $652/mo |
| 1 x NVIDIA L4 | 20 cores | 256 GB | $1.11/h $746/mo |
| 2 x NVIDIA L4 | 12 cores | 128 GB | $1.40/h $941/mo |
| 2 x NVIDIA L4 | 16 cores | 192 GB | $1.54/h $1035/mo |
| 2 x NVIDIA L4 | 20 cores | 256 GB | $1.68/h $1,129/mo |
| 2 x NVIDIA L4 | 32 cores | 384 GB | $1.96/h $1,317/mo |
| 3 x NVIDIA L4 | 16 cores | 192 GB | $2.14/h $1,438/mo |
| 3 x NVIDIA L4 | 20 cores | 256 GB | $2.28/h $1,532/mo |
| 3 x NVIDIA L4 | 32 cores | 384 GB | $2.56/h $1,720/mo |
1 – 3 per server
8 – 32
64 – 384 GB
Starting from
$1.27/hour
| GPU | CPU cores | RAM | Price |
|---|---|---|---|
| 1 x NVIDIA L40S | 8 cores | 64 GB | $1.27/h $851/mo |
| 1 x NVIDIA L40S | 12 cores | 128 GB | $1.43/h $958/mo |
| 1 x NVIDIA L40S | 16 cores | 192 GB | $1.74/h $1170/mo |
| 1 x NVIDIA L40S | 20 cores | 256 GB | $2.06/h $1383/mo |
| 2 x NVIDIA L40S | 12 cores | 128 GB | $2.38/h $1596/mo |
| 2 x NVIDIA L40S | 16 cores | 192 GB | $3.01/h $2022/mo |
| 2 x NVIDIA L40S | 20 cores | 256 GB | $3.64/h $2447/mo |
| 2 x NVIDIA L40S | 32 cores | 384 GB | $4.28/h $2873/mo |
| 3 x NVIDIA L40S | 16 cores | 192 GB | $4.28/h $2873/mo |
| 3 x NVIDIA L40S | 20 cores | 256 GB | $4.91/h $3298/mo |
| 3 x NVIDIA L40S | 32 cores | 384 GB | $5.54/h $3724/mo |
1 – 8 per server
12 – 96
240 – 1920 GB
Starting from
$1.89/hour
| GPU | CPU cores | RAM | Price |
|---|---|---|---|
| 1 x NVIDIA H100 | 12 cores | 240 GB | $1.89/h $1,270/mo |
| 2x NVIDIA H100 | 24 cores | 480 GB | $3.78/h $2,540/mo |
| 4x NVIDIA H100 | 48 cores | 960 GB | $7.56/h $5,080/mo |
| 8 x NVIDIA H100 | 96 cores | 1920 GB | $15.12/h $10,161/mo |
1 – 8 per server
12 – 96
240 – 1920 GB
Starting from
$5.20/hour
| GPU | CPU cores | RAM | Price |
|---|---|---|---|
| 1 x NVIDIA B200 | 12 cores | 240 GB | $5.20/h $3,494/mo |
| 2x NVIDIA B200 | 24 cores | 480 GB | $10.40/h $6,989/mo |
| 4x NVIDIA B200 | 48 cores | 960 GB | $20.80/h $13,978/mo |
| 8 x NVIDIA B200 | 96 cores | 1920 GB | $41.60/h $27,955/mo |
99.999% uptime SLA across Public Cloud, Private Cloud, Managed Kubernetes, Managed Databases, and more.
Choose from a range of Linux and Windows distributions for full control and flexibility. Launch in just 45 seconds through our simple control panel.
Global reach spanning four continents and 15 data centers.
European cloud for a global audience. Help satisfy your compliance requirements with a European, ISO 27001 certified cloud provider.
We pride ourselves on providing outstanding customer service – 24 hours a day, 365 days a year.
In 2024, we were proud to continue our outstanding streak, responding to initial queries in just 46 seconds!
Simple, transparent pricing with no hidden fees. Focus on growth, knowing exactly what you’ll pay every month. Discover our plans, starting at just $3.5 per month.
Experiment with new technologies, build your servers with full root access, and scale resources on demand—all without worrying about unpredictable costs.
99.999% uptime SLA across Public Cloud, Private Cloud, Managed Kubernetes, Managed Databases, and more.
Choose from a range of Linux and Windows distributions for full control and flexibility. Launch in just 45 seconds through our simple control panel.
Global reach spanning four continents and 15 data centers.
European cloud for a global audience. Help satisfy your compliance requirements with a European, ISO 27001 certified cloud provider.
We pride ourselves on providing outstanding customer service – 24 hours a day, 365 days a year.
In 2024, we were proud to continue our outstanding streak, responding to initial queries in just 46 seconds!
Simple, transparent pricing with no hidden fees. Focus on growth, knowing exactly what you’ll pay every month. Discover our plans, starting at just $3.5 per month.
Experiment with new technologies, build your servers with full root access, and scale resources on demand—all without worrying about unpredictable costs.
Innovate responsibly. Our Helsinki data center routes excess heat generated from our GPUs directly into the city’s district heating network. This makes our GPU offering in Helsinki one of the most environmentally friendly options on the market, contributing to a greener future while powering your creative endeavors.
Our Helsinki data center is powered by 100% renewables and routes excess heat generated from our GPUs into the city’s district heating network. This makes our GPU offering in Helsinki one of the most environmentally friendly options on the market, contributing to a greener future while powering your AI.
High performance and reliability with no upfront costs or commitments.
By using our AI/ML-ready GPU Ubuntu template, the necessary NVIDIA drivers are pre-installed. If you choose another operating system, you will need to manually install the appropriate NVIDIA drivers and CUDA toolkit to enable GPU functionality.
Multi-GPU servers are equipped with more than one dedicated GPU. All GPUs are exposed to your server via PCIe passthrough, allowing you to utilize them for parallel processing, distributed training, or other multi-GPU workloads. You can verify the available GPUs using the nvidia-smi tool.
Yes, each GPU is dedicated to a single server and is not shared with other customers. This ensures consistent performance and security for your workloads.
Start your 14-day trial today and discover why thousands of businesses rely on UpCloud