Starting from $1.27/hour
UpCloud’s NVIDIA L40S Cloud GPUs are engineered for the demands of modern AI/ML, offering the performance you need without the hidden fees or vendor lock-in.
Power your projects, from LLM inference to complex machine learning tasks, right from our European cloud.
Build AI without giving up control
Host your AI workloads in Finland, within GDPR-compliant data centers, backed by strong jurisdictional protections. This means your models and data are secured without public cloud exposure.
Innovate responsibly. Our Helsinki data center is powered by 100% renewables and routes excess heat generated from our GPUs into the city’s district heating network. This makes our GPU offering in Helsinki one of the most environmentally friendly options on the market, contributing to a greener future while powering your AI.
We don’t force you into proprietary ML platforms or opaque orchestration layers. Whether you’re fine-tuning LLaMA, deploying open-source LLMs, or building on PyTorch, our infrastructure supports your chosen open-source tools without restriction.
Seamless Integration
Pair your GPU Servers with our high-performance 5th-gen AMD EPYC Cloud Servers for a complete, balanced architecture that handles both massively parallel jobs and general-purpose tasks without compromise.
Fuel your workloads with our high-speed Managed Object Storage, ideal for storing immutable files like model checkpoints and large datasets. It supports stateless and concurrent access from any number of hosts, with built-in object versioning.
Coming soon: our Managed Kubernetes service will let you run GPU and non-GPU workloads within the same cluster. Benefit from autoscaling support for your inference layer, easy multi-model routing (e.g., for A/B testing models), and simplified observability.
Forget unpredictable network bills at the end of the month. With zero-cost egress, you’ll never see a surprise bill for transfer usage.
You never pay for network transfer, even when you scale up, so you can redirect the savings towards accelerating your business growth.
Spin up NVIDIA L40S GPU resources directly from your UpCloud Hub when you need them. Go live today without talking to sales or signing 12-month agreements.
UpCloud’s flexible model supports agile development cycles, allowing you to scale your GPU resources up or down based on actual demand. This freedom from fixed billing and pre-allocated GPU blocks makes it ideal for startups, research teams, and project-based work.
Experience the power of NVIDIA L40S GPUs!
Our sustainable infrastructure collects the waste heat generated by the servers and feeds it into the district heating network, warming local homes.
Location: Helsinki, Finland
Processor: 8 vCPUs AMD EPYC 9575F
Memory: 64 GB DDR5 RAM
Price: from $1.27 / hour
Skip the API fees and data privacy concerns. Our new tutorial shows you how to spin up an UpCloud GPU and run powerful open-weight models like Mistral-7B with Ollama. Go from deployment to inference on your own private, high-performance server.
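Once a model is being served on your GPU server, inference is a plain HTTP call. The snippet below is a minimal sketch and not the tutorial's own code. It assumes Ollama is listening on its default port 11434 on the same machine and that Mistral-7B was pulled under the tag `mistral`. The helper names `build_request` and `generate` are illustrative.

```python
import json
import urllib.request

# ASSUMPTION: Ollama runs on this server on its default port (11434).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="mistral", url=OLLAMA_URL):
    """Build a POST request for Ollama's /api/generate endpoint.

    ASSUMPTION: the model tag "mistral" has already been pulled.
    """
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="mistral", url=OLLAMA_URL, timeout=120):
    """Send the prompt and return the generated text as a string."""
    request = build_request(prompt, model, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # With "stream": False, Ollama returns one JSON object whose
        # "response" field holds the full completion.
        return json.loads(response.read())["response"]
```

Calling `generate("Summarise GDPR in one sentence.")` on the server returns the model's answer as a string. Prompts and completions stay on your own machine, since the request never leaves localhost.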
GPUs: 1 – 3 per server
CPU cores: 8 – 32
RAM: 64 – 384 GB
| GPU | CPU cores | RAM | Price per hour | Price per month |
|---|---|---|---|---|
| 1 x NVIDIA L40S | 8 cores | 64 GB | $1.267 | $851 |
| 1 x NVIDIA L40S | 12 cores | 128 GB | $1.425 | $958 |
| 1 x NVIDIA L40S | 16 cores | 192 GB | $1.742 | $1170 |
| 1 x NVIDIA L40S | 20 cores | 256 GB | $2.058 | $1400 |
| 2 x NVIDIA L40S | 12 cores | 128 GB | $2.375 | $1596 |
| 2 x NVIDIA L40S | 16 cores | 192 GB | $3.008 | $2022 |
| 2 x NVIDIA L40S | 20 cores | 256 GB | $3.642 | $2447 |
| 2 x NVIDIA L40S | 32 cores | 384 GB | $4.275 | $2873 |
| 3 x NVIDIA L40S | 16 cores | 192 GB | $4.275 | $2873 |
| 3 x NVIDIA L40S | 20 cores | 256 GB | $4.908 | $3298 |
| 3 x NVIDIA L40S | 32 cores | 384 GB | $5.542 | $3724 |
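To estimate what a short-lived job or a part-time inference server might cost, you can plug the hourly and monthly prices from the table into a few lines of Python. This is a rough sketch only. It assumes usage accrues per hour and is capped at the listed monthly price, which the table does not state, so check the billing terms before relying on it. The names `PLANS`, `estimate_cost` and `break_even_hours` are illustrative.

```python
# Three rows copied from the pricing table, keyed by (GPUs, CPU cores, RAM in GB).
# Values are (USD per hour, USD per month).
PLANS = {
    (1, 8, 64): (1.267, 851),
    (2, 12, 128): (2.375, 1596),
    (3, 16, 192): (4.275, 2873),
}


def estimate_cost(plan, hours):
    """Estimate the cost in USD of running `plan` for `hours` within one month.

    ASSUMPTION: hourly billing is capped at the monthly price.
    """
    hourly, monthly_cap = PLANS[plan]
    return min(hourly * hours, monthly_cap)


def break_even_hours(plan):
    """Hours of use at which hourly charges reach the monthly price."""
    hourly, monthly_cap = PLANS[plan]
    return monthly_cap / hourly
```

Under that assumption, `estimate_cost((1, 8, 64), 100)` comes to roughly $127 for a 100-hour fine-tuning run on the smallest plan, and `break_even_hours((1, 8, 64))` shows hourly charges reaching the monthly price after about 672 hours.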
Unlock powerful, private, and cost-efficient GPU computing for your AI/ML projects today.