Graphics accelerators (GPUs)

Written by

Updated at April 26, 2024

Graphics accelerators (GPUs)
GPU clusters
See also

Compute Cloud provides graphics accelerators (GPUs) for different VM configurations. GPUs outperform CPUs in terms of processing certain types of data and can be used for complex computing. To make problem solving even more efficient and convenient, you can use automatic allocation of resources in Yandex DataSphere.

In Compute Cloud, you can use the following GPUs:

NVIDIA® Tesla® V100 with 32 GB HBM2 (High Bandwidth Memory).
NVIDIA® Ampere® A100 with 80 GB HBM2.
NVIDIA® Tesla® T4 with 16 GB GDDR6.

Warning

GPUs run in TCC mode, which does not use the operating system's graphics drivers.

By default, the cloud has a zero quota for creating VMs with GPUs. You can request a quota increase in the management console. To do this, you need the quota-manager.requestOperator role or higher.

You cannot create VMs with GPUs in the ru-central1-c availability zone. For more information, see Deprecation of the ru-central1-c availability zone.

Graphics accelerators (GPUs)

Graphics accelerators are suitable for machine learning (ML), artificial intelligence (AI), and 3D rendering tasks.

You can manage GPUs and RAM directly from your VM.

NVIDIA® Tesla® V100

The NVIDIA® Tesla® V100 GPU contains 5120 CUDA® cores for high-performance computing (HPC) and 640 Tensor cores for deep learning (DL) tasks.

NVIDIA® Ampere® A100

The NVIDIA® A100 GPU based on the Ampere® microarchitecture uses third-generation Tensor Cores and delivers 80 GB HBM2 memory with up to 2 TB/s throughput.

NVIDIA® Tesla® T4

NVIDIA® Tesla® T4 based on the Turing™ architecture uses Turing tensor cores, as well as the RT cores, and provides 16 GB of GDDR6 memory with a throughput of 300 GB/s.

VM configurations

The computing resources may have the following configurations:

Intel Broadwell with NVIDIA® Tesla® V100 (gpu-standard-v1):

Number of GPUs VRAM, GB Number of vCPUs RAM, GB

1 32 8 96

2 64 16 192

4 128 32 384
Intel Cascade Lake with NVIDIA® Tesla® V100 (gpu-standard-v2):

Number of GPUs VRAM, GB Number of vCPUs RAM, GB

1 32 8 48

2 64 16 96

4 128 32 192

8 256 64 384
AMD EPYC™ with NVIDIA® Ampere® A100 (gpu-standard-v3):

Number of GPUs VRAM, GB Number of vCPUs RAM, GB

1 80 28 119

2 160 56 238

4 320 112 476

8 640 224 952
Intel Ice Lake with NVIDIA® Tesla® T4 (gpu-standard-v3-t4):

Number of GPUs VRAM, GB Number of vCPUs RAM, GB

1 16 4 16

1 16 8 32

1 16 16 64

1 16 32 128

Number of GPUs	VRAM, GB	Number of vCPUs	RAM, GB
1	32	8	96
2	64	16	192
4	128	32	384

Number of GPUs	VRAM, GB	Number of vCPUs	RAM, GB
1	32	8	48
2	64	16	96
4	128	32	192
8	256	64	384

Number of GPUs	VRAM, GB	Number of vCPUs	RAM, GB
1	80	28	119
2	160	56	238
4	320	112	476
8	640	224	952

Number of GPUs	VRAM, GB	Number of vCPUs	RAM, GB
1	16	4	16
1	16	8	32
1	16	16	64
1	16	32	128

VM GPUs are provided in full. For example, if a configuration has four GPUs specified, your VM will have four full-featured GPU devices.

You can create VMs on Intel Broadwell with NVIDIA® Tesla® V100, Intel Cascade Lake with NVIDIA® Tesla® V100, AMD EPYC™ with NVIDIA® Ampere® A100 and Intel Ice Lake with NVIDIA® Tesla® T4 in the ru-central1-a and ru-central1-b availability zones.

For more information about organizational and technical limitations for VMs, see Quotas and limits.

For information about the cost of VMs with GPUs, see VM computing resources.

OS images

For VMs with GPUs, you can use the following special images of operating systems with NVIDIA drivers:

Intel Broadwell with NVIDIA® Tesla® V100 and Intel Cascade Lake with NVIDIA® Tesla® V100

Ubuntu 16.04 LTS GPU (ubuntu-1604-lts-gpu)
Ubuntu 20.04 LTS GPU (ubuntu-2004-lts-gpu)

Intel Ice Lake with NVIDIA® Tesla® T4

Ubuntu 20.04 LTS GPU (ubuntu-2004-lts-gpu)

AMD EPYC™ with NVIDIA® Ampere® A100

Ubuntu 20.04 LTS GPU A100 (ubuntu-2004-lts-a100)
Ubuntu 18.04 LTS GPU A100 (ubuntu-1804-lts-a100)

We recommend using a standard image from Yandex Cloud. You can also install the drivers on another standard image yourself or create a custom image with pre-installed drivers.

GPU clusters

Note

GPU clusters will be publicly available and billable starting September 1, 2023.

You can group multiple VMs running on 8 NVIDIA A100 GPUs into a cluster. This will allow you to accelerate distributed training tasks that require higher computing capacity than individual VMs can provide. Make sure the cluster is created in the same availability zone as its VMs. The cluster VMs are interconnected through InfiniBand, a secure high-speed network.

You can add VMs from different folders, networks, and subnets to your cluster. For the cluster VMs to interact properly, we recommend using a security group that allows unlimited traffic within the group. The default security group meets this requirement. If you edited the default security group, add a group with unlimited internal traffic.

Graphics accelerators (GPUs)

Graphics accelerators (GPUs)Graphics accelerators (GPUs)

NVIDIA® Tesla® V100NVIDIA® Tesla® V100

NVIDIA® Ampere® A100NVIDIA® Ampere® A100

NVIDIA® Tesla® T4NVIDIA® Tesla® T4

VM configurationsVM configurations

OS imagesOS images

GPU clustersGPU clusters

See alsoSee also

Was the article helpful?

Graphics accelerators (GPUs)

NVIDIA® Tesla® V100

NVIDIA® Ampere® A100

NVIDIA® Tesla® T4

VM configurations

OS images

GPU clusters

See also