Graphics accelerators (GPUs)

  • VM configurations
  • OS images
  • Virtual graphics accelerators (vGPUs)
    • Configurations of VMs with vGPUs
  • See also

Yandex Compute Cloud provides graphics accelerators (GPUs) as part of graphics cards. GPUs outperform vCPUs on certain types of data processing, such as highly parallel workloads, and can be used for complex computing.

Compute Cloud uses NVIDIA® Tesla® V100 GPUs with 32 GB HBM2.

The NVIDIA® Tesla® V100 graphics card contains 5120 CUDA® cores for high-performance computing (HPC) and 640 Tensor cores for deep learning (DL) tasks.
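
As a rough illustration of how these per-card figures add up on a multi-GPU VM, the sketch below multiplies them by the number of GPUs. The function name `aggregate_gpu_resources` is hypothetical, and the totals are plain sums of the figures quoted above, not a measure of achievable performance.

```python
# Per-GPU figures quoted in the text for the NVIDIA Tesla V100.
CUDA_CORES_PER_GPU = 5120
TENSOR_CORES_PER_GPU = 640
GPU_MEMORY_GB = 32  # HBM2 memory per GPU


def aggregate_gpu_resources(gpu_count: int) -> dict:
    """Return the summed core counts and GPU memory for a VM with gpu_count GPUs.

    Hypothetical helper for illustration; it only multiplies the per-card
    figures and says nothing about real-world throughput.
    """
    if gpu_count < 1:
        raise ValueError("gpu_count must be at least 1")
    return {
        "cuda_cores": gpu_count * CUDA_CORES_PER_GPU,
        "tensor_cores": gpu_count * TENSOR_CORES_PER_GPU,
        "gpu_memory_gb": gpu_count * GPU_MEMORY_GB,
    }


print(aggregate_gpu_resources(4))
# → {'cuda_cores': 20480, 'tensor_cores': 2560, 'gpu_memory_gb': 128}
```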

Graphics accelerators are also suitable for machine learning (ML), artificial intelligence (AI), and 3D rendering tasks.

You can control a GPU and RAM directly from your VM.

VM configurations

Available configurations of computing resources:

  • Intel Broadwell with NVIDIA® Tesla® V100 (gpu-standard-v1):

    Number of GPUs   Number of vCPUs   RAM, GB
    1                8                 96
    2                16                192
    4                32                384

  • Intel Cascade Lake with NVIDIA® Tesla® V100 (gpu-standard-v2):

    Number of GPUs   Number of vCPUs   RAM, GB
    1                8                 48
    2                16                96
    4                32                192
    8                64                384
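
The configurations above can be treated as a small lookup table. The following sketch encodes them and picks the smallest configuration that meets a set of minimum requirements. The names `GpuConfig`, `GPU_CONFIGS` and `smallest_config` are hypothetical and are not part of any Yandex.Cloud SDK; only the platform IDs and figures are taken from the tables above.

```python
from typing import NamedTuple, Optional


class GpuConfig(NamedTuple):
    platform_id: str
    gpus: int
    vcpus: int
    ram_gb: int


# Values copied from the configuration tables above.
GPU_CONFIGS = [
    GpuConfig("gpu-standard-v1", 1, 8, 96),
    GpuConfig("gpu-standard-v1", 2, 16, 192),
    GpuConfig("gpu-standard-v1", 4, 32, 384),
    GpuConfig("gpu-standard-v2", 1, 8, 48),
    GpuConfig("gpu-standard-v2", 2, 16, 96),
    GpuConfig("gpu-standard-v2", 4, 32, 192),
    GpuConfig("gpu-standard-v2", 8, 64, 384),
]


def smallest_config(
    min_gpus: int = 1,
    min_vcpus: int = 0,
    min_ram_gb: int = 0,
    platform_id: Optional[str] = None,
) -> Optional[GpuConfig]:
    """Return the smallest listed configuration meeting the minimums, or None."""
    candidates = [
        c
        for c in GPU_CONFIGS
        if c.gpus >= min_gpus
        and c.vcpus >= min_vcpus
        and c.ram_gb >= min_ram_gb
        and (platform_id is None or c.platform_id == platform_id)
    ]
    # "Smallest" here means fewest GPUs, then fewest vCPUs, then least RAM.
    return min(candidates, key=lambda c: (c.gpus, c.vcpus, c.ram_gb), default=None)


print(smallest_config(min_gpus=2, min_ram_gb=100))
```

Ordering by GPU count first is a design choice made for this sketch, on the assumption that GPUs dominate the cost of a configuration. Adjust the sort key if another resource matters more to you.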

For more information about VM organizational and technical limits, see Quotas and limits.

OS images

For VMs with GPUs, special Windows (2016 Datacenter GPU, windows-2016-gvlk-gpu) and Ubuntu (16.04 lts GPU, ubuntu-1604-lts-gpu) images are available with NVIDIA drivers pre-installed. To use other images, install the necessary drivers yourself.
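
A minimal sketch of the image choice described above: it maps an OS name to the GPU image identifier given in the text and reports whether NVIDIA drivers would have to be installed manually. The helper name `pick_gpu_image` and the lowercase OS keys are hypothetical, and treating the identifiers in parentheses as the values to pass when selecting an image is an assumption.

```python
from typing import Optional, Tuple

# Identifiers taken from the text; the dictionary keys are hypothetical labels.
GPU_IMAGES = {
    "windows": "windows-2016-gvlk-gpu",
    "ubuntu": "ubuntu-1604-lts-gpu",
}


def pick_gpu_image(os_name: str) -> Tuple[Optional[str], bool]:
    """Return (image_identifier, needs_manual_driver_install).

    If no GPU image is listed for the OS, the identifier is None and the
    drivers have to be installed by hand on whichever image is used.
    """
    image = GPU_IMAGES.get(os_name.strip().lower())
    if image is None:
        return None, True
    return image, False


print(pick_gpu_image("Ubuntu"))   # → ('ubuntu-1604-lts-gpu', False)
print(pick_gpu_image("Debian"))   # → (None, True)
```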

Virtual graphics accelerators (vGPUs)

Yandex Compute Cloud lets you virtualize graphics accelerators (GPUs). Virtual GPUs are created based on NVIDIA® vGPU technology.

With NVIDIA® vGPU software, vGPUs can run both graphics and computing tasks. This requires the appropriate licenses.

To use vGPU technology, you need:

  • A VM running on the vgpu-standard-v1 platform with one of the following images:
    • Ubuntu 18.04 lts vGPU.
    • Windows Server 2019 Datacenter vGPU.
  • A license to use NVIDIA® vGPU technology.
  • An NVIDIA® vGPU Software License Server.
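
The requirements listed above can be expressed as a simple pre-flight check. This is only a sketch: the `VgpuSetup` class and the `missing_prerequisites` function are hypothetical, and the image names are compared as the display names given in the list rather than as real image IDs.

```python
from dataclasses import dataclass
from typing import List, Optional

VGPU_PLATFORM = "vgpu-standard-v1"
# Display names as listed above (not image IDs).
VGPU_IMAGES = {"Ubuntu 18.04 lts vGPU", "Windows Server 2019 Datacenter vGPU"}


@dataclass
class VgpuSetup:
    platform_id: str
    image_name: str
    has_vgpu_license: bool
    license_server_host: Optional[str] = None


def missing_prerequisites(setup: VgpuSetup) -> List[str]:
    """Return a human-readable list of unmet vGPU requirements (empty if none)."""
    problems = []
    if setup.platform_id != VGPU_PLATFORM:
        problems.append(f"VM must run on the {VGPU_PLATFORM} platform")
    if setup.image_name not in VGPU_IMAGES:
        problems.append("VM must use one of the vGPU images")
    if not setup.has_vgpu_license:
        problems.append("a license for NVIDIA vGPU technology is required")
    if not setup.license_server_host:
        problems.append("an NVIDIA vGPU Software License Server is required")
    return problems


setup = VgpuSetup("vgpu-standard-v1", "Ubuntu 18.04 lts vGPU", has_vgpu_license=True)
print(missing_prerequisites(setup))
# → ['an NVIDIA vGPU Software License Server is required']
```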

To work with the license, you can:

  • Use your current license server.
    The current license server must be available over the network from VMs with vGPUs.
  • Create a VM with the NVIDIA® vGPU Software License Server in Yandex.Cloud.
    For information about how to install and configure the license server, see the NVIDIA documentation.
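
If you use an existing license server, one way to confirm that it is reachable from a VM with a vGPU is a plain TCP connection test, sketched below. The function name `license_server_reachable` and the host name in the usage example are hypothetical. The default port 7070 is an assumption based on the usual license request port of the NVIDIA license server, so check your server's configuration. A successful connection shows only that the port is open, not that a license can be obtained.

```python
import socket


def license_server_reachable(host: str, port: int = 7070, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout seconds.

    The default port is an assumption; use the port your license server
    actually listens on.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


if __name__ == "__main__":
    # Hypothetical host name; replace it with your license server's address.
    host = "license-server.example.internal"
    state = "reachable" if license_server_reachable(host) else "not reachable"
    print(f"{host} is {state}")
```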

Configurations of VMs with vGPUs

The following configuration is available for VMs with vGPUs:

  • Intel Broadwell with NVIDIA® vGPU Tesla® V100 8G (vgpu-standard-v1):

    Number of vGPUs   Number of vCPUs   RAM, GB   GPU RAM, GB
    1                 4                 12        8

See also

  • Create a VM with a GPU.
  • Learn how to add a GPU to an existing VM.
  • Learn how to change the number of GPUs.
© 2021 Yandex.Cloud LLC