Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

To use GPUs on Koa a user needs to specify an additional option when compared to the basic srun (interactive) or sbatch (batch) job.  By default, the scheduler will not allocate GPUs to a given job when the nodes are configured to be shared.  This is to prevent the potential of other users on the same node from utilizing resources they did not request.

The additional option needed to request resources is the --gres option.  Each possible gres value takes the form of name[[:type]:count].  This means, the only part of the value that is required is the the name, but at least specifying the count may also be desirable so that you are explicit on the number of that resources you require.   For example all of the following would be valid gres options

  • gpu

  • gpu:1

  • gpu:NV-A30:1 

gres also has a special value of help which when used will print out a list of all valid gres values.

As gres is explicitly used for selecting GPUs on Koa, this option is only valid when using either the gpu, kill-shared and  kill-exclusive partitions, or private partitions for lab that have purchased GPU nodes.

...

Table of Contents
maxLevel4


...

Available Gres Values 

This list can also be created by utilizing the sinfo command and viewing the gres column in the output

Code Block
sinfo -p gpu,kill-shared,kill-exclusive | grep gpu

gpu:NV-V100-SXM2
gpu:NV-RTX5000
gpu:NV-A30
gpu:NV-RTX-A4000
gpu:NV-RTX2080Ti
gpu:NV-RTX2070

...