4.3. Running jobs using Slurm#

In order to run jobs on the Esrum cluster, you must connect to the head node (see Connecting to the cluster) and queue them using the Slurm job management system. Slurm takes care of automatically queuing and distribute jobs on the compute and GPU nodes when the required resources are available.

While it is permitted to run small jobs directly on the head node, more resource intensive jobs must be queued using Slurm. We will terminate jobs running on the head node without prior warning, if it is necessary to prevent them from impacting users of the cluster.

The documentation for running jobs is split into three sections:

The Basic Slurm jobs section describes the basic procedure of queuing jobs using the sbatch command, requesting resources, monitoring your jobs, and running an interactive session.
The Advanced Slurm jobs section describes additional ways to monitor your jobs, how to batch multiple, similar jobs using sbatch, how to use srun for executing singular commands, and more.
The Using the GPU/hi-MEM node section describes how to run jobs on the GPU/High-MEM node.