Below you will find pages that utilize the taxonomy term “slurm”
Blogs
How to handle python job cancelation in Slurm job manager
If you use Slurm job manager to run jobs on shared cluster, it often occurs that your job reaches the time limit and is terminated by Slurm. To allow a user to deal with the job termination, Slurm does this in two stages: first, the job receives SIGTERM signal that indicates that the job will be killed soon, and then the job receives SIGKILL signal that actually kills it. The time interval between these two signals is specified via Slurm’s configuration parameter KillWait.