Say you dispatch thousands of jobs with Slurm, but goofed something up and want to cancel some of those jobs. Here is a bash script to do that. I stole the script from somebody named "cas" on stack exchange. NOTE: if you want to cancel all of your jobs then you can use scancel -u username, where "username" is your username (i.e. jharri62 is my username). This will cancel all of your jobs. Often you may want to keep some jobs running, but cancel others. This situation can be handled via the script below.
For more see: https://www.rc.fas.harvard.edu/resources/documentation/convenient-slurm-commands/
Step-by-step guide
- Make a file to house the program. Put it somewhere convenient, like in your home directory, or in a directory of common scripts in your home directory. One way to make a file is "touch yourfilename"
- Open the file with nano and paste the following into it:
#!/bin/bash
declare -a jobs=()
if [ -z "$1" ] ; then
echo "Minimum Job Number argument is required. Run as '$0 jobnum'"
exit 1
fiminjobnum="$1"
myself="$(id -u -n)"
for j in $(squeue --user="$myself" --noheader --format=%i) ; do
if [ "$j" -gt "$minjobnum" ] ; then
jobs+=($j)
fi
donescancel "${jobs[@]}"
- Make the file executable with chmod "chmod +x yourfile"
- Usage is: "bash yourfile 300000" where 300000 is the base job id number used to delimit where wanted jobs stop. In other words, any of your jobs with an ID smaller than this number will be retained, but jobs with IDs larger than this number will be removed. This will not mess with anybody else's jobs.
Related articles