10.3. Viewing Cluster Job Information

The RSM Cluster Load Monitoring application displays a list of jobs that have been recently submitted to the cluster from RSM. For each job, you can see the job ID, number of requested cores, cluster queue, owner, and job status.

Below is a description of each job status:

Status        Description
  Queued

The job has been placed in the cluster queue, and is waiting to run.

  Running

The job is running.

  Cancelled

The job has been terminated via a Cancel or Abort action.

Also applies to jobs that have been aborted because you exited a project without first performing one of the following actions:

  • Saving the project since the update was initiated

  • Saving results retrieved since your last save

  Finished

The job has completed successfully.

Also applies to jobs that have been terminated via the Interrupt option, and to jobs for which you saved results before exiting the project.

  Failed

The job has failed.

May also apply to jobs that cannot be cancelled due to fatal errors.

  Unknown

The job status being reported by the cluster is one that is not recognized by RSM. For example, a cluster may have unique job states such as 'Held' or 'Suspended', which RSM cannot parse.
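
The fallback to Unknown can be pictured as a lookup with a default. The following is a minimal sketch only, not RSM's actual implementation: the names JobStatus, _RECOGNIZED_STATES, and normalize_status are hypothetical, and the raw state strings are assumed for illustration rather than taken from any particular cluster scheduler.

```python
from enum import Enum


class JobStatus(Enum):
    """The six job statuses described in the table above."""
    QUEUED = "Queued"
    RUNNING = "Running"
    CANCELLED = "Cancelled"
    FINISHED = "Finished"
    FAILED = "Failed"
    UNKNOWN = "Unknown"


# Hypothetical mapping from raw cluster state strings to displayed statuses.
# A real cluster reports its own vocabulary; these keys are assumptions.
_RECOGNIZED_STATES = {
    "queued": JobStatus.QUEUED,
    "running": JobStatus.RUNNING,
    "cancelled": JobStatus.CANCELLED,
    "finished": JobStatus.FINISHED,
    "failed": JobStatus.FAILED,
}


def normalize_status(raw_state: str) -> JobStatus:
    """Return the matching status, or UNKNOWN for any unrecognized state."""
    key = raw_state.strip().lower()
    return _RECOGNIZED_STATES.get(key, JobStatus.UNKNOWN)


if __name__ == "__main__":
    for state in ("Running", "Held", "Suspended"):
        print(state, "->", normalize_status(state).value)
```

In this sketch, cluster-specific states such as 'Held' or 'Suspended' are absent from the lookup, so they are reported as Unknown rather than raising an error.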