Knowledge Center Contents Previous Next |
lsb_suspreason()
Explains why a job was suspended.
DESCRIPTION
Using the SBD, lsb_suspreason() explains why system-suspended and user-suspended jobs were suspended.
SYNOPSIS
#include <lsf/lsbatch.h> char *lsb_suspreason(int reasons, int subreasons, struct loadIndexLog *ld) struct loadIndexLog { int nIdx; char **name; };PARAMETERS
reasons
Reasons a job suspends:
SUSP_HOST_LOCK
The LSF administrator has locked the execution host.
SUSP_LOAD_REASON
A load index exceeds its threshold. The subreasons field indicates which indices.
SUSP_MBD_PREEMPT
The job was preempted by mbatchd because of a higher priorty job.
SUSP_QUEUE_WINDOW
The run window of the queue is closed.
SUSP_RESCHED_PREEMPT
Suspended after preemption. The system needs to re-allocate CPU utilization by job priority.
SUSP_SBD_PREEMPT
Preempted by sbatchd. The job limit of the host/user has been reached.
SUSP_USER_RESUME
The job is waiting to be re-scheduled after being resumed by the user.
SUSP_USER_STOP
The user suspended the job.
SUSP_ADMIN_STOP
The job was suspened by root or the LSF administrator.
SUSP_SBD_STARTUP
The job is suspended while the sbatchd is restarting.
SUSP_HOST_LOCK_MASTER
The execution host is locked by the master LIM.
SUSP_QUE_STOP_CONDITION
The suspend conditions of the queue, as specified by the STOP_COND parameter in lsb.queues, are true.
SUSP_QUE_RESUME_CONDITION
The resume conditions of the queue, as specified by the RESUME_COND parameter in lsb.queues, are false.
SUSP_RES_RESERVE
The job is terminated due to resource limit.
SUSP_RES_LIMIT
The job's requirements for resource reservation are not satisfied.
SUSP_PG_IT
The job was suspended due to the paging rate and the host is not idle yet.
SUSP_REASON_RESET
Resets the previous reason.
SUSP_MBD_LOCK
The job is locked by the mbatchd.
SUSP_LOAD_UNAVAIL
Load information on the execution hosts is unavailable.
subreasons
If reasons is SUSP_LOAD_REASON, subreasons indicates the load indices that are out of bounds. The integer values for the load indices are found in lsf.h.
If reasons is SUSP_RES_LIMIT, subreasons indicates the job's requirements for resource reservation are not satisfied. The integer values for the job's requirements for resource reservation are found in lsbatch.h.
Subreasons a job suspends if reasons is SUSP_LOAD_REASON:
R15S
15 second CPU run queue length
R1M
1 minute CPU run queue length
R15M
15 minute CPU run queue length
UT
1 minute CPU utilization
PG
Paging rate
IO
Disk IO rate
LS
Number of log in sessions
IT
Idle time
TMP
Available temporary space
SWP
Available swap space
MEM
Available memory
USR1
USR1 is used to describe unavailable or out of bounds user defined load information of an external dynamic load indice on execution hosts.
USR2
USR2 is used to describe unavailable or out of bounds user defined load information of an external dynamic load indice on execution hosts.
Subreasons a job suspends if reasons is SUSP_RES_LIMIT:
SUB_REASON_RUNLIMIT
The run limit was reached.
SUB_REASON_DEADLINE
The deadline was reached.
SUB_REASON_PROCESSLIMIT
The process limit was reached.
SUB_REASON_CPULIMIT
The CPU limit was reached.
SUB_REASON_MEMLIMIT
The memory limit was reached.
ld
When reasons is SUSP_LOAD_REASON, ld is used to determine the name of any external load indices. ld uses the most recent load index log in the lsb.events file.
The loadIndexLog structure contains the following fields:
nIdx
Number of load indices.
names
Names of load indices.
RETURN VALUES
char: reasons
Returns the suspending reason string.
char:NULL
The function failed. The reason code is bad.
ERRORS
No error handling
SEE ALSO
Related API
lsb_pendreason() - Explains why a job is pending
Equivalent line command
bjobs -s
Environment Variable
LSB_SUSP_REASONS
Files
lsb.queues
lsb.events
Platform Computing Inc.
www.platform.com |
Knowledge Center Contents Previous Next |