IBM Platform LSF 9.1.3 Fix 368699 Readme File
Abstract
P101419. This solution introduces the MC_SORT_BY_SUBMIT_TIME parameter, which can be set to Y|y in a multi-cluster environment. When it is set, forwarded jobs on the execution cluster are sorted and run based on their original submission time (instead of their forward time).
Description
Readme documentation for IBM Platform LSF 9.1.3 Fix 368699 including installation-related instructions, prerequisites and co-requisites, and list of fixes.
This fix addresses the following issue:
After a job has been pending for X amount of time, it will be recalled to the submission cluster and then re-forwarded to an execution cluster.
In a multi-cluster environment, jobs forwarded to the execution cluster have always been sorted based on the job forward time.
The problem is that the execution cluster then places the forwarded job at the bottom of the pending jobs list in the order it was forwarded, not in the order that the job was submitted on the submission cluster.
Therefore, other jobs that have not been pending as long can be executed ahead of the recently re-forwarded job.
With the parameter MC_SORT_BY_SUBMIT_TIME set to Y in lsb.params, forwarded jobs are sorted based on the submission time.
When the maximum rescheduled time has been reached and the pending jobs are rescheduled on the execution cluster, they are ordered based on their original submission time (the time when the job was first submitted on the submission cluster) and not the forwarding time (the time when the job was re-forwarded to the execution cluster).
Jobs forwarded to the execution cluster using brequeue -a or brequeue -p are also sorted based on the submission time.
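The difference between the two orderings can be illustrated with a small sketch. This is not LSF code: the job IDs and timestamps below are invented sample data, and the standard sort utility stands in for the scheduler's ordering of pending forwarded jobs. Job 101 was submitted first but was recalled and re-forwarded last.

```shell
# Hypothetical records: job_id  submit_time  forward_time (epoch seconds).
# Job 101 was submitted first but re-forwarded most recently.
job_records='101 1000 5000
102 2000 3000
103 2500 3500'

# Default behaviour: order pending forwarded jobs by forward time (column 3).
by_forward=$(printf '%s\n' "$job_records" | sort -k3,3n | awk '{print $1}' | paste -sd' ' -)

# With MC_SORT_BY_SUBMIT_TIME = Y: order by original submission time (column 2).
by_submit=$(printf '%s\n' "$job_records" | sort -k2,2n | awk '{print $1}' | paste -sd' ' -)

echo "forward-time order: $by_forward"
echo "submit-time order:  $by_submit"
```

In the forward-time ordering the oldest job (101) ends up last; in the submit-time ordering it is first.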
Note: This solution does not change the behaviour of bswitch, bbot or btop.
i. After bswitch, LSF forwards the job based on job switch time instead of the submission time.
ii. Using btop or bbot on the submission cluster does not affect the position of the jobs at the execution cluster or future forwarding.
iii. A user can run btop on a job at the execution cluster, but after the job is recalled and forwarded again, LSF orders the job based on the original submission time and the previous btop position is lost.
Readme file for: IBM® Platform LSF
Product/Component Release: 9.1.3
Update Name: Fix 368699
Fix ID: LSF-9.1.3-build368699
Publication date: 14 October 2015
Last modified date: 14 October 2015
Contents:
1. List of fixes
2. Download location
3. Products or components affected
4. System requirements
5. Installation and configuration
6. List of files
7. Product notifications
8. Copyright and trademark information
1. List of fixes
P101419
2. Download location
Download Fix 368699 from the following location: http://www.ibm.com/eserver/support/fixes/
3. Products or components affected
Affected components include: LSF/bparams, LSF/mbatchd, LSF/mbschd, LSF/mbschd scheduler plugins
4. System requirements
Linux2.6-glibc2.3-x86_64
5. Installation and configuration
5.1 Before installation
(LSF_TOP=Full path to the top-level installation directory of LSF.)
1) Log on to the LSF master host as root
2) Set your environment:
- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
5.2 Installation steps
1) Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/
2) Copy the patch file to the install directory $LSF_ENVDIR/../9.1/install/
3) Run patchinstall: ./patchinstall <patch>
5.3 After installation
1) Log on to the LSF master host as root
2) On the execution cluster, set parameter MC_SORT_BY_SUBMIT_TIME = Y in lsb.params to enable sorting of forwarded jobs based on the submission time.
3) Run badmin mbdrestart
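As a rough sketch of step 2, the parameter is added inside the Parameters section of lsb.params. The example below works on a temporary scratch file rather than a real configuration file; the DEFAULT_QUEUE line is only sample content, and the awk-based insertion is one possible way to make the change, not a method prescribed by this fix. On a real cluster, edit the lsb.params file of the execution cluster and then run badmin mbdrestart as described above.

```shell
# Scratch copy standing in for lsb.params (sample content, not a real config).
params_file=$(mktemp)
cat > "$params_file" <<'EOF'
Begin Parameters
DEFAULT_QUEUE = normal
End Parameters
EOF

# Add the parameter just before "End Parameters" unless it is already present.
if ! grep -q '^[[:space:]]*MC_SORT_BY_SUBMIT_TIME' "$params_file"; then
    awk '/^End Parameters/ { print "MC_SORT_BY_SUBMIT_TIME = Y" } { print }' \
        "$params_file" > "$params_file.new" && mv "$params_file.new" "$params_file"
fi

# Show the resulting Parameters section.
cat "$params_file"
```

Running the snippet a second time against the same file leaves it unchanged, because the grep check skips the insertion when the parameter is already set.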
5.4 Uninstallation
To roll back a patch:
1) Log on to the LSF master host as root
2) Run ./patchinstall -r <patch>
3) Run badmin mbdrestart
6. List of files
bparams
mbatchd
mbschd
all scheduler plugins (schmod_advrsv.so, schmod_affinity.so,
schmod_aps.so, schmod_bluegene.so, schmod_cpuset.so,
schmod_craylinux.so, schmod_crayx1.so, schmod_dc.so,
schmod_default.so, schmod_dist.so, schmod_fairshare.so,
schmod_fcfs.so, schmod_jobweight.so, schmod_limit.so,
schmod_maui.so, schmod_mc.so, schmod_parallel.so,
schmod_preemption.so, schmod_pset.so, schmod_ps.so,
schmod_rms.so, schmod_reserve.so, schmod_xl.so)
7. Product notifications
To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.
8. Copyright and trademark information
© Copyright IBM Corporation 2015
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.