Readme file for IBM® Spectrum LSF 10.1 Fix 601300
Abstract
P104005. Fix to introduce a new parameter called MAX_PREEXEC_FORWARDING_RETRY, in the lsb.params configuration file. The parameter controls the maximum times to attempt forwarding a job to a remote cluster, if the job has been rejected due to the pre-execution command of the job.
Description
Readme documentation for IBM Spectrum LSF 10.1 Fix 601300 including installation-related instructions, prerequisites and co-requisites, and list of fixes.
This fix addresses the following issue:
For LSF with multicluster enabled, a job is forwarded to an execution cluster. After reaching the MAX_PREEXEC_RETRY value configured for the remote cluster, the job will be returned to the submission cluster in PEND state. The submission cluster can continually forward the job and LSF is unable to control the maximum number of times to attempt forwarding under this scenario.
This fix introduces a new MAX_PREEXEC_FORWARDING_RETRY parameter for the lsb.params configuration file for the submission cluster. If a job has been forwarded more than the maximum number of times, the submission cluster will take action on the job, based on LOCAL_MAX_PREEXEC_RETRY_ACTION setting. The default action is to suspend the job.
Here are the details of the MAX_PREEXEC_FORWARDING_RETRY parameter:
Syntax
MAX_PREEXEC_FORWARDING_RETRY=integer
Description
MultiCluster job forwarding mode only. At the submission cluster, the maximum number of times to attempt forwarding a job to a remote cluster if the job has been rejected due to the pre-execution command of the job. If the job fails all attempts, the job is suspended or terminated based on the LOCAL_MAX_PREEXEC_RETRY_ACTION setting. If the suspended job is resumed, the counter will be reset.
Valid values
1 < MAX_PREEXEC_FORWARDING_RETRY < 2147483647
Default
2147483647
Readme file for: IBM® Spectrum LSF
Product or component release: 10.1
Update name: Fix 601300
Fix ID: LSF-10.1-build601300
Publication date: 19 September 2022
Contents
1. List of fixes
2. Download location
3. Product or components affected
4. System requirements
5. Installation and configuration
6. List of files
7. Product notifications
8. Copyright and trademark information
1. List of fixes
P104005.
2. Download locations
Download Fix 601300 from the following location: https://www.ibm.com/support/fixcentral
3. Product or components affected
Affected product or components include:
mbatchd
mbschd
ebrokerd
bjobs
bparams
lsbatch.h
lsf.h
libbat.a
libbat.so
liblsf.a
liblsf.so
liblsbstream.a
liblsbstream.so
4. System requirements
linux2.6-glibc2.3-x86_64
linux3.10-glibc2.17-x86_64
5. Installation and configuration
Before you install
LSF_TOP is the full path to the top-level installation directory of LSF.
1. Before you apply this fix, ensure that you installed LSF 10.1 Fix Pack 12 or later. You can download LSF 10.1 Fix Pack 12 or 13 from https://www.ibm.com/support/fixcentral. Search for build600488 (Fix Pack 12) or build601088 (Fix Pack 13).
2. Log on to the LSF management host as the LSF primary administrator.
3.
Set your environment:
- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
Installation steps
1. In submission cluster, log on to the LSF management host as root and set the LSF cluster environment.
2. Go to the install directory: cd $LSF_ENVDIR/../10.1/install/
3. Copy the fix file to the install directory: $LSF_ENVDIR/../10.1/install/.
4. Run: patchinstall: ./patchinstall <fix>
After you install
1. In submission cluster, log on to the LSF management host as the LSF primary administrator and set the LSF cluster environment:
- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
2. Configure MAX_PREEXEC_FORWARDING_RETRY in the lsb.params configuration file.
3. Run: badmin mbdrestart
Uninstallation
1. In submission cluster, log on to the LSF management host as the LSF primary administrator and set the LSF cluster environment.
2. Go to the fix install directory: cd $LSF_ENVDIR/../10.1/install/
3. Run: ./patchinstall -r <patch>
4. Log on to the LSF management host as the LSF primary administrator and set the LSF cluster environment.
5. Remove the MAX_PREEXEC_FORWARDING_RETRY configuration from the lsb.params configuration file.
6. Run: badmin mbdrestart
6. List of files
The following components in all Linux packages:
mbatchd
mbschd
ebrokerd
bjobs
bparams
lsbatch.h
lsf.h
libbat.a
libbat.so
liblsf.a
liblsf.so
liblsbstream.a
liblsbstream.so
7. Product notifications
To receive information about product solution and fix updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.
8. Copyright and Trademark Information
©Copyright IBM Corporation 2022
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
IBM®, the IBM logo, and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.