Readme file for IBM® Spectrum LSF 10.1 Fix 601649
Abstract
P104921. This fix adds a new parameter LSB_RC_EXTERNAL_HOST_ABNORMAL_TIME to lsf.conf that controls the number of minutes that LSF waits before timing out LSF resource connector hosts that are in an abnormal status (that is, closed_LIM, unavail, or unreach status).
Description
Readme documentation for IBM Spectrum LSF 10.1 Fix 601649, including installation-related instructions, prerequisites and co-requisites, and list of fixes.
This fix addresses the following issue:
This fix adds a new parameter LSB_RC_EXTERNAL_HOST_ABNORMAL_TIME to lsf.conf that controls the number of minutes that LSF waits before timing out LSF resource connector hosts that are in an abnormal status (that is, closed_LIM, unavail, or unreach status).
Syntax
LSB_RC_EXTERNAL_HOST_ABNORMAL_TIME=soft_timeout_integer:hard_timeout_integer
Description
Specifies the soft and hard timeout values (in minutes) that determine how LSF behaves when an LSF resource connector host remains in an abnormal status. Specify integers for the values and separate the values with a colon (:).
The value before the colon is the soft timeout. When a host reaches its soft timeout and there are still jobs on it, the mbatchd daemon logs a warning message. Cluster administrators can monitor this log message and take immediate action, such as checking and restarting the LSF daemons on the host. If there are no jobs on the host when the host reaches the soft timeout, LSF relinquishes the host. The default soft timeout is 10 minutes.
The value after the colon is the hard timeout. When a host reaches its hard timeout, LSF relinquishes the host, and jobs on the host are requeued or exit. The default hard timeout is 60 minutes.
Default:
10:60
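To make the soft and hard timeout rules concrete, the following shell sketch parses a value in the soft:hard form and prints the outcome that the description above gives for a host. It is an illustrative model only, not LSF code: the function name rc_host_action, its arguments, and the action labels are hypothetical, and it assumes that a host "reaches" a timeout when its time in abnormal status is greater than or equal to the limit.

```shell
#!/bin/sh
# Illustrative model of the documented timeout rules (not LSF source code).
# Usage: rc_host_action <soft:hard> <minutes_in_abnormal_status> <jobs_on_host>
rc_host_action() {
    setting=$1
    minutes=$2
    jobs=$3

    # The value must be two integers separated by one colon, for example 10:60.
    case $setting in
        *[!0-9:]* | *:*:* | :* | *: ) echo "invalid"; return 1 ;;
        *:* ) ;;
        * ) echo "invalid"; return 1 ;;
    esac
    soft=${setting%%:*}   # value before the colon
    hard=${setting##*:}   # value after the colon

    if [ "$minutes" -ge "$hard" ]; then
        # Hard timeout: host is relinquished; jobs are requeued or exit.
        echo "relinquish_hard"
    elif [ "$minutes" -ge "$soft" ]; then
        if [ "$jobs" -gt 0 ]; then
            # Soft timeout with jobs still on the host: warning is logged.
            echo "warn"
        else
            # Soft timeout with no jobs on the host: host is relinquished.
            echo "relinquish_soft"
        fi
    else
        # Neither limit reached yet.
        echo "wait"
    fi
}

rc_host_action 10:60 5 3    # prints "wait"
rc_host_action 10:60 12 3   # prints "warn"
rc_host_action 10:60 12 0   # prints "relinquish_soft"
rc_host_action 10:60 60 3   # prints "relinquish_hard"
```

With the default 10:60, a host that still runs jobs only produces a warning between minute 10 and minute 60, which gives the administrator that window to repair the LSF daemons before the host is relinquished.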
Readme file for: IBM® Spectrum LSF
Product or component release: 10.1
Update name: Fix 601649
Fix ID: LSF-10.1-build601649
Publication date: 31 August 2023
Contents
1. List of fixes
2. Download location
3. Product or components affected
4. System requirements
5. Installation and configuration
6. List of files
7. Product notifications
8. Copyright and trademark information
1. List of fixes
P104921
2. Download location
Download Fix 601649 from the following location: https://www.ibm.com/support/fixcentral
3. Product or components affected
Affected product or components include:
LSF/mbatchd
LSF/mbschd
LSF/ebrokerd
LSF/bjobs
LSF/bhosts
LSF/bparams
4. System requirements
linux2.6-glibc2.3-x86_64
linux3.10-glibc2.17-x86_64
linux3.12-glibc2.17-armv8
5. Installation and configuration
Before you install
LSF_TOP is the full path to the top-level installation directory of LSF.
1. Before you apply this fix, ensure that you have installed LSF 10.1 Fix Pack 12 or later. You can download LSF 10.1 Fix Pack 12 from https://www.ibm.com/support/fixcentral. Search for build600488. Contact IBM LSF Support if you have any questions or problems with installing Fix Pack 12.
2. Starting in LSF 10.1 Fix Pack 13, the default values of the following three GPU parameters are changed to:
LSF_GPU_AUTOCONFIG=Y
LSB_GPU_NEW_SYNTAX=extend
LSF_GPU_RESOURCE_IGNORE=Y
If you have Fix Pack 13 installed, any of these GPU parameters that are not configured in the lsf.conf configuration file take the new default values. Parameters that are already configured in the lsf.conf file are not affected.
If you want to keep the former GPU behavior and any of the three parameters are missing from the lsf.conf configuration file, you must explicitly configure the following settings, which were the defaults in Fix Pack 12 and earlier:
LSF_GPU_AUTOCONFIG=N
LSB_GPU_NEW_SYNTAX=N
LSF_GPU_RESOURCE_IGNORE=N
3. Log on to the LSF management host as the LSF primary administrator.
4. Set your environment:
-For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
-For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
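For step 2 above, it can help to see which of the three GPU parameters are already set before you decide whether to configure the former defaults. The following shell sketch prints the parameters that are missing from a given lsf.conf file. The function name missing_gpu_params is hypothetical, and the sketch assumes that each configured parameter appears on its own uncommented line in NAME=value form.

```shell
#!/bin/sh
# Print each of the three GPU parameters that is not set in the given lsf.conf.
# Usage: missing_gpu_params /path/to/lsf.conf
missing_gpu_params() {
    conf_file=$1
    for param in LSF_GPU_AUTOCONFIG LSB_GPU_NEW_SYNTAX LSF_GPU_RESOURCE_IGNORE; do
        # Match only uncommented lines of the form NAME=value.
        if ! grep -q "^[[:space:]]*${param}=" "$conf_file"; then
            echo "$param"
        fi
    done
}

# Example (assumes the LSF environment is set, so that LSF_ENVDIR is defined):
# missing_gpu_params "$LSF_ENVDIR/lsf.conf"
```

Any parameter that the function prints takes the new default value when Fix Pack 13 is installed, so those are the ones to add to lsf.conf if you want the former behavior.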
Installation steps
1. Log on to the LSF management host as the root user and set your environment:
-For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
-For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
2. Go to the install directory: cd $LSF_ENVDIR/../10.1/install/
3. Copy the fix file to the install directory: $LSF_ENVDIR/../10.1/install/
4. Run patchinstall: ./patchinstall <fix>
After you install
1. Log on to the LSF management host as the LSF primary administrator and set your environment:
-For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
-For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
2. Run badmin mbdrestart -s
Uninstallation
1. Log on to the LSF management host as the root user and set your environment:
-For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
-For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
2. Go to the install directory: cd $LSF_ENVDIR/../10.1/install/
3. Run ./patchinstall -r <fix>
4. Log on to the LSF management host as the LSF primary administrator and set your environment:
-For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
-For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
5. Run badmin mbdrestart -s
6. List of files
This fix includes the following components in all Linux and UNIX packages:
LSF/mbatchd
LSF/mbschd
LSF/ebrokerd
LSF/bjobs
LSF/bhosts
LSF/bparams
7. Product notifications
To receive information about product solution and fix updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information that you want to be notified about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.
8. Copyright and trademark information
© Copyright IBM Corporation 2023
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
IBM®, the IBM logo, and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.