IBM Platform LSF 9.1.3 Fix 394627 Readme File
Abstract
P101605. Fix to resolve jobs that have a long pending time with the following pending reason: "Recently released Cray compute node cannot be re-used at this moment".
Description
Readme documentation for IBM Platform LSF 9.1.3 Fix 394627 including installation-related instructions, prerequisites and co-requisites, and list of fixes.
This fix addresses the
following issue:
Cray compute nodes may be unusable for a long
period of time with the following pending reason: "Recently
released Cray compute node cannot be re-used at this moment".
LSF ensures that nodes are no longer reserved by ALPS before
including them in another reservation request by keeping the released
reservation until a failure is returned or a timeout takes effect.
The timeout can be configured with a new parameter
LSF_CRAY_RELEASE_TIMEOUT=time_seconds in lsf.conf (600 by default).
Readme file for: IBM® Platform LSF
Product/Component Release: 9.1.3
Update Name: Fix 394627
Fix ID: LSF-9.1.3-build394627
Publication date: 24 February 2016
Last modified date: 24 February 2016
Contents:
1. List of fixes
2. Download location
3. Products or components affected
4. System requirements
5. Installation and configuration
6. List of files
7. Product notifications
8. Copyright and trademark information
1. List of fixes
P101605
2. Download Location
Download Fix 394627 from the following location: http://www.ibm.com/eserver/support/fixes/
3. Products or components affected
Affected components include: LSF/sbatchd
4. System requirements
Linux2.6-glibc2.3-x86_64-cray
5. Installation and configuration
5.1 Before installation
(LSF_TOP=Full path to the top-level installation directory of LSF.)
1) Log on to the boot node as root
2) Run the xtopview command to switch to a shared root file system
3) Set your environment:
- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
5.2 Installation steps
1) Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/
2) Copy the patch file to the install directory $LSF_ENVDIR/../9.1/install/
3) Run patchinstall: ./patchinstall <patch>
4) Remove or comment out the LSB_CRAYLINUX_NODE_REUSE_INTERVAL parameter in $LSF_ENVDIR/lsf.conf if it is defined.
5) Optional: Define LSF_CRAY_RELEASE_TIMEOUT=<time_seconds> in $LSF_ENVDIR/lsf.conf.
6) Exit xtopview
5.3 After installation
1) Log on to the LSF master host as root
2) Run badmin mbdrestart
3) Run badmin hrestart service nodes
5.4 Uninstallation
To roll back a patch:
1) Perform the steps in section 5.1
2) Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/
3) Run ./patchinstall -r <patch>
4) Exit xtopview
5) Log on to the LSF master host as root
6) Run badmin hrestart service nodes
6. List of files
sbatchd
7. Product notifications
To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.
8. Copyright and trademark information
© Copyright IBM Corporation 2016
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.