IBM Platform LSF 9.1.3 Fix 394627 Readme File

Abstract

P101605. Resolves an issue where jobs pend for a long time with the following pending reason: "Recently released Cray compute node cannot be re-used at this moment".

Description

Readme documentation for IBM Platform LSF 9.1.3 Fix 394627, including installation-related instructions, prerequisites and co-requisites, and a list of fixes.

This fix addresses the following issue:
Cray compute nodes may be unusable for a long period of time, and jobs pend with the following pending reason: "Recently released Cray compute node cannot be re-used at this moment". With this fix, LSF ensures that nodes are no longer reserved by ALPS before it includes them in another reservation request. To do so, LSF keeps the released reservation until a failure is returned or a timeout takes effect. To configure the timeout, set the new parameter LSF_CRAY_RELEASE_TIMEOUT=time_seconds in lsf.conf (default: 600 seconds).
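
As an illustration, the parameter could appear in lsf.conf as shown in the following fragment. The value 300 is an arbitrary example, not a recommendation. If the line is omitted, the default of 600 seconds applies.

```shell
# lsf.conf fragment (illustrative value only, chosen for this example)
# Time in seconds after which the timeout on a released Cray reservation takes effect.
LSF_CRAY_RELEASE_TIMEOUT=300
```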

Readme file for: IBM® Platform LSF

Product/Component Release: 9.1.3

Update Name: Fix 394627

Fix ID: LSF-9.1.3-build394627

Publication date: 24 February 2016

Last modified date: 24 February 2016

Contents:

1.     List of fixes

2.     Download location

3.     Products or components affected

4.     System requirements

5.     Installation and configuration

6.     List of files

7.     Product notifications

8.     Copyright and trademark information

 

1.   List of fixes

P101605

2.   Download location

Download Fix 394627 from the following location: http://www.ibm.com/eserver/support/fixes/

3.   Products or components affected

Affected components include: LSF/sbatchd

 

4.   System requirements

Linux2.6-glibc2.3-x86_64-cray

 

5.   Installation and configuration

 

5.1          Before installation

 

In the following steps, LSF_TOP is the full path to the top-level installation directory of LSF.

1)    Log on to the boot node as root

2)    Run the xtopview command to switch to a shared root file system

3)    Set your environment:

-      For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf

-      For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
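
Before continuing, it can be useful to confirm that the environment was actually loaded, because the installation steps rely on $LSF_ENVDIR. The following is a minimal sketch for sh-compatible shells. The helper name check_lsf_env is hypothetical and is not an LSF command. It only assumes that sourcing profile.lsf sets LSF_ENVDIR and that lsf.conf is located in that directory.

```shell
# Sketch: confirm the LSF environment is loaded before patching.
# check_lsf_env is a hypothetical helper name, not part of LSF.
check_lsf_env() {
    # LSF_ENVDIR is expected to be set after profile.lsf has been sourced.
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is not set; source profile.lsf first" >&2
        return 1
    fi
    # lsf.conf is edited later in the installation steps, so it must exist.
    if [ ! -f "$LSF_ENVDIR/lsf.conf" ]; then
        echo "lsf.conf not found in $LSF_ENVDIR" >&2
        return 1
    fi
    echo "LSF environment OK: $LSF_ENVDIR"
}
```

Run check_lsf_env after sourcing profile.lsf. A non-zero return code indicates that the environment is not ready for the installation steps.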

 

5.2          Installation steps

 

1)    Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/

2)    Copy the patch file to the install directory $LSF_ENVDIR/../9.1/install/

3)    Run patchinstall: ./patchinstall <patch>

4)    Remove or comment out the LSB_CRAYLINUX_NODE_REUSE_INTERVAL parameter in $LSF_ENVDIR/lsf.conf if it is defined.

5)    Optional: Define LSF_CRAY_RELEASE_TIMEOUT=<time_seconds> in $LSF_ENVDIR/lsf.conf.

6)    Exit xtopview
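
Steps 4 and 5 can also be scripted instead of editing lsf.conf by hand. The following is a sketch only, under these assumptions: the helper name update_cray_conf is hypothetical, the .bak backup suffix is an arbitrary choice, and the timeout value is supplied by the caller. It comments out any active LSB_CRAYLINUX_NODE_REUSE_INTERVAL line and, if a timeout is given, writes a single LSF_CRAY_RELEASE_TIMEOUT line.

```shell
# Sketch of steps 4 and 5. update_cray_conf is a hypothetical helper, not part of LSF.
# Usage: update_cray_conf <path-to-lsf.conf> [timeout_seconds]
update_cray_conf() {
    conf=$1
    timeout=${2:-}

    if [ ! -f "$conf" ]; then
        echo "file not found: $conf" >&2
        return 1
    fi
    # Accept only digits for the optional timeout.
    case $timeout in
        *[!0-9]*) echo "timeout must be a whole number of seconds" >&2; return 1 ;;
    esac

    # Keep a backup copy (overwritten on each run), then rewrite the file from it.
    cp -p "$conf" "$conf.bak" || return 1

    awk -v t="$timeout" '
        # Step 4: comment out the parameter that must no longer be defined.
        /^[ \t]*LSB_CRAYLINUX_NODE_REUSE_INTERVAL[ \t]*=/ { print "#" $0; next }
        # Step 5: drop an existing definition so that it is not duplicated.
        t != "" && /^[ \t]*LSF_CRAY_RELEASE_TIMEOUT[ \t]*=/ { next }
        { print }
        END { if (t != "") print "LSF_CRAY_RELEASE_TIMEOUT=" t }
    ' "$conf.bak" > "$conf"
}
```

For example, update_cray_conf "$LSF_ENVDIR/lsf.conf" 300 sets a 300-second timeout, where 300 is an illustrative value. Calling the helper without the second argument performs step 4 only and leaves the default timeout in effect.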

 

5.3          After installation

 

1)    Log on to the LSF master host as root

2)    Run badmin mbdrestart

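3)    Run badmin hrestart <service_nodes> to restart sbatchd on the service nodes, where <service_nodes> is the list of service node host names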
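
The restart sequence above can be previewed before it is run. The following sketch prints the commands by default and runs them only when DRY_RUN=0 is set. The names restart_after_patch, run_or_print, and DRY_RUN are hypothetical, and the service node host names are supplied by the caller.

```shell
# Sketch: print (default) or run the restart commands from section 5.3.
# run_or_print, restart_after_patch and DRY_RUN are hypothetical names, not part of LSF.
run_or_print() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"          # dry run: show the command only
    else
        "$@"               # real run: execute the command
    fi
}

restart_after_patch() {
    if [ "$#" -eq 0 ]; then
        echo "usage: restart_after_patch <service_node> [...]" >&2
        return 1
    fi
    # Restart mbatchd first, then sbatchd on the listed service nodes.
    run_or_print badmin mbdrestart || return 1
    run_or_print badmin hrestart "$@"
}
```

For example, restart_after_patch nid00030 nid00031 prints the two badmin commands for those hosts. The host names are placeholders for this example.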


 

5.4          Uninstallation

 

To roll back a patch:

1)    Perform the steps in section 5.1

2)    Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/

3)    Run ./patchinstall -r <patch>

4)    Exit xtopview

5)    Log on to the LSF master host as root

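6)    Run badmin hrestart <service_nodes> to restart sbatchd on the service nodes, where <service_nodes> is the list of service node host names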


6.   List of files

 

sbatchd

 

7.   Product notifications

To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information that you want to be notified about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.



8.   Copyright and trademark information

© Copyright IBM Corporation 2016

U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.