IBM Platform LSF 9.1.3 Fix 491808 Readme File

Abstract

P102583. Fix to ensure that GPU jobs can run successfully on Linux2.6-glibc2.3-x86_64.

Description

Readme documentation for IBM Platform LSF 9.1.3 Fix 491808, including installation-related instructions, prerequisites and co-requisites, and a list of fixes.

This fix addresses the following issue:

When cgroup enforcement is enabled for GPUs, jobs that request all or most of the GPUs on a host (2 of 2 available GPUs, or 3 or 4 of 4 available GPUs) are usually terminated. The failure rate is close to 100%.

Readme File for: IBM® Platform LSF

Product/Component Release: 9.1.3

Update Name: Fix 491808

Fix ID: LSF-9.1.3-build491808

Publication Date: 28 May 2018

Last Modified Date: 28 May 2018


Contents

1. List of Fixes

2. Download Location

3. Product or Components Affected

4. System Requirements

5. Installation and Configuration

6. List of Files

7. Product Notifications

8. Copyright and Trademark Information


1. List of Fixes

P102583


2. Download Location

Download Fix 491808 from the following location: http://www.ibm.com/eserver/support/fixes/


3. Product or Components Affected

Affected product or components include:

LSF/sbatchd


4. System Requirements

Linux2.6-glibc2.3-x86_64


5. Installation and Configuration

5.1 Before installation

(LSF_TOP=Full path to the top-level installation directory of LSF.)

1) Log on to the LSF master host as root

2) Set your environment:

- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf

- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
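
Before continuing, it can help to confirm that the environment was actually loaded. The following is a minimal sketch for sh, ksh, or bash. It assumes that sourcing profile.lsf sets LSF_ENVDIR, which the installation steps below rely on. The function name check_lsf_env is hypothetical and is not part of LSF.

```shell
#!/bin/sh
# Sketch only: check_lsf_env is a hypothetical helper, not an LSF command.
# It reports the patch install directory used in section 5.2, or fails
# if the LSF environment has not been sourced in this shell.
check_lsf_env() {
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is not set; source profile.lsf or cshrc.lsf first" >&2
        return 1
    fi
    echo "patch install directory: $LSF_ENVDIR/../9.1/install"
}
```

Run check_lsf_env after sourcing the profile. A non-zero return status means that the environment is not set in the current shell.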

5.2 Installation steps

1) Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/

2) Copy the patch file to the install directory $LSF_ENVDIR/../9.1/install/

3) Run badmin hclose all

4) Run badmin qinact all

5) Run patchinstall: ./patchinstall <patch>
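
The installation steps above can be sketched as a single shell function. This is an illustration under stated assumptions, not a supported installer: install_fix, run, and DRY_RUN are hypothetical names, and the patch file path is supplied by the caller. The sketch copies the patch file before changing directory so that a relative path still resolves. It defaults to a dry run that only prints each command.

```shell
#!/bin/sh
# Sketch of section 5.2. install_fix, run, and DRY_RUN are hypothetical
# names introduced for illustration; only badmin and patchinstall come
# from the readme.
DRY_RUN="${DRY_RUN:-1}"    # 1 = print commands only, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

install_fix() {
    patch_file="$1"        # path to the downloaded patch file
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is not set" >&2
        return 1
    fi
    install_dir="$LSF_ENVDIR/../9.1/install"

    # Stop at the first step that fails.
    run cp "$patch_file" "$install_dir/" || return 1
    run cd "$install_dir" || return 1
    run badmin hclose all || return 1
    run badmin qinact all || return 1
    run ./patchinstall "$(basename "$patch_file")"
}
```

Call install_fix with the path to the patch file to review the printed commands, then set DRY_RUN=0 and call it again as root to execute them.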

5.3 After installation

1) Log on to the LSF master host as root

2) Run badmin hrestart all

3) Run badmin hopen all

4) Run badmin qact all

5.4 Uninstallation

1) Log on to the LSF master host as root

2) Run badmin hclose all

3) Run badmin qinact all

4) Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/

5) Run ./patchinstall -r <patch>

6) Run badmin hrestart all

7) Run badmin hopen all

8) Run badmin qact all
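
The uninstallation steps above follow a close, roll back, reopen pattern, which can be sketched as one chained sequence. The names uninstall_fix, run, and DRY_RUN are hypothetical, and the patch name is supplied by the caller. The function body runs in a subshell so that the directory change does not affect the calling shell, and the && chain stops at the first step that fails.

```shell
#!/bin/sh
# Sketch of section 5.4. uninstall_fix, run, and DRY_RUN are hypothetical
# names; the commands themselves are the ones listed in the readme.
DRY_RUN="${DRY_RUN:-1}"    # 1 = print commands only, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Parentheses make the body a subshell, so cd stays local to it.
uninstall_fix() (
    patch="$1"             # patch name as passed to patchinstall -r
    run badmin hclose all &&
    run badmin qinact all &&
    run cd "$LSF_ENVDIR/../9.1/install/" &&
    run ./patchinstall -r "$patch" &&
    run badmin hrestart all &&
    run badmin hopen all &&
    run badmin qact all
)
```

If a step fails when DRY_RUN=0, the remaining steps are skipped. Hosts and queues can then still be closed or inactive, so reopen them manually after resolving the failure.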


6. List of Files

sbatchd


7. Product Notifications

To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information that you want to be notified about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.



8. Copyright and Trademark Information

©Copyright IBM Corporation 2018


U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

IBM®, the IBM logo, and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.