IBM Spectrum LSF 10.1 Fix 600874 Readme

 

Abstract

P104472: Fix to resolve the issue of the child mbschd daemon dying unexpectedly when submitting a job array with many individually-specified elements.

 

Description

Readme documentation for IBM Spectrum LSF 10.1 Fix 600874 including installation-related instructions, prerequisites, and co-requisites.

This fix addresses the following issue:

LSF might stop scheduling jobs when submitting large job arrays with several individually-specified elements. These jobs would cause the child mbschd daemon to die unexpectedly. This might occur when an eligible pending jobs limit (ELIGIBLE_PEND_JOBS), and the allocation planner are enabled.

Readme file for: IBM® Spectrum LSF
Product/Component Release: 10.1
Update Name: Fix 600874
PMR/APAR: P104472
Fix ID: lsf-10.1-build600874
Publication date:9 December 2021
Last modified date: 9 December 2021 

Contents:

1.     List of fixes

2.     Download location

3.     Product notifications

4.     Products or components affected

5.     System requirements

6.     Installation and configuration

7.     List of files

8.     Copyright and trademark information

 

1.   List of fixes

P104472

2.   Download Location

Download Fix 600874 from the following location: http://www.ibm.com/eserver/support/fixes

3.    Product Notifications

To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page http://www.ibm.com/support/mynotifications/ on the IBM Support website (http://support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.

4.    Products or components affected

LSF/mbschd

5.   System requirements

linux3.10-glibc2.17-ppc64le

6.   Installation and configuration

6.1 Before installation

Shutdown LSF on all work load manager (WLM) and launch node (LN) hosts.

 

Back up the LSF configuration from the conf ($LSF_ENVDIR) directory and any scripts or binary files that you added to the LSF_SERVERDIR directory (for example, customized elims, esubs, stage in pre-scripts, stage in post scripts).

 

Run rpm -qa| grep lsf to list the currently installed rpm files

 

Use yum erase or rpm -evh to unistall the existing LSF packages.

 

rpm -ev --allmatches -notriggers 'list of filenames as they appear in step 3'


6.2 Installation steps

Download lsf-10.1.0.11-600874.ppc64le_csm.bin package

 

Run lsf-10.1.0.11-600874.ppc64le_csm.bin to extract the RPM files. Accept the license agreement when prompted to continue with the file extraction.
lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-master-10.1.0.11-600874.ppc64le.rpm
lsf-misc-10.1.0.11-600874.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.11.ppc64le.rpm
ibm_smpi-jsm-10.03.01.00rtm5-rh7_20191114.ppc64le.rpm

 

Use rpm -ivh or yum install commands to deploy the common, server, and master RPM packages. The installation is relocatable (--prefix options supported)

 

On the work load manager
rpm -ivh lsf-common-10.1.0.11-600874.ppc64le.rpm lsf-server-10.1.0.11-600874.ppc64le.rpm lsf-master-10.1.0.11-600874.ppc64le.rpm

 

On the launch node
rpm -ivh lsf-common-10.1.0.11-600874.ppc64le.rpm lsf-server-10.1.0.11-600874.ppc64le.rpm

 

Verify that the installation is successful:
rpm -qa | grep lsf
lsf-server-10.1.0.11-600874.ppc64le
lsf-common-10.1.0.11-600874.ppc64le
lsf-master-10.1.0.11-600874.ppc64le


6.3 After installation


Restore previously backed up LSF conf directory and any customized scripts or binaries in LSF_SERVERDIR

 

(Optional) Under LSF_TOP:
Rename work.rpmsave to work

 

Using bash: On each WLM and LN, run the following commands as root:
. /opt/ibm/spectrumcomputing/lsf/conf/profile.lsf
or if you have used the --prefix to define your own LSF_TOP
. /conf/profile.lsf

 

Start up LSF
lsf_daemons start


6.4 Uninstallation


Shut down LSF on all WLM and LN hosts.
lsf_daemons stop

 

Back up the LSF conf directory and any customized scripts or binaries as stated in previous steps.

 

Use yum erase or rpm -evh commands to unistall LSF following the same previous steps.

7.     List of files

lsf-10.1.0.9-600874.ppc64le_csm.bin

 

      The contents of lsf-10.1.0.11-600874.ppc64le_csm.bin:

 

      lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-master-10.1.0.11-600874.ppc64le.rpm
lsf-misc-10.1.0.11-600874.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.11.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm
ibm_smpi-jsm-10.03.01.00rtm5-rh7_20191114.ppc64le.rpm

8.     Copyright and trademark information

© Copyright IBM Corporation 2021

U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp

IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml