IBM Spectrum
LSF 10.1 Fix 600874 Readme
Abstract
P104472: Fix to resolve the issue of the child mbschd daemon dying unexpectedly when submitting a job
array with many individually-specified elements.
Description
Readme
documentation for IBM Spectrum LSF 10.1 Fix 600874 including installation-related
instructions, prerequisites, and co-requisites.
This
fix addresses the following issue:
LSF
might stop scheduling jobs when submitting large job arrays with several individually-specified elements. These jobs would cause the
child mbschd daemon to die unexpectedly. This might
occur when an eligible pending jobs limit (ELIGIBLE_PEND_JOBS), and the
allocation planner are enabled.
Readme file for:
IBM® Spectrum LSF
Product/Component Release: 10.1
Update Name: Fix 600874
PMR/APAR: P104472
Fix ID: lsf-10.1-build600874
Publication date:9 December 2021
Last modified date: 9 December 2021
Contents:
1.
List of fixes
2.
Download location
3.
Product notifications
4.
Products or components affected
5.
System requirements
6.
Installation and configuration
7.
List of files
8.
Copyright and trademark information
1. List
of fixes
P104472
2. Download
Location
Download Fix 600874 from the following location: http://www.ibm.com/eserver/support/fixes
3. Product
Notifications
To receive information about product solution and patch updates automatically,
subscribe to product notifications on the My notifications page http://www.ibm.com/support/mynotifications/
on the IBM Support website (http://support.ibm.com). You can edit your subscription
settings to choose the types of information you want to get notification about,
for example, security bulletins, fixes, troubleshooting, and product
enhancements or documentation changes.
4. Products
or components affected
LSF/mbschd
5. System
requirements
linux3.10-glibc2.17-ppc64le
6. Installation
and configuration
6.1 Before
installation
Shutdown LSF on all work load manager (WLM) and launch node (LN) hosts.
Back up the LSF configuration from
the conf ($LSF_ENVDIR) directory and any scripts or binary files that you added
to the LSF_SERVERDIR directory (for example, customized elims,
esubs, stage in pre-scripts, stage in post scripts).
Run rpm -qa|
grep lsf to list the currently installed rpm files
Use yum erase or rpm -evh to unistall the existing LSF
packages.
rpm -ev --allmatches -notriggers 'list of
filenames as they appear in step 3'
6.2 Installation steps
Download lsf-10.1.0.11-600874.ppc64le_csm.bin package
Run lsf-10.1.0.11-600874.ppc64le_csm.bin
to extract the RPM files. Accept the license agreement when prompted to
continue with the file extraction.
lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-master-10.1.0.11-600874.ppc64le.rpm
lsf-misc-10.1.0.11-600874.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.11.ppc64le.rpm
ibm_smpi-jsm-10.03.01.00rtm5-rh7_20191114.ppc64le.rpm
Use rpm -ivh
or yum install commands to deploy the common, server, and master RPM packages.
The installation is relocatable (--prefix options supported)
On the work load
manager
rpm -ivh lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm lsf-master-10.1.0.11-600874.ppc64le.rpm
On the launch node
rpm -ivh lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm
Verify that the installation is
successful:
rpm -qa | grep lsf
lsf-server-10.1.0.11-600874.ppc64le
lsf-common-10.1.0.11-600874.ppc64le
lsf-master-10.1.0.11-600874.ppc64le
6.3 After installation
Restore previously backed up LSF conf directory and any customized scripts or
binaries in LSF_SERVERDIR
(Optional) Under LSF_TOP:
Rename work.rpmsave to work
Using bash: On each WLM and LN, run
the following commands as root:
. /opt/ibm/spectrumcomputing/lsf/conf/profile.lsf
or if you have used the --prefix to define your own LSF_TOP
. /conf/profile.lsf
Start up
LSF
lsf_daemons start
6.4 Uninstallation
Shut down LSF on all WLM and LN hosts.
lsf_daemons stop
Back up the LSF conf directory and
any customized scripts or binaries as stated in previous steps.
Use yum erase or rpm -evh commands to unistall LSF
following the same previous steps.
7. List
of files
lsf-10.1.0.9-600874.ppc64le_csm.bin
The
contents of lsf-10.1.0.11-600874.ppc64le_csm.bin:
lsf-common-10.1.0.11-600874.ppc64le.rpm
lsf-master-10.1.0.11-600874.ppc64le.rpm
lsf-misc-10.1.0.11-600874.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.11.ppc64le.rpm
lsf-server-10.1.0.11-600874.ppc64le.rpm
ibm_smpi-jsm-10.03.01.00rtm5-rh7_20191114.ppc64le.rpm
8. Copyright
and trademark information
© Copyright IBM Corporation 2021
U.S. Government Users Restricted
Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp
IBM®, the IBM logo and ibm.com® are
trademarks of International Business Machines Corp., registered in many
jurisdictions worldwide. Other product and service names might be trademarks of
IBM or other companies. A current list of IBM trademarks is available on the
Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml