IBM Spectrum LSF 10.1 Fix Pack 13 (601431) Readme

Abstract

LSF Version 10.1 Fix Pack 13. This Fix Pack includes solutions to issues resolved between 11 June 2021 and 15 April 2022, and the following additional fixes.

Description

Readme documentation for IBM Spectrum LSF 10.1 Fix Pack 13 (601431) including installation-related instructions, prerequisites and co-requisites.

This fix addresses the following additional issues:

ID         Description

P104472    Fix to resolve the issue of the child mbschd daemon dying unexpectedly when submitting a job array with many individually-specified elements

601333     Fix to optimize the implementation of compound resource requirements to avoid heavy searching, improving performance of LSF planner scheduler

For further details on these solutions and other new features in LSF 10.1.0.13, refer to https://www.ibm.com/docs/en/spectrum-lsf/10.1.0?topic=wn-whats-new-in-lsf-101-fix-pack-13

The python package is tested against LSF 10.1.0.13 in this fix.

JSM is updated to 10.4.0.06rtm2 in this fix.

Readme file for: IBM® Spectrum LSF
Product/Component Release: 10.1.0.13
Update Name: Fix 601431
PMR/APAR: not available
Fix ID: lsf-10.1-build601431-csm
Publication date: 3 Mar 2023

Contents:

1.      List of fixes

2.      Download location

3.      Product notifications

4.      Products or components affected

5.      System requirements

6.      Installation and configuration

7.      List of files

8.      Copyright and trademark information

1.         List of fixes


P104472

601333

2.         Download location

Download Fix 601431 from the following location: http://www.ibm.com/eserver/support/fixes/

3.         Product notifications

To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page http://www.ibm.com/support/mynotifications/ on the IBM Support website (http://support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.

4.         Products or components affected

lsf-common-10.1.0.13-601431.ppc64le.rpm
lsf-master-10.1.0.13-601431.ppc64le.rpm
lsf-misc-10.1.0.13-601431.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.13.ppc64le.rpm
lsf-server-10.1.0.13-601431.ppc64le.rpm
ibm_jsm-10.4.0.06rtm2-rh8_20230210.ppc64le.rpm

lsf-conf-10.1.0.13-601431.noarch.rpm
lsf-man-pages-10.1.0.13-601431.noarch.rpm
lsf-client-10.1.0.13-601431.x86_64.rpm
lsf-python2-api-1.0.6-10.1.0.13.x86_64.rpm

5.         System requirements

linux3.10-glibc2.17-ppc64le

6.         Installation and configuration

6.1 Before installation

1. Shut down LSF on all workload manager (WLM) and launch node (LN) hosts.

 

2. Back up the LSF configuration from the conf ($LSF_ENVDIR) directory and any scripts or binary files that you added to the LSF_SERVERDIR directory (for example, customized elims, esubs, stage in pre-scripts, stage out post scripts).

 

3. Run rpm -qa| grep lsf to list the currently installed rpm files.

 

4. Use yum erase or rpm -evh to uninstall the existing LSF packages. For example:
  rpm -ev --allmatches --notriggers 'list of package names as they appear in step 3'
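
Steps 2 to 4 above can be sketched as a few shell helpers. This is a minimal sketch, not part of the fix: the function names and the archive location are illustrative assumptions, and the erase command is only printed so that it can be reviewed before it is run.

```shell
# Archive a directory into a gzip-compressed tarball.
# $1 = directory to back up (for example "$LSF_ENVDIR"), $2 = archive to create.
# The parenthesized body runs in a subshell, so variables stay local.
backup_dir() (
    src=$1
    archive=$2
    [ -d "$src" ] || { echo "not a directory: $src" >&2; exit 1; }
    # -C keeps the paths inside the archive relative to the parent directory.
    tar -czf "$archive" -C "$(dirname "$src")" "$(basename "$src")"
)

# Read package names on stdin (as printed by "rpm -qa") and keep the LSF ones.
filter_lsf_packages() {
    grep 'lsf' || true
}

# Print, without running it, the erase command for the packages on stdin.
print_erase_command() (
    pkgs=$(filter_lsf_packages | tr '\n' ' ')
    pkgs=${pkgs% }
    if [ -n "$pkgs" ]; then
        echo "rpm -ev --allmatches --notriggers $pkgs"
    fi
)
```

For example, backup_dir "$LSF_ENVDIR" /tmp/lsf-conf-backup.tar.gz followed by rpm -qa | print_erase_command shows the command that step 4 would run. The /tmp path is a placeholder.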

6.2 Installation steps

6.2.1 Server Package Instructions

Download the lsf-10.1.0.13-601431.ppc64le_csm.bin package.

Run lsf-10.1.0.13-601431.ppc64le_csm.bin to extract the following RPM files. Accept the license agreement when prompted to continue with the file extraction.
lsf-common-10.1.0.13-601431.ppc64le.rpm
lsf-master-10.1.0.13-601431.ppc64le.rpm
lsf-misc-10.1.0.13-601431.ppc64le.rpm
lsf-server-10.1.0.13-601431.ppc64le.rpm
lsf-python2-api-1.0.6-10.1.0.13.ppc64le.rpm
ibm_jsm-10.4.0.06rtm2-rh8_20230210.ppc64le.rpm


Use the rpm -ivh or yum install command to deploy the common, server, and master RPM packages. The installation is relocatable (the --prefix option is supported).

On the workload manager
rpm -ivh lsf-common-10.1.0.13-601431.ppc64le.rpm lsf-server-10.1.0.13-601431.ppc64le.rpm lsf-master-10.1.0.13-601431.ppc64le.rpm

On the launch node
rpm -ivh lsf-common-10.1.0.13-601431.ppc64le.rpm lsf-server-10.1.0.13-601431.ppc64le.rpm
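
The per-role package lists above could be captured in a small helper so that the same script serves both host types. This is a sketch only; the role keywords "wlm" and "ln" are illustrative assumptions taken from the abbreviations used in this readme, not LSF options.

```shell
# Print the RPM files to install for a host role: "wlm" or "ln".
# The parenthesized body runs in a subshell, so variables stay local.
packages_for_role() (
    version=10.1.0.13-601431
    arch=ppc64le
    case $1 in
        wlm) names="common server master" ;;   # workload manager
        ln)  names="common server" ;;          # launch node
        *)   echo "unknown role: $1" >&2; exit 1 ;;
    esac
    for name in $names; do
        echo "lsf-$name-$version.$arch.rpm"
    done
)
```

A possible use, run from the directory that holds the extracted files, is rpm -ivh $(packages_for_role wlm).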

Verify that the installation is successful:
rpm -qa | grep lsf
lsf-server-10.1.0.13-601431.ppc64le
lsf-common-10.1.0.13-601431.ppc64le
lsf-master-10.1.0.13-601431.ppc64le
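
The verification above can also be automated. The following sketch reads the output of rpm -qa and reports any expected package that is absent; the function name is hypothetical, and matching on a package-name prefix followed by a version digit is an assumption about how the names are printed.

```shell
# Read installed package names on stdin (as printed by "rpm -qa") and print
# every expected package name that is absent.
# Exit status 0 means nothing is missing.
missing_packages() (
    installed=$(cat)
    status=0
    for prefix in "$@"; do
        if ! printf '%s\n' "$installed" | grep -q "^$prefix-[0-9]"; then
            echo "$prefix"
            status=1
        fi
    done
    exit $status
)
```

On the workload manager this might be run as rpm -qa | missing_packages lsf-common lsf-server lsf-master, and on a launch node without lsf-master.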

6.2.2 X86_64 Client Package Instructions

Obtain the lsf-client-10.1.0.13-601431.x86_64.bin package and run it on the x86_64 client host to extract the following RPM packages:
lsf-conf-10.1.0.13-601431.noarch.rpm
lsf-man-pages-10.1.0.13-601431.noarch.rpm
lsf-client-10.1.0.13-601431.x86_64.rpm
lsf-python2-api-1.0.6-10.1.0.13.x86_64.rpm


The installation directory is /opt/ibm/spectrumcomputing/lsf.

If the user lsfadmin does not exist on the host, add it before installing the lsf-conf RPM package.
rpm -ivh ./lsf-conf-10.1.0.13-601431.noarch.rpm ./lsf-man-pages-10.1.0.13-601431.noarch.rpm ./lsf-client-10.1.0.13-601431.x86_64.rpm ./lsf-python2-api-1.0.6-10.1.0.13.x86_64.rpm

Verify that the installation is successful:
rpm -qa | grep lsf
lsf-conf-10.1.0.13-601431.noarch
lsf-man-pages-10.1.0.13-601431.noarch
lsf-client-10.1.0.13-601431.x86_64
lsf-python2-api-1.0.6-10.1.0.13.x86_64
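
The lsfadmin requirement mentioned above can be checked before the install. The sketch below only prints the command that would create the account. Many sites manage accounts centrally, so plain useradd is an assumption, as is the function name.

```shell
# Print the command that would create the account, or nothing if it exists.
# Printing rather than running keeps the sketch free of side effects.
user_creation_command() {
    if ! id -u "$1" >/dev/null 2>&1; then
        echo "useradd $1"
    fi
}
```

Running user_creation_command lsfadmin as part of a pre-install check shows whether the account still has to be created.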


6.3 After installation

6.3.1 Server Package

Restore the previously backed-up LSF conf directory and any customized scripts or binaries in LSF_SERVERDIR.

 

(Optional) Under LSF_TOP, rename work.rpmsave to work.

 

Using bash: On each WLM and LN, run the following commands as root:

. /opt/ibm/spectrumcomputing/lsf/conf/profile.lsf
or, if you used the --prefix option to define your own LSF_TOP:
. LSF_TOP/conf/profile.lsf

Start up LSF

lsf_daemons start
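
The two profile locations above differ only in LSF_TOP, so a start-up script can take LSF_TOP as an optional argument. This is a sketch under that assumption; the function names are hypothetical, and the commands it wraps are the ones given in this section.

```shell
# Print the path of profile.lsf for a given LSF_TOP.
# With no argument, fall back to the default installation directory.
lsf_profile_path() {
    echo "${1:-/opt/ibm/spectrumcomputing/lsf}/conf/profile.lsf"
}

# Source the profile if it is readable, then start the daemons (run as root).
# The profile must be sourced in the current shell, so this function does not
# run in a subshell and the "profile" variable remains set afterwards.
start_lsf() {
    profile=$(lsf_profile_path "$1")
    if [ ! -r "$profile" ]; then
        echo "cannot read $profile" >&2
        return 1
    fi
    . "$profile" && lsf_daemons start
}
```

Calling start_lsf with no argument uses the default directory; start_lsf /your/LSF_TOP covers a relocated installation.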

6.3.2 X86_64 Client Package

Configuring client hosts within your cluster.

Log on to the LSF master host as the user lsfadmin.

Static Client

It is recommended that the x86_64 hosts be configured as static clients.

Edit the LSF_CONFDIR/lsf.cluster.cluster_name file and add host information for the new host to the Host section.

-    Add the name of the host.

-    Add model or type.

o    If you enter the ! keyword in the model and type columns, the host model and type reported by LSF will be UNKNOWN.

o    Alternatively, X86_64 can be specified for type.

-    Specify LSF server or client in the server column:

o    You must specify 0 (zero) to indicate an LSF client-only host.



Begin Host

HOSTNAME       model  type      server  RESOURCES

<< snip >>

clienthost     !      X86_64    0       ()

End Host


It is important that all host names of client hosts added to the cluster are resolvable by the management nodes. Depending on your site's DNS, you can configure a $LSF_ENVDIR/hosts file that includes all official names and IP addresses of the client hosts. If LSF cannot resolve the host name of a client host, it rejects requests that originate from that host.
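
If a hosts file is used, a quick check that every client host has an entry can catch typing errors before the cluster is reconfigured. The sketch below assumes the file follows the /etc/hosts layout (IP address first, then official name and aliases); the function name is hypothetical.

```shell
# Print each host name that has no entry in an /etc/hosts-style file.
# $1 = hosts file, remaining arguments = client host names to look up.
hosts_missing_from_file() (
    file=$1
    shift
    for host in "$@"; do
        # Strip comments, then match the name in any field after the address.
        if ! awk -v h="$host" '
                { sub(/#.*/, "") }
                { for (i = 2; i <= NF; i++) if ($i == h) found = 1 }
                END { exit !found }' "$file"; then
            echo "$host"
        fi
    done
)
```

For example, hosts_missing_from_file "$LSF_ENVDIR/hosts" clienthost prints nothing when clienthost is listed. This checks only the file contents, not whether the management nodes can actually resolve the name.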

Float Client

Alternatively, you could configure the x86_64 hosts as floating clients. You do not add the host information for the new host to the Host section of the LSF_CONFDIR/lsf.cluster.cluster_name file. You must add the parameters FLOAT_CLIENTS_ADDR_RANGE and FLOAT_CLIENTS to the Parameters section of the LSF_CONFDIR/lsf.cluster.cluster_name file.

The FLOAT_CLIENTS parameter sets the maximum number of floating clients allowed in a cluster. If FLOAT_CLIENTS is not specified in lsf.cluster.cluster_name, the floating LSF client feature is disabled. When the LSF floating client feature is enabled, any host can submit jobs to the cluster. You can limit which hosts can be LSF floating clients with the parameter FLOAT_CLIENTS_ADDR_RANGE.

The FLOAT_CLIENTS_ADDR_RANGE parameter specifies an IP address or a range of IP addresses of the domains from which floating client hosts can submit requests.

A floating client can join the cluster only when an LSF command is run from this host and LSF has verified it against FLOAT_CLIENTS (total number of floating clients) and FLOAT_CLIENTS_ADDR_RANGE (IP range). After verification, it is listed by the lshosts command. A floating client expires at 12:00 midnight or when the master LIM is reconfigured or restarted.

Begin Parameters

FLOAT_CLIENTS_ADDR_RANGE= IP_address ...

FLOAT_CLIENTS= number_of_floating_clients

End Parameters
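
To make the template above concrete, the following sketch prints the two parameters with values filled in. The function name is hypothetical, and the address 192.0.2.10 and the count 5 used in the example are placeholders, not recommended values. If the cluster file already has a Parameters section, the two parameter lines would be merged into it rather than added as a second section.

```shell
# Print a Parameters section for floating clients.
# $1 = address or address range, $2 = maximum number of floating clients.
float_client_parameters() (
    range=$1
    count=$2
    # Reject an empty or non-numeric count before printing anything.
    case $count in
        ''|*[!0-9]*) echo "count must be a whole number: $count" >&2; exit 1 ;;
    esac
    printf 'Begin Parameters\n'
    printf 'FLOAT_CLIENTS_ADDR_RANGE=%s\n' "$range"
    printf 'FLOAT_CLIENTS=%s\n' "$count"
    printf 'End Parameters\n'
)
```

For example, float_client_parameters 192.0.2.10 5 prints a section that admits a single address and at most five floating clients.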


Now reconfigure the cluster.

-    Reconfigure lim to enable the new host in the cluster.

lsadmin reconfig


-    Restart mbatchd.

badmin mbdrestart


Check your connectivity

Run the lsid and bhosts commands from a client host. These commands should return the same information as if they were run from a server machine.
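
A small wrapper can run the connectivity commands above and report which ones fail. This sketch checks exit status only; it does not compare the output with what a server host returns, and the function name is hypothetical.

```shell
# Run each command named as an argument, discarding its output, and report
# the result. Returns non-zero if any command failed.
check_commands() (
    status=0
    for cmd in "$@"; do
        if "$cmd" >/dev/null 2>&1; then
            echo "ok: $cmd"
        else
            echo "FAILED: $cmd"
            status=1
        fi
    done
    exit $status
)
```

On a client host this might be called as check_commands lsid bhosts after sourcing the LSF profile.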

Submitting Jobs

When submitting jobs from an x86_64 client host, and your cluster's execution hosts are not x86_64, you should specify the resource requirement -R 'type==any'. This tells LSF to ignore host type when selecting hosts for the job. Note that when submitting a CSM job in easy mode, the resource requirement is automatically set.
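
The rule above can be sketched as a helper that adds the resource requirement only when the client and execution host types differ. The function name is hypothetical, and the command is printed rather than run so it can be inspected first. EXEC_HOST_TYPE in the example below is a placeholder for whatever type your execution hosts report.

```shell
# Print a bsub command line for a job.
# $1 = host type of the submitting client, $2 = host type of the execution
# hosts, remaining arguments = the job command and its arguments.
submit_command() (
    client_type=$1
    exec_type=$2
    shift 2
    if [ "$client_type" = "$exec_type" ]; then
        echo "bsub $*"
    else
        # Types differ: tell LSF to ignore host type when selecting hosts.
        echo "bsub -R 'type==any' $*"
    fi
)
```

For example, submit_command X86_64 EXEC_HOST_TYPE ./job.sh prints a bsub line that includes -R 'type==any'.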

6.4 Uninstallation

6.4.1 Server Package

Shut down LSF on all WLM and LN hosts.
lsf_daemons stop

Back up the LSF conf directory and any customized scripts or binaries as stated in previous steps.

Use the yum erase or rpm -evh command to uninstall LSF, following the same steps as in section 6.1.

6.4.2 X86_64 Client Package

Log on to the x86_64 client host. Use the rpm uninstall command to uninstall the rpm packages.

# rpm -ev --allmatches --notriggers lsf-client-10.1.0.13-601431.x86_64 lsf-man-pages-10.1.0.13-601431.noarch lsf-conf-10.1.0.13-601431.noarch lsf-python2-api-1.0.6-10.1.0.13.x86_64

7.         List of files

     lsf-10.1.0.13-601431.ppc64le_csm.bin
     lsf-client-10.1.0.13-601431.x86_64.bin


The contents of lsf-10.1.0.13-601431.ppc64le_csm.bin:

     lsf-common-10.1.0.13-601431.ppc64le.rpm
     lsf-master-10.1.0.13-601431.ppc64le.rpm
     lsf-misc-10.1.0.13-601431.ppc64le.rpm
     lsf-python2-api-1.0.6-10.1.0.13.ppc64le.rpm
     lsf-server-10.1.0.13-601431.ppc64le.rpm
     ibm_jsm-10.4.0.06rtm2-rh8_20230210.ppc64le.rpm


The contents of lsf-client-10.1.0.13-601431.x86_64.bin:

     lsf-conf-10.1.0.13-601431.noarch.rpm
     lsf-man-pages-10.1.0.13-601431.noarch.rpm
     lsf-client-10.1.0.13-601431.x86_64.rpm
     lsf-python2-api-1.0.6-10.1.0.13.x86_64.rpm

8.         Copyright and trademark information

© Copyright IBM Corporation 2023

U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml