IBM Platform LSF 9.1.3 Fix Pack 10 (495433) Readme File

Abstract 

LSF Version 9.1.3 Fix Pack 10. This Fix Pack includes all fixed issues and solutions included in previous LSF Version 9.1.3 Fix Packs and addresses new issues fixed between 14 August 2017 and 22 June 2018. For detailed descriptions of the issues and solutions in this Fix Pack, refer to the LSF 9.1.3 Fix Pack 10 Fixed Bugs List (lsf9.1.3.10_fixed_bugs.pdf can be downloaded from Fix Central via fix ID lsf-9.1.3.10-spk-2018-Jun-build495433). 

Description

Readme documentation for IBM Platform LSF 9.1.3 Fix Pack 10 (495433), including installation-related instructions, prerequisites and co-requisites, and a list of fixes.

The new issues addressed in LSF Version 9.1.3 Fix Pack 10:

ID

Fixed Date

Description

P102587

2018/05/23

The bhist and bacct commands show a CPU time that is longer than the actual amount for a job when its pre-execution command fails.

P102583

2018/05/28

When cgroup enforcement is enabled for GPUs, jobs that require most or all of the available GPUs (2 out of 2 available GPUs, or 3 to 4 out of 4 available GPUs) are often terminated. The failure rate is close to 100%.

P102447

2017/12/07

The "lim -t" command might show incorrect types and cores on some new processors. This fix resolves these issues by updating to a later version of the hardware locality (hwloc) library.

P102437

2017/12/06

Ensure that jobs can be cleaned even if the first execution host was in unreach status but later came back to ok status.

P102418

2017/11/06

When a job is running on an execution host that is in unreach status and the job RES process exits successfully, if a new program reuses the pid of the job RES process, LSF might set the wrong status for the job once the host comes back to ok status.

P102415

2017/11/07

Resolves an issue with the badmin reconfig command being slow because the mbatchd daemon is attempting to resolve compute unit names.

P102414

2017/11/07

When running the bkill command to kill a parallel job that has multiple running processes, there is a delay between the time stamp of the KILL signal and the time stamp of the EXIT message in the job's bhist output.

P102338

2017/08/25

Allows the NIOS process to exit and prevents the CPU usage from being at 100% if a job that was submitted using the bsub -XF command is automatically requeued.

P102333

2017/08/25

When a parallel job specifies alternative resource requirements, such as "rusage[resA || resB || resC...]", and only some of the specified resources are sufficient, the job is pending because of insufficient slots, but LSF reports the wrong pending reason.
To fix this issue, the new parameter LSB_ALT_RUSAGE_REASON is introduced in lsf.conf.

Syntax
   LSB_ALT_RUSAGE_REASON=Y|N|y|n

Description
When this parameter is set to Y, jobs with alternative resource requirements (separated by the "||" operator, such as "rusage[resA || resB || resC || ...]") report a pending reason that relates to the resource requirement that cannot be satisfied.

Different pending reasons are reported depending on the kind of resource that is insufficient:

   - All resources specified in rusage are insufficient and all are shared resources. The pending reason is set to the last specified resource in rusage.
   - All resources are shared resources and some of the resources specified in rusage are sufficient for parallel jobs. The job is pending because of insufficient slots. The pending reason is set to slots.
   - All resources are insufficient and the rusage specifies built-in resources. The pending reason is set to the first insufficient built-in resource.

After configuring this parameter, run badmin reconfig for the parameter settings to take effect.

Default
   N
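
Example
As a minimal illustration of the syntax above, the following lsf.conf fragment turns the parameter on. The value Y is just one of the documented choices, and the trailing comment repeats the reconfiguration command named in the description.

```
# lsf.conf: report the pending reason for the alternative rusage
# resource that cannot be satisfied (default is N)
LSB_ALT_RUSAGE_REASON=Y

# then, on the LSF master host:
#   badmin reconfig
```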

P102330

2017/08/14

The duration of memory resources is not ignored on the hosts in the guarantee resource pool that do not have guaranteed memory resources.

159805

2017/08/14

Correct the bhist message of MC job status.

 

Readme file for: IBM® Platform LSF

Product/Component Release: 9.1.3 

Update Name: Fix 495433 

Fix ID: lsf-9.1.3.10-spk-2018-Jun-build495433

Publication date: 06 July 2018

Last modified date: 06 July 2018 


Contents:


1.     List of fixes 

2.     Download location 

3.     Products or components affected

4.     System requirements 

5.     Installation and configuration

6.     List of files

7.     Product notifications

8.     Copyright and trademark information

 

1.   List of fixes

P102328, 159805, P102587, P102583, P102447, P102437, P102418, P102415, P102414, P102338, P102333, P102330, P102278

2.   Download Location

Download Fix 495433 from the following location: http://www.ibm.com/eserver/support/fixes/

3.   Products or components affected

Components affected by the new issues addressed in LSF Version 9.1.3 Fix Pack 10 include:

LSF/lim
LSF/res
LSF/sbatchd
LSF/mbatchd
LSF/mbschd
LSF/nios
LSF/bhist
LSF/libbat.a
LSF/libbat.so
LSF/liblsf.a
LSF/liblsf.so
LSF/lsf.h
LSF/lsbatch.h
LSF/schmod_dist.so
LSF/schmod_maui.so
LSF/schmod_xl.so

 

4.   System requirements

Linux2.6-glibc2.3-x86_64
Linux3.10-glibc2.17-ppc64le

 

5.   Installation and configuration

 

5.1          Before installation

 

 LSF_TOP=Full path to the top-level installation directory of LSF.

1)    Log on to the LSF master host as root

2)    Set your environment:

-      For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf

-      For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
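
For sh, ksh, or bash, the environment step can be wrapped in a small check so that a wrong LSF_TOP is caught before any patch work starts. This is only a sketch: load_lsf_env is a hypothetical helper name, not an LSF command, and it assumes that profile.lsf sets LSF_ENVDIR, which the installation steps rely on.

```shell
#!/bin/sh
# load_lsf_env LSF_TOP
# Hypothetical helper: sources LSF_TOP/conf/profile.lsf into the current
# shell and confirms that LSF_ENVDIR is set afterwards.
load_lsf_env() {
    lsf_top=$1
    profile="$lsf_top/conf/profile.lsf"

    # Fail early if the profile is missing or unreadable.
    if [ ! -r "$profile" ]; then
        echo "cannot read $profile" >&2
        return 1
    fi

    . "$profile" || return 1

    # Assumption: profile.lsf exports LSF_ENVDIR.
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is still unset after sourcing $profile" >&2
        return 1
    fi
    echo "LSF_ENVDIR=$LSF_ENVDIR"
}
```

Call it as load_lsf_env followed by the full path that you use for LSF_TOP. Because the helper is a shell function, the sourced variables stay in the calling shell.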

 

5.2          Installation steps

 

1)    Go to the patch install directory: cd $LSF_ENVDIR/../9.1/install/

2)    Copy the patch file to the install directory $LSF_ENVDIR/../9.1/install/

3)    Run
badmin hclose all
badmin qinact all 

4)    Run patchinstall: ./patchinstall <patch>
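
The four steps above can be sketched as one script. The names run and install_patch are hypothetical helpers, not part of LSF. By default the sketch only prints each command (DRY_RUN=1), so it can be reviewed safely; it copies the patch before changing directory so that a relative patch path still resolves.

```shell
#!/bin/sh
# run CMD...: print the command in dry-run mode (the default),
# otherwise execute it in the current shell.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "+ $*"
    else
        "$@"
    fi
}

# install_patch PATCH_FILE
# Hypothetical wrapper around steps 1) to 4) of section 5.2.
install_patch() {
    patch_file=$1

    # The install directory is derived from LSF_ENVDIR, as in step 1).
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is not set; set your environment first" >&2
        return 1
    fi
    install_dir="$LSF_ENVDIR/../9.1/install"

    run cp "$patch_file" "$install_dir/" || return 1   # step 2)
    run cd "$install_dir" || return 1                  # step 1)
    run badmin hclose all || return 1                  # step 3)
    run badmin qinact all || return 1
    run ./patchinstall "$(basename "$patch_file")"     # step 4)
}
```

To execute the commands for real, set DRY_RUN=0 and call install_patch with the path of the downloaded patch file. The badmin commands might prompt for confirmation, so run the script interactively.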

 

5.3          After installation

 

1)    Run
badmin hshutdown all
lsadmin resshutdown all
lsadmin limshutdown all

2)    Run
lsadmin limstartup all
lsadmin resstartup all
badmin hstartup all

3)    Run
badmin hopen all
badmin qact all
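
The restart sequence above is order-sensitive: the daemons are shut down in one order, started in the reverse order, and only then are hosts and queues reopened. A minimal sketch that keeps the order in one place follows. The names run and restart_lsf_daemons are hypothetical helpers, and the script prints the commands unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# run CMD...: print the command in dry-run mode (the default),
# otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "+ $*"
    else
        "$@"
    fi
}

# restart_lsf_daemons
# Hypothetical wrapper around steps 1) to 3) of section 5.3.
# Stops at the first command that fails.
restart_lsf_daemons() {
    # Step 1): shut down sbatchd, then RES, then LIM.
    run badmin hshutdown all || return 1
    run lsadmin resshutdown all || return 1
    run lsadmin limshutdown all || return 1

    # Step 2): start LIM, then RES, then sbatchd.
    run lsadmin limstartup all || return 1
    run lsadmin resstartup all || return 1
    run badmin hstartup all || return 1

    # Step 3): reopen hosts and reactivate queues.
    run badmin hopen all || return 1
    run badmin qact all
}
```

Some of these commands might ask for confirmation when run against all hosts, so run the sketch interactively rather than from an unattended job.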

 

5.4          Uninstallation

 

To roll back a patch:

1)    Log on to the LSF master host as root

2)    Set your environment:

-      For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf

-      For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf

3)    Run
badmin hclose all
badmin qinact all

4)    Run ./patchinstall -r <patch>

5)    Run
badmin hshutdown all
lsadmin resshutdown all
lsadmin limshutdown all

6)    Run
lsadmin limstartup all
lsadmin resstartup all
badmin hstartup all

7)    Run
badmin hopen all
badmin qact all
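
Steps 3) to 7) of the rollback can be sketched as a single function. The names run and rollback_patch are hypothetical helpers. The sketch assumes that ./patchinstall -r is run from the same $LSF_ENVDIR/../9.1/install directory that is used for installation, which the steps above do not state explicitly. Commands are only printed unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# run CMD...: print the command in dry-run mode (the default),
# otherwise execute it in the current shell.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "+ $*"
    else
        "$@"
    fi
}

# rollback_patch PATCH
# Hypothetical wrapper around rollback steps 3) to 7).
rollback_patch() {
    patch=$1
    if [ -z "${LSF_ENVDIR:-}" ]; then
        echo "LSF_ENVDIR is not set; set your environment first" >&2
        return 1
    fi

    # Assumption: roll back from the patch install directory.
    run cd "$LSF_ENVDIR/../9.1/install" || return 1

    run badmin hclose all || return 1            # step 3)
    run badmin qinact all || return 1
    run ./patchinstall -r "$patch" || return 1   # step 4)

    run badmin hshutdown all || return 1         # step 5)
    run lsadmin resshutdown all || return 1
    run lsadmin limshutdown all || return 1

    run lsadmin limstartup all || return 1       # step 6)
    run lsadmin resstartup all || return 1
    run badmin hstartup all || return 1

    run badmin hopen all || return 1             # step 7)
    run badmin qact all
}
```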

 

6.   List of files in package 

 

filelist.txt
fixlist.txt
include/
include/lsf/
include/lsf/lsbatch.h
include/lsf/lsf.h
include/lsf/lssched.h
linux2.6-glibc2.3-x86_64/
linux2.6-glibc2.3-x86_64/bin/
linux2.6-glibc2.3-x86_64/bin/bacct
linux2.6-glibc2.3-x86_64/bin/badmin
linux2.6-glibc2.3-x86_64/bin/bapp
linux2.6-glibc2.3-x86_64/bin/bgpinfo
linux2.6-glibc2.3-x86_64/bin/bhist
linux2.6-glibc2.3-x86_64/bin/bhosts
linux2.6-glibc2.3-x86_64/bin/bjobs
linux2.6-glibc2.3-x86_64/bin/bkill
linux2.6-glibc2.3-x86_64/bin/blaunch
linux2.6-glibc2.3-x86_64/bin/blimits
linux2.6-glibc2.3-x86_64/bin/bmgroup
linux2.6-glibc2.3-x86_64/bin/bmod
linux2.6-glibc2.3-x86_64/bin/bparams
linux2.6-glibc2.3-x86_64/bin/bpeek
linux2.6-glibc2.3-x86_64/bin/bqueues
linux2.6-glibc2.3-x86_64/bin/bresize
linux2.6-glibc2.3-x86_64/bin/bresources (Only on Linux2.6-glibc2.3-x86_64)
linux2.6-glibc2.3-x86_64/bin/brestart
linux2.6-glibc2.3-x86_64/bin/bsub
linux2.6-glibc2.3-x86_64/bin/bswitch
linux2.6-glibc2.3-x86_64/bin/lsadmin
linux2.6-glibc2.3-x86_64/bin/lsgrun
linux2.6-glibc2.3-x86_64/bin/lshosts
linux2.6-glibc2.3-x86_64/bin/lsmakerm
linux2.6-glibc2.3-x86_64/bin/lsreghost (Only on Linux2.6-glibc2.3-x86_64)
linux2.6-glibc2.3-x86_64/bin/lsrun
linux2.6-glibc2.3-x86_64/bin/pam
linux2.6-glibc2.3-x86_64/lib/
linux2.6-glibc2.3-x86_64/lib/cal_jobweight.so
linux2.6-glibc2.3-x86_64/lib/libbat.a
linux2.6-glibc2.3-x86_64/lib/libbat.so
linux2.6-glibc2.3-x86_64/lib/liblsbstream.so
linux2.6-glibc2.3-x86_64/lib/liblsf.a
linux2.6-glibc2.3-x86_64/lib/liblsf.so
linux2.6-glibc2.3-x86_64/lib/libptmalloc3.so (Only on Linux2.6-glibc2.3-x86_64)
linux2.6-glibc2.3-x86_64/lib/schmod_advrsv.so
linux2.6-glibc2.3-x86_64/lib/schmod_affinity.so
linux2.6-glibc2.3-x86_64/lib/schmod_aps.so
linux2.6-glibc2.3-x86_64/lib/schmod_bluegene.so
linux2.6-glibc2.3-x86_64/lib/schmod_cpuset.so
linux2.6-glibc2.3-x86_64/lib/schmod_craylinux.so
linux2.6-glibc2.3-x86_64/lib/schmod_crayx1.so
linux2.6-glibc2.3-x86_64/lib/schmod_dc.so
linux2.6-glibc2.3-x86_64/lib/schmod_default.so
linux2.6-glibc2.3-x86_64/lib/schmod_demand.so
linux2.6-glibc2.3-x86_64/lib/schmod_dist.so
linux2.6-glibc2.3-x86_64/lib/schmod_fairshare.so
linux2.6-glibc2.3-x86_64/lib/schmod_fcfs.so
linux2.6-glibc2.3-x86_64/lib/schmod_jobweight.so
linux2.6-glibc2.3-x86_64/lib/schmod_limit.so
linux2.6-glibc2.3-x86_64/lib/schmod_mc.so
linux2.6-glibc2.3-x86_64/lib/schmod_parallel.so
linux2.6-glibc2.3-x86_64/lib/schmod_preemption.so
linux2.6-glibc2.3-x86_64/lib/schmod_ps.so
linux2.6-glibc2.3-x86_64/lib/schmod_pset.so
linux2.6-glibc2.3-x86_64/lib/schmod_reserve.so
linux2.6-glibc2.3-x86_64/lib/schmod_rms.so
linux2.6-glibc2.3-x86_64/lib/schmod_xl.so
linux2.6-glibc2.3-x86_64/lib/schmod_maui.so
linux2.6-glibc2.3-x86_64/etc/
linux2.6-glibc2.3-x86_64/etc/daemons.wrap
linux2.6-glibc2.3-x86_64/etc/eauth
linux2.6-glibc2.3-x86_64/etc/ebrokerd
linux2.6-glibc2.3-x86_64/etc/egosc
linux2.6-glibc2.3-x86_64/etc/elim.gpu
linux2.6-glibc2.3-x86_64/etc/elim.hpc (Only on linux2.6-glibc2.3-x86_64)
linux2.6-glibc2.3-x86_64/etc/erestart
linux2.6-glibc2.3-x86_64/etc/gpolicyd
linux2.6-glibc2.3-x86_64/etc/krbrenewd

linux2.6-glibc2.3-x86_64/etc/eauth.krb5
linux2.6-glibc2.3-x86_64/etc/lim
linux2.6-glibc2.3-x86_64/etc/mbatchd
linux2.6-glibc2.3-x86_64/etc/mbschd
linux2.6-glibc2.3-x86_64/etc/melim
linux2.6-glibc2.3-x86_64/etc/mesub
linux2.6-glibc2.3-x86_64/etc/nios
linux2.6-glibc2.3-x86_64/etc/pim
linux2.6-glibc2.3-x86_64/etc/res
linux2.6-glibc2.3-x86_64/etc/rla (Only on linux2.6-glibc2.3-x86_64)
linux2.6-glibc2.3-x86_64/etc/sbatchd
linux2.6-glibc2.3-x86_64/etc/vemkd
misc/
misc/examples/
misc/examples/elim.gpu.ext/
misc/examples/elim.gpu.ext/Makefile
misc/examples/elim.gpu.ext/README
misc/examples/elim.gpu.ext/elim.gpu.ext
misc/examples/elim.gpu.ext/elim.gpu.ext.c
misc/examples/elim.gpu.ext/elim.gpu.topology
misc/examples/elim.gpu.ext/elim.gpu.topology.c
misc/examples/elim.gpu.ext/hwloc_nvml.so
misc/examples/external_plugin/
misc/examples/external_plugin/allocexample.c
misc/examples/external_plugin/matchexample.c
misc/examples/external_plugin/README
packagedef.txt 

 

7.   Product notifications

 

To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information you want to be notified about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes.

 

8.   Copyright and trademark information

© Copyright IBM Corporation 2018 

U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.