IBM Spectrum LSF 10.1 Fix 512468 Readme File
Abstract
P102860. This fix enables lim and elim.gpu.topology to detect the GPU properly on hosts that have two sibling GPUs under one PCI, where the first GPU is not an Nvidia GPU, but the second GPU is an Nvidia GPU.
Description
Readme documentation for IBM Spectrum LSF 10.1 Fix 512468 including installation-related instructions, prerequisites and co-requisites, and list of fixes.
This fix addresses the following issue:
The lim and elim.gpu.topology processes cannot detect the Nvidia GPU properly on hosts that have two sibling GPUs under one PCI (especially on AWS GPU instances), where the first GPU is not an Nvidia GPU, but the second GPU is an Nvidia GPU.
Note: For LSF Resource Connector on AWS, you must apply this patch need to both the LSF master and LSF slave GPU instance, create an updated image, then use the new AMI ID to borrow the GPU instance from AWS.
Prerequisites: N/A
Readme file for: IBM® Spectrum LSF
Product/Component Release: 10.1
Update Name: Fix 512468
Fix ID: LSF-10.1-build512468
Publication date: 25 February 2019
Last modified date: 25 February 2019
Contents:
1. List of fixes
2. Download location
3. Products or components affected
4. System requirements
5. Installation and configuration
6. List of files
7. Product notifications
8. Copyright and trademark information
1. List of fixes
P102860
2. Download Location
Download Fix 512468 from the following location: http://www.ibm.com/eserver/support/fixes/
3. Products or components affected
Affected components include: LSF/lim, LSF/elim.gpu.topology, LSF/elim.gpu.topology.c
4. System requirements
Linux2.6-glibc2.3-x86_64
Linux3.10-glibc2.17-x86_64
5. Installation and configuration
5.1 Before installation
(LSF_TOP=Full path to the top-level installation directory of LSF.)
1) Log on to the LSF master host as root. For LSF Resource Connector on AWS, also log on the LSF GPU instance which is used to create LSF GPU image.
2) Set your environment:
- For csh or tcsh: % source LSF_TOP/conf/cshrc.lsf
- For sh, ksh, or bash: $ . LSF_TOP/conf/profile.lsf
5.2 Installation steps
1) Go to the patch install directory: cd $LSF_ENVDIR/../10.1/install/
2) Copy the patch file to the install directory $LSF_ENVDIR/../10.1/install/
3) Run patchinstall: ./patchinstall <patch>
4) For LSF Resource Connector on AWS, after having applied the patch on the GPU instance which is used to create GPU image, create an updated GPU AMI.
5) For LSF Resource Connector on AWS, on LSF master host, update the "imageId" parameter value as the new AMI ID in awsprov_templates.json.
6) Run "lsadmin limrestart all"
5.3 After installation
N/A
5.4 Uninstallation
To roll back a patch:
1) Log on to the LSF master host as root
2) Run ./patchinstall -r <patch>
3) For LSF Resource Connector on AWS, rollback the "imageId" parameter value in awsprov_templates.json.
4) Run "lsadmin lim all"
6. List of files
lim elim.gpu.topology elim.gpu.topology.c 7. Product notifications To receive information about product solution and patch updates automatically, subscribe to product notifications on the My notifications page (www.ibm.com/support/mynotifications) on the IBM Support website (support.ibm.com). You can edit your subscription settings to choose the types of information you want to get notification about, for example, security bulletins, fixes, troubleshooting, and product enhancements or documentation changes. 8. Copyright and trademark information © Copyright IBM Corporation 2019 U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.