IBM Platform Symphony 5.0 Fix 231953 Readme File
Abstract
A task running during a reclaim grace period that is interrupted by a host restart or reboot is not immediately allowed to restart on another host. Instead, Symphony waits for the full reclaim grace period to elapse before allowing the task to be restarted somewhere else.
Description
This issue severely affects cluster productivity when the reclaim grace period is long: a task that should be dispatched quickly to another working host is instead held for the full grace period. The situation should be handled by Symphony's automatic failure recovery for a compute host that goes down. When SSM detects the broken connection with SIM, or when EGO notifies SSM within three minutes of the compute host going down, the task should immediately be allowed to run elsewhere.
This fix addresses the following issue:
When a host that is being reclaimed restarts or reboots while tasks are running on it, SSM does not requeue those tasks to run on other hosts. The tasks should immediately be allowed to run elsewhere.
Readme file for: IBM® Platform Symphony
Product/Component Release: 5.0
Update Name: Fix 231953
Fix ID: sym-5.0-build231953-ms
Publication date: 2 April 2014
Last modified date: 2 April 2014
Contents:
1. List of fixes
2. Download location
3. Products or components affected
4. Installation and configuration
5. List of files
6. Copyright and trademark information
1. List of fixes
APAR#P100371: A task being reclaimed that is interrupted due to a compute host reboot is not restarted until the grace period has elapsed.
2. Download location
Download Fix 231953 from the following location: http://www.ibm.com/eserver/support/fixes/
3. Products or components affected
Product/Component Name, Platform, Fix ID:
Platform Symphony/SSM, Linux 64-bit, sym-5.0-build231953-ms
4. Installation and configuration
4.1 Before installation
1. Disable all applications.
Log on to the master host as the cluster administrator and run:
> source cshrc.platform
> egosh user logon -u Admin -x Admin
> soamcontrol app disable all
4.2 Installation steps
1. Log on to each management host and back up the following binary that will be replaced by this patch:
> $SOAM_HOME/5.0/linux2.6-glibc2.3-x86_64/etc/ssm
2. Apply the patch to each management host by replacing the binary above with the ssm file from this fix.
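The backup in step 1 can be scripted. The following is a minimal POSIX sh sketch and is not part of the fix package: the function name backup_binary and the .bak suffix are assumptions chosen for illustration, and the path is passed in by the caller rather than read from the environment.

```shell
# Hypothetical helper (not shipped with the fix): copy a binary to
# <path>.bak before it is replaced. Refuses to overwrite an existing backup.
backup_binary() {
    src="$1"
    dest="$src.bak"
    if [ ! -f "$src" ]; then
        echo "backup_binary: $src not found" >&2
        return 1
    fi
    if [ -e "$dest" ]; then
        echo "backup_binary: $dest already exists" >&2
        return 2
    fi
    # -p preserves the mode and timestamps of the original binary
    cp -p "$src" "$dest"
}

# Example call, using the path from step 1:
# backup_binary "$SOAM_HOME/5.0/linux2.6-glibc2.3-x86_64/etc/ssm"
```

Refusing to overwrite an existing backup is a deliberate choice in this sketch, so that running it a second time after the patch is applied cannot replace the saved original with the patched binary.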
4.3 After installation
1. Verify the installation.
> $SOAM_HOME/5.0/linux2.6-glibc2.3-x86_64/etc/ssm -V
Platform Symphony 5.0.0.231953, 31 Mar 2014
Copyright (C) 2001-2009 Platform Computing Corporation.
2. Enable all applications.
Log on to the master host as the cluster administrator and run:
> source cshrc.platform
> egosh user logon -u Admin -x Admin
> soamcontrol app enable <appName>
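The verification in step 1 can also be checked by a script instead of by eye. The following is a minimal POSIX sh sketch under stated assumptions: the function name has_build is hypothetical, and the match relies on the version line having the form shown in step 1 (build number preceded by a period and followed by a comma). Whether ssm -V writes to standard output or standard error is not stated in this readme, so the example call captures both.

```shell
# Hypothetical helper: succeed only if the version text contains the
# expected build number in the form shown in step 1, for example
# "Platform Symphony 5.0.0.231953, 31 Mar 2014".
has_build() {
    version_text="$1"
    build="$2"
    case "$version_text" in
        *".$build,"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example call, using the command from step 1:
# if has_build "$($SOAM_HOME/5.0/linux2.6-glibc2.3-x86_64/etc/ssm -V 2>&1)" 231953; then
#     echo "Fix 231953 is installed"
# else
#     echo "Fix 231953 is NOT installed"
# fi
```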
4.4 Uninstallation
1. Disable all applications.
Log on to the master host as the cluster administrator and run:
> source cshrc.platform
> egosh user logon -u Admin -x Admin
> soamcontrol app disable all
2. Uninstall.
On each management host, restore the ssm binary that you backed up before installing the patch.
3. Enable all applications.
Log on to the master host as the cluster administrator and run:
> source cshrc.platform
> egosh user logon -u Admin -x Admin
> soamcontrol app enable <appName>
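The restore in step 2 can be scripted in the same spirit. The following is a minimal POSIX sh sketch, not part of the fix package: the function name restore_binary is hypothetical, and both the location of the saved copy and the target path are supplied by the caller, because this readme does not prescribe where the backup is kept.

```shell
# Hypothetical helper: copy a previously saved binary back over the
# patched one. Arguments: <saved copy> <target path>.
restore_binary() {
    saved="$1"
    target="$2"
    if [ ! -f "$saved" ]; then
        echo "restore_binary: $saved not found" >&2
        return 1
    fi
    # -p preserves the mode and timestamps of the saved binary
    cp -p "$saved" "$target"
}

# Example call (the location of the saved copy is an assumption):
# restore_binary /path/to/saved/ssm "$SOAM_HOME/5.0/linux2.6-glibc2.3-x86_64/etc/ssm"
```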
5. List of files
ssm
6. Copyright and trademark information
© Copyright IBM Corporation 2014
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
IBM®, the IBM logo and ibm.com® are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml.