Patch Name: PHKL_31162 Patch Description: s700_800 11.11 crash,vpars,timeout;SG TOC,nParCnfg,shutdown Creation Date: 04/07/07 Post Date: 04/08/10 Hardware Platforms - OS Releases: s700: 11.11 s800: 11.11 Products: N/A Filesets: OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_32,v=HP OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_64,v=HP Automatic Reboot?: Yes Status: General Release Critical: No (superseded patches were critical) PHKL_27892: HANG System hang during reboot. PHKL_26847: HANG If reboot -R or shutdown -R is issued from a partition on a vPars system, all partitions will hang.This behavior can be observed on a L-class and N-class machine.This patch fixes this problem. PHKL_24057: PANIC PHKL_23642: HANG Category Tags: defect_repair hardware_enablement enhancement general_release critical panic halts_system manual_dependencies Path Name: /hp-ux_patches/s700_800/11.X/PHKL_31162 Symptoms: PHKL_31162: ( SR:8606363592 CR:JAGaf24252 ) Booting from the mirrored VxVM root disk (alternate boot) causes the system to resync all the mirrors in rootdg, leaving the rootdg primary disk in an unknown state during the time that the resync activity. PHKL_30512: ( SR:8606350284 CR:JAGaf11104 ) The "Stable complex lock" can be locked, but not released. This will occur if the nPartition Configuration Privilege has been disabled via the Management Processor, and either of the commands "shutdown -R" or "reboot -R" are issued. The unreleased "Stable complex lock" will cause any partition changes to fail for any partition in the complex. Any future use of the commands "shutdown -R" and "reboot -R" will result in the error "Warning: Stable Complex Configuration Data lock error. Unable to get lock." This problem was introduced in patch PHKL_29535. PHKL_29535: ( SR:8606263792 CR:JAGae28117 ) Hardware_enablement: This product update is need to fully enable the nPartition Configuration Privilege. If the nPartition Configuration Privilege has been disabled via the Management Processor, and either of the commands "shutdown -R" or "reboot -R" are issued, an error will occur when the system attempts to lock and write the Stable Complex data. PHKL_28113: ( SR:8606269491 CR:JAGae33726 ) When a node in a ServiceGuard cluster fails, it is difficult to determine why without extensive dump and log analyses. PHKL_27892: ( SR:8606272151 CR:JAGae36291 ) System hang during shutdown phase of a reboot. PHKL_26847: ( SR:8606238062 CR:JAGae07103 ) Issuing a reboot -R or shutdown -R command from a partition on a vPars system will cause all partitions to hang.This behavior can be observed on L-class and N-class machines running vPars. PHKL_26201: ( SR:8606233099 CR:JAGae02323 ) When reboot -qh is executed to quickly halt a system such as when called by envd to halt an overheating system, the system does not always completely halt and can later restart even though the 'System has halted, OK to turn off power or reset system' messages have been displayed. PHKL_24553: ( SR:8606199580 CR:JAGad68767 ) This patch is a member of a set of patches needed to enable the HP-UX Virtual Partitions product. When the HP-UX Virtual Partitions product (VPARSBASE or T1335AA) is installed, it will install the full set of required patches for that product, including this patch. If the HP-UX Virtual Partitions product is not installed, this change will have no impact on your system. PHKL_24057: ( SR:8606178119 CR:JAGad47346 ) A HPMC can occur on various platforms while executing the crash/dump path. PHKL_24046: ( SR:8606166068 CR:JAGad35355 ) Initial Chassis log is incorrectly reported as 0. PHKL_23991: ( SR:8606167111 CR:JAGad36397 ) When a partition is being shutdown, there is no indication on the console, for the partition, as to whether it is in "boot is blocked" (BIB) or not. PHKL_23642: ( SR:8606168722 CR:JAGad38000 ) HP-UX hangs and is unable to create dump file when the machine encounters the HPMC. Defect Description: PHKL_31162: ( SR:8606363592 CR:JAGaf24252 ) Root disk mirrors are not being marked CLEAN during shutdown, triggering an automatic resync operation during next bootup. Resolution: Update VxVM shutdown scripts to ensure that the root disk volumes are marked CLEAN. PHKL_30512: ( SR:8606350284 CR:JAGaf11104 ) The "Stable complex lock" does not get released after attempts to update the "Stable complex" fail due to the nPartition Configuration Privilege being disabled when executing "shutdown -R" or "reboot -R". Resolution: Ensure that the "Stable complex lock" is unlocked following execution of "shutdown -R" or "reboot -R" if the nPartition Configuration Privilege is enabled. PHKL_29535: ( SR:8606263792 CR:JAGae28117 ) If nPartition Configuration Privilege has been disabled, the firmware calls to lock and write the Stable Complex data will return an error indicating insufficient authority. HP-UX will treat this as a general error and write messages to the console. The partition reboot may also incorrectly stop at the "BIB (boot is blocked)" state. Resolution: This product update fully enables the nPartition Configuration Privilege on machines with HP's new PA8800 processor. When calls to firmware to lock and write the Stable Complex data return with an error indicating insufficient authority, the partition will be rebooted without making any partition changes or stopping at the "BIB (boot is blocked)" state. PHKL_28113: ( SR:8606269491 CR:JAGae33726 ) When a node in a ServiceGuard cluster fails, it is most commonly due to a power failure, critical hardware failure, High Priority Machine Check (HPMC), kernel panic, or ServiceGuard-initiated TOC or Safety Timer timeout. In the case of a Safety Timer timeout, it is usually the case that the timeout value is too small for the node that failed. However, it is difficult to determine if this is the case without extensive dump and logs analyses. Resolution: Whenever a ServiceGuard-initiated TOC (or Safety Timer timeout) occurs, a global variable is set to indicate that the TOC was initiated by ServiceGuard. This variable is checked during the dump and a message is logged indicating that this was a timeout generated failure, allowing faster resolution of root cause. PHKL_27892: ( SR:8606272151 CR:JAGae36291 ) During the shutdown phase of a reboot, there is a chance of a system hang if the reboot process and the file system daemons are running on different processors of a multi-processor system. The hang occurs when a file system buffer sync is attempted and the file system daemons have that buffer locked. Resolution: When a deadlock is detected during a buffer sync operation, the reboot process will wait for 3 minutes and attempt the buffer sync again. If there is still a deadlock, the buffer is skipped and the reboot process continues. PHKL_26847: ( SR:8606238062 CR:JAGae07103 ) Issuing a reboot for reconfiguration command from a partition on a vPars system caused all partitions to hang. In the reboot for reconfiguration path, external interrupt and recovery counter bits were being unconditionally set after returning from a pdc call. Resolution: Save the external interrupt and recovery counter bits before invoking the pdc call and restore them after returning from the pdc call. PHKL_26201: ( SR:8606233099 CR:JAGae02323 ) The reboot/halt process doesnot disable the kepd safety timer used by ServiceGuard to TOC the system should cmcld not run. Resolution: Disabled safety timer before the system goes into the final infinite loop. PHKL_24553: ( SR:8606199580 CR:JAGad68767 ) This patch contains minor enhancements required to support the HP-UX Virtual Partitions product. Resolution: Enhancements added to support CPU migration. PHKL_24057: ( SR:8606178119 CR:JAGad47346 ) iva_nothread gets set to 0 (false) very early after panic() is called. After going to real mode and entering do_panic_dump(), this may result in the execution of a routine to make the process unswappable. However, this routine tries to touch a user structure(which is a virtual pointer) in real mode and the system HPMCs. Resolution: To prevent the HPMC we set iva_nothread to 1 (true) immediately after going to real mode and entering do_panic_dump(). This prevents us from trying to make the process unswappable. PHKL_24046: ( SR:8606166068 CR:JAGad35355 ) Initial Chassis Log incorrectly reports cell/cpu as 0. Resolution: Chassis Logs report correct cpu/ cell / cell numbers. PHKL_23991: ( SR:8606167111 CR:JAGad36397 ) The partition boot progress is updated on Virtual Front Panel (VFP). But not all customers look at the VFP for machine status. It is necessary to send a mesage to the partition's console to tell its state and remind custumers to see the VFP for more information. Resolution: Print out a line of message on the console indicating whether the partition is in BIB state or not. If in BIB state, remind customer to go to VFP for more information. PHKL_23642: ( SR:8606168722 CR:JAGad38000 ) When the HPMC occurs, the HPMC monarch gets hung up on the pdc_lock in pdc_call. This prevents the other cpus from making the rendezvous in the OS HPMC handler. Resolution: When entering crash dump path, eliminate the lock resource contention. Enhancement: No (superseded patches contained enhancements) PHKL_28113: Enhancements were delivered in a patch this one has superseded. Please review the Defect Description text for more information. SR: 8606166068 8606167111 8606168722 8606178119 8606199580 8606233099 8606238062 8606263792 8606269491 8606272151 8606350284 8606363592 Patch Files: OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_32,v=HP: /usr/conf/lib/libio.a(chassis_log.o) /usr/conf/lib/libshutdown-pdk.a(machdep.o) /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o) OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_64,v=HP: /usr/conf/lib/libio.a(chassis_log.o) /usr/conf/lib/libshutdown-pdk.a(machdep.o) /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o) what(1) Output: OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_32,v=HP: /usr/conf/lib/libio.a(chassis_log.o): chassis_log.c $Date: 2001/05/01 10:27:46 $Revision: r11.11/2 PATCH_11.11 (PHKL_24046) /usr/conf/lib/libshutdown-pdk.a(machdep.o): machdep.c $Date: 2004/07/06 12:27:56 $Revision: r11. 11/11 PATCH_11.11 (PHKL_31162) /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o): subr_nvm.c $Date: 2001/03/14 09:38:24 $Revision: r11 .11/1 PATCH_11.11 (PHKL_23642) OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_64,v=HP: /usr/conf/lib/libio.a(chassis_log.o): chassis_log.c $Date: 2001/05/01 10:27:46 $Revision: r11.11/2 PATCH_11.11 (PHKL_24046) /usr/conf/lib/libshutdown-pdk.a(machdep.o): machdep.c $Date: 2004/07/06 12:27:56 $Revision: r11. 11/11 PATCH_11.11 (PHKL_31162) /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o): subr_nvm.c $Date: 2001/03/14 09:38:24 $Revision: r11 .11/1 PATCH_11.11 (PHKL_23642) cksum(1) Output: OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_32,v=HP: 1251832881 8120 /usr/conf/lib/libio.a(chassis_log.o) 1433138057 36332 /usr/conf/lib/libshutdown-pdk.a(machdep.o) 734840741 7944 /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o) OS-Core.CORE2-KRN,fr=B.11.11,fa=HP-UX_B.11.11_64,v=HP: 235554898 19504 /usr/conf/lib/libio.a(chassis_log.o) 149485543 91376 /usr/conf/lib/libshutdown-pdk.a(machdep.o) 2903566080 22776 /usr/conf/lib/libshutdown-pdk.a(subr_nvm.o) Patch Conflicts: None Patch Dependencies: None Hardware Dependencies: None Other Dependencies: To resolve the defect relating to ServiceGuard timeout being too short, PHKL_28114 must be installed in addition to this patch. Both of these patches are independent in that there is no negative impact if either of them is installed without the other. However, both patches must be installed in order to resolve the defect. Supersedes: PHKL_30512 PHKL_29535 PHKL_28113 PHKL_27892 PHKL_26847 PHKL_26201 PHKL_24553 PHKL_24057 PHKL_24046 PHKL_23991 PHKL_23642 Equivalent Patches: None Patch Package Size: 100 KBytes Installation Instructions: Please review all instructions and the Hewlett-Packard SupportLine User Guide or your Hewlett-Packard support terms and conditions for precautions, scope of license, restrictions, and, limitation of liability and warranties, before installing this patch. ------------------------------------------------------------ 1. Back up your system before installing a patch. 2. Login as root. 3. Copy the patch to the /tmp directory. 4. Move to the /tmp directory and unshar the patch: cd /tmp sh PHKL_31162 5. Run swinstall to install the patch: swinstall -x autoreboot=true -x patch_match_target=true \ -s /tmp/PHKL_31162.depot By default swinstall will archive the original software in /var/adm/sw/save/PHKL_31162. If you do not wish to retain a copy of the original software, include the patch_save_files option in the swinstall command above: -x patch_save_files=false WARNING: If patch_save_files is false when a patch is installed, the patch cannot be deinstalled. Please be careful when using this feature. For future reference, the contents of the PHKL_31162.text file is available in the product readme: swlist -l product -a readme -d @ /tmp/PHKL_31162.depot To put this patch on a magnetic tape and install from the tape drive, use the command: dd if=/tmp/PHKL_31162.depot of=/dev/rmt/0m bs=2k Special Installation Instructions: None