diff --git a/operations/power_management/Power_Off_Management_Cabinets.md b/operations/power_management/Power_Off_Management_Cabinets.md new file mode 100644 index 000000000000..1503163884d2 --- /dev/null +++ b/operations/power_management/Power_Off_Management_Cabinets.md @@ -0,0 +1,32 @@ +# Power Off Management Cabinets + +Power off PDUs and any remaining components in management cabinets which are powered on, such as HPE Slingshot switches, management switches, and a KVM device. + +## Power Off Management Cabinet PDU circuit breakers + +**CAUTION:** The nodes and switches in management cabinets should only +be powered off when it has been confirmed that the management Kubernetes cluster and any Lustre or Spectrum Scale filesystems in the cabinets have been cleanly shut down. See the procedures in +[Power Off the External File Systems](System_Power_Off_Procedures.md#Power_off_the_External_File_systems) +and [Shut Down and Power Off the Management Kubernetes Cluster](Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md). + +1. (Optional) Power down the modular coolant distribution unit (MCDU) in a liquid-cooled HPE Cray EX2000 cabinet. + + The MCDU in a liquid-cooled HPE Cray EX2000 cabinet (also +referred to as a Hill or TDS cabinet) typically receives power from its management cabinet PDUs. If the +system includes an EX2000 cabinet, then do not power off the management cabinet PDUs until the MCDU has +been powered off. + + **WARNING:** Dropping power to the management cabinet PDUs without powering off the MCDU will cause an emergency power off (EPO) of the cabinet and may +result in data loss or equipment damage. + +1. Set each management cabinet PDU circuit breaker to `OFF`. + + A slotted screwdriver may be required to open PDU circuit breakers. + +1. To power off Motivair liquid-cooled chilled doors and CDUs, locate the power off switch on the CDU control panel and set it to `OFF`. + + Refer to vendor documentation for the chilled-door cooling system for power control procedures when chilled doors are installed on standard racks. + +## Next step + +Return to [System Power Off Procedures](System_Power_Off_Procedures.md) and continue with next step. diff --git a/operations/power_management/Power_Off_Storage_Cabinets.md b/operations/power_management/Power_Off_Storage_Cabinets.md index 43720c6b3b21..23b335e87fe4 100644 --- a/operations/power_management/Power_Off_Storage_Cabinets.md +++ b/operations/power_management/Power_Off_Storage_Cabinets.md @@ -5,7 +5,7 @@ Power off storage nodes and management switches in standard racks. ## Power off standard rack PDU circuit breakers **CAUTION:** The Lustre or Spectrum Scale (GPFS) file systems on nodes and switches in storage cabinets should only -be powered off once it has been confirmed that the filesystems have been cleanly shut down. See the procedures in +be powered off when it has been confirmed that the filesystems have been cleanly shut down. See the procedures in [Power Off the External File Systems](System_Power_Off_Procedures.md#Power_off_the_External_File_systems). 1. Set each cabinet PDU circuit breaker to `OFF`. diff --git a/operations/power_management/Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md b/operations/power_management/Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md index 3f41bb0eb876..711dd81a6b58 100644 --- a/operations/power_management/Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md +++ b/operations/power_management/Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md @@ -421,15 +421,6 @@ documentation (`S-8031`) for instructions on how to acquire a SAT authentication ipmitool -I lanplus -U "${USERNAME}" -E -H NCN-M001_BMC_HOSTNAME chassis power status ``` -1. (Optional) Power down Modular coolant distribution unit (MDCU) in a liquid-cooled HPE Cray EX20000 cabinet. - - **CAUTION:** The modular coolant distribution unit \(MDCU\) in a liquid-cooled HPE Cray EX2000 cabinet (also referred to as a Hill or TDS cabinet) typically receives power from its management - cabinet PDUs. If the system includes an EX2000 cabinet, then **do not power off** the management cabinet PDUs. Powering off the MDCU will cause an emergency power off \(EPO\) of the cabinet and - may result in data loss or equipment damage. - - 1. (Optional) If a liquid-cooled EX2000 cabinet is not receiving MCDU power from this management cabinet, then power off the PDU circuit breakers or disconnect the PDUs from facility power and - follow lock out/tag out procedures for the site. - ## Next step Return to [System Power Off Procedures](System_Power_Off_Procedures.md) and continue with next step. diff --git a/operations/power_management/System_Power_Off_Procedures.md b/operations/power_management/System_Power_Off_Procedures.md index 71df07ff596e..01ca58094fa4 100644 --- a/operations/power_management/System_Power_Off_Procedures.md +++ b/operations/power_management/System_Power_Off_Procedures.md @@ -41,6 +41,10 @@ To power off standard racks which have only storage nodes and switches, refer to To shut down the management Kubernetes cluster, refer to [Shut Down and Power Off the Management Kubernetes Cluster](Shut_Down_and_Power_Off_the_Management_Kubernetes_Cluster.md). +## Power Off Management Cabinets + +To power off management cabinets, refer to [Power Off Management Cabinets](Power_Off_Management_Cabinets.md). + ## `Lockout Tagout` Facility Power If facility power must be removed from a single cabinet or cabinet group for maintenance, follow proper `lockout-tagout` procedures for the site.