The HPE Cray EX system includes two types of nodes:
- Compute Nodes, where high performance computing applications are run, with node names in the form of nidXXXXXX
- Non-Compute Nodes (NCNs), which carry out system functions and come in three versions:
  - Master nodes, with names in the form of ncn-mXXX
  - Worker nodes, with names in the form of ncn-wXXX
  - Utility Storage nodes, with names in the form of ncn-sXXX
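The naming conventions above can be sketched as a small hostname classifier. This is a minimal illustration, not part of any HPE tooling: it assumes each X in the name templates stands for one decimal digit (consistent with names such as ncn-m001 and nid000001), and the `classify_node` function and `NODE_PATTERNS` table are hypothetical names introduced here.

```python
import re

# Assumption: each "X" in the documented name templates is one decimal digit.
# The type labels used as keys are illustrative, not official identifiers.
NODE_PATTERNS = {
    "compute": re.compile(r"nid\d{6}"),
    "master": re.compile(r"ncn-m\d{3}"),
    "worker": re.compile(r"ncn-w\d{3}"),
    "storage": re.compile(r"ncn-s\d{3}"),
}


def classify_node(name):
    """Return the node type for a hostname, or None if no pattern matches."""
    for node_type, pattern in NODE_PATTERNS.items():
        # fullmatch rejects names with extra leading or trailing characters.
        if pattern.fullmatch(name):
            return node_type
    return None
```

For example, `classify_node("ncn-w002")` returns `"worker"`, while a name that fits none of the templates returns `None`.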
The HPE Cray EX system includes the following nodes:
- Nine or more non-compute nodes (NCNs) that host system services:
  - ncn-m001, ncn-m002, and ncn-m003 are configured as Kubernetes master nodes.
  - ncn-w001, ncn-w002, and ncn-w003 are configured as Kubernetes worker nodes. Every system contains three or more worker nodes.
  - ncn-s001, ncn-s002, and ncn-s003 are used for storage. Every system contains three or more utility storage nodes.
- Four or more compute nodes, starting at nid000001.
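As a rough illustration of the minimum counts listed above, the following sketch checks a list of hostnames against them. It is an assumption-laden example rather than a supported tool: the `MINIMUMS` table, `count_nodes`, and `shortfalls` are hypothetical names, node types are inferred from the hostname prefix only, and only the minimums stated in this section are checked.

```python
from collections import Counter

# Minimums taken from the list above; the keys are illustrative labels.
MINIMUMS = {"ncn": 9, "worker": 3, "storage": 3, "compute": 4}

# Hostname prefix -> node type (assumed from the documented naming scheme).
PREFIXES = {"ncn-m": "master", "ncn-w": "worker", "ncn-s": "storage", "nid": "compute"}


def count_nodes(names):
    """Count nodes per type, plus an overall "ncn" total for non-compute nodes."""
    counts = Counter()
    for name in names:
        for prefix, node_type in PREFIXES.items():
            if name.startswith(prefix):
                counts[node_type] += 1
                if node_type != "compute":
                    counts["ncn"] += 1
                break
    return counts


def shortfalls(names):
    """Return {type: (found, required)} for every minimum that is not met."""
    counts = count_nodes(names)
    return {
        kind: (counts[kind], required)
        for kind, required in MINIMUMS.items()
        if counts[kind] < required
    }
```

An empty result from `shortfalls` means the hostname list satisfies every minimum stated here. It does not validate anything else about the system.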
- Node Management Workflows
- Rebuild NCNs
- Reboot NCNs
- Enable Nodes
- Disable Nodes
- Find Node Type and Manufacturer
- Add a Standard Rack Node
- Replace a Compute Blade
- Swap a Compute Blade with a Different System
- Clear Space in Root File System on Worker Nodes
- Manually Wipe Boot Configuration on Nodes to be Reinstalled
- Troubleshoot Issues with Redfish Endpoint Discovery
- Check for Redfish Events from Nodes
- Reset Credentials on Redfish Devices
- Access and Update Settings for Replacement NCNs
- Change Settings for HMS Collector Polling of Air Cooled Nodes
- Use the Physical KVM
- Launch a Virtual KVM on Gigabyte Servers
- Launch a Virtual KVM on Intel Servers
- Change Java Security Settings
- Verify Accuracy of the System Clock
- Configuration of NCN Bonding
- Troubleshoot Loss of Console Connections and Logs on Gigabyte Nodes
- Check the BMC Failover Mode
- Update Compute Node Mellanox HSN NIC Firmware
- TLS Certificates for Redfish BMCs
- Dump a Non-Compute Node
- Enable Passwordless Connections to Liquid Cooled Node BMCs