Commit

Add details about motherboard replacement
mfisher87 committed Nov 23, 2023
1 parent d162b4c commit b02514b
Showing 2 changed files with 29 additions and 6 deletions.
2 changes: 1 addition & 1 deletion changes/2023-11-disk-failure-incident-response/index.md
@@ -4,7 +4,7 @@ description: |
   New storage was added and a new pool was configured in response to this incident.
 ---
 
-## Storage changes
+## Hardware changes
 
 The failed 1TB HDD was removed.

33 changes: 28 additions & 5 deletions changes/2023-11-node-failure-incident-response/index.md
@@ -1,11 +1,34 @@
 ---
 title: "2023-11: Node failure incident response"
 description: |
-  TODO
+  Motherboard was replaced in response to this incident.
 ---
 
-# TODO
+## Hardware changes
 
-* Buy a rackmount UPS with spare capacity (the current UPS probably is undersized)
-* Buy a rack
-* Rack mount servers
+* Ordered a replacement `Supermicro X9SCL-F rev1.11a` from eBay (~$20 + shipping).
+* TODO: Install
+
+
+## Diagnosis
+
+Diagnosed a motherboard failure. Rationale:
+
+* Symptoms began after a full shutdown.
+* Symptoms:
+  * Fans spin for ~1 second after pressing the power button, then stop. This repeats
+    after a ~1 second delay, indefinitely.
+  * No display
+  * No beep from built-in speaker
+  * No POST
+  * Power cycling stops after holding the power button for 5 seconds.
+* Symptoms unchanged when removing all components (one by one): disks, RAM sticks,
+  CPU.
+* Symptoms unchanged with multiple PSUs, PSU cables, and wall outlets.
+
+
+## TODO
+
+- [x] Diagnose failed part(s)
+- [x] Order replacement part(s)
+- [ ] Install replacement part(s)
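
The elimination reasoning in the Diagnosis section can be sketched in a few lines of Python. This is only an illustration, not part of the commit: the `SwapTest` type, the `remaining_suspects` helper, and the component names are hypothetical, and the sketch assumes a single faulty part, so that a component is cleared whenever removing or swapping it leaves the symptoms unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SwapTest:
    """One isolation step: a component was removed or swapped (hypothetical type)."""
    component: str
    symptoms_changed: bool


def remaining_suspects(components, tests):
    """Return the components not cleared by any test.

    Assumes a single fault: if removing or swapping a component leaves the
    symptoms unchanged, that component is treated as not being the cause.
    """
    cleared = {t.component for t in tests if not t.symptoms_changed}
    return [c for c in components if c not in cleared]


# Component list and results mirror the Diagnosis bullets; names are illustrative.
components = ["disks", "RAM", "CPU", "PSU", "PSU cables", "wall outlet", "motherboard"]
tests = [
    SwapTest("disks", symptoms_changed=False),
    SwapTest("RAM", symptoms_changed=False),
    SwapTest("CPU", symptoms_changed=False),
    SwapTest("PSU", symptoms_changed=False),
    SwapTest("PSU cables", symptoms_changed=False),
    SwapTest("wall outlet", symptoms_changed=False),
]

suspects = remaining_suspects(components, tests)
print(suspects)  # ['motherboard']
```

Under the single-fault assumption, the motherboard is the only part that no test cleared, which matches the diagnosis recorded above. It does not rule out a second, masked fault.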
