Cluster Outage from k3d Node Restart
I ran docker restart k3d-homelab-server-0 and my SSH session froze. Then it disconnected. Then I realized the SSH tunnel runs inside the cluster I just restarted. That was the beginning of a 75-minute full outage that taught me more about my own infrastructure than the previous six months of it working fine. Date 2026-03-08 Duration ~75 minutes Severity Full outage — all services down, no remote access Trigger docker restart k3d-homelab-server-0 to pick up containerd registry config Each part covers a different failure mode and the debugging methodology behind it. The specific technologies will change; the patterns won’t. ...