Hi I have an XCP-NG 8.2.1 server that the host console has completely frozen up on. All the vms on the host are still running and can be accessed, but the management interface is frozen.
I can’t ping it, can’t ssh in, the IMPI console and a physical keyboard and monitor show the same stuck screen asking any key to be pressed to access the console. This of course doesn’t work. I can change the status of num lock and Caps Lock on the keyboard though.
My plan is to remote into each running VM and shut them down 1 by one and then reboot the server but I wanted to
make sure that this isn’t the type of error that lights a RAID array on fire during a reboot and
that it isn’t me missing an obvious solution.
Editing in the “solution” for posterity.
Well I’m not sure what caused it or why but asking xcp-ng to rebuild the array listing the previously dev/sdd (now /dev/sdf) as the last drive with the command:
How many ethernet connections are you using? Is it possible that you have VM on one set of NICs and management on another set of NICs and that the management network or NICs are down?
Well I’m not sure what caused it or why but asking xcp-ng to rebuild the array listing the previously dev/sdd (now /dev/sdf) as the last drive with the command:
This is the thanks I get for making the local storage repo a RAID5 instead of the default RAID0. Remember kids, backups, backups backups, BACKUPS, backups.
Seems to be the way to go. I like having the OS on local storage and then data on NFS and I’ve never been an enormous fan of hardware raid but if that’s the best practice I’m willing to do that instead.