Reboot Required #
Meaning #
Node is expressing a need to be rebooted. Usually this is shown because of an unattended upgrade which included a new kernel version.
Impact #
Running with older kernel.
Diagnosis #
Log into a machine and check if latest kernel is used.
Mitigation #
Reboot the machine by following those steps:
- Drain node with
kubectl drain node NODE_NAME
- Ensure all pods are moved with
kubectl describe node NODE_NAME
- Add silence to alertmanager silencing every alert with
node=NODE_NAME
for 1h. - ssh into node and run
reboot
- wait for node to come up
- Make node schedulable
kubectl uncordon NODE_NAME
- (optional) remove silence from alertmanager.