DSM single-node cluster unhealthy after power cycle

Issue: DSM single-node AWS cluster unhealthy after power cycle


Affected version: DSM 4.4


Description: After a power cycle of a single-node DSM cluster, the Flannel pod enters a crash loop and does not recover.

Flannel pod log error: “Failed to create SubnetManager”
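
One way to confirm the symptom is to list the pods, pick out any Flannel pod in CrashLoopBackOff, and search its previous container log for the error above. This is a minimal sketch, not a DSM-provided tool. It assumes the Flannel pod name contains "flannel" and that kubectl on the node is configured for the cluster. The helper name flannel_crashlooping is hypothetical.

```shell
#!/bin/sh
# Reads `kubectl get pods --all-namespaces --no-headers` output on stdin and
# prints "namespace pod" for every pod whose name contains "flannel"
# (assumption) and whose STATUS column is CrashLoopBackOff.
# Columns: NAMESPACE NAME READY STATUS RESTARTS AGE
flannel_crashlooping() {
    awk '$2 ~ /flannel/ && $4 == "CrashLoopBackOff" { print $1, $2 }'
}

# Only query the cluster when kubectl is available on this machine.
if command -v kubectl >/dev/null 2>&1; then
    kubectl get pods --all-namespaces --no-headers | flannel_crashlooping |
    while read -r ns pod; do
        # --previous shows the log of the last crashed container instance.
        kubectl -n "$ns" logs "$pod" --previous | grep "Failed to create SubnetManager"
    done
fi
```

If the grep prints a matching line, the node is showing the failure described in this article.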


Fix: The cluster usually recovers on its own within 10-15 minutes. If it does not, follow the steps below.

Note: This issue is limited to single-node setups and is caused by a race condition between kubelet, kube-proxy, and Flannel.

  1. systemctl stop kubelet
  2. Wait 30-45 seconds
  3. systemctl start kubelet
  4. Restart the kube-proxy pod
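
The four steps above could be scripted roughly as follows. This is a sketch under stated assumptions, not an official procedure. It assumes kube-proxy runs in the kube-system namespace with the label k8s-app=kube-proxy (the kubeadm default, which may differ on DSM) and that a controller such as a DaemonSet recreates the pod once it is deleted. The script defaults to a dry run that only prints the commands. The names recover_node, run, DRY_RUN, WAIT_SECONDS, KUBE_PROXY_NS, and KUBE_PROXY_LABEL are hypothetical.

```shell
#!/bin/sh
# Sketch of the manual recovery steps. Defaults to a dry run that only prints
# the commands; set DRY_RUN=0 to execute them (as root, on the DSM node).
DRY_RUN="${DRY_RUN:-1}"
WAIT_SECONDS="${WAIT_SECONDS:-45}"   # the article recommends 30-45 seconds

# Assumption: namespace and label of the kube-proxy pod. Verify them first
# with: kubectl get pods --all-namespaces --show-labels
KUBE_PROXY_NS="${KUBE_PROXY_NS:-kube-system}"
KUBE_PROXY_LABEL="${KUBE_PROXY_LABEL:-k8s-app=kube-proxy}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

recover_node() {
    run systemctl stop kubelet
    run sleep "$WAIT_SECONDS"
    run systemctl start kubelet
    # Deleting the pod restarts kube-proxy, assuming its controller
    # (typically a DaemonSet) recreates it.
    run kubectl -n "$KUBE_PROXY_NS" delete pod -l "$KUBE_PROXY_LABEL"
}

recover_node
```

Run it once as-is to review the printed commands, then rerun with DRY_RUN=0 if the namespace and label match your cluster.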

 
