debugging

prometheus crashloopbackoff

prometheus crash #1

Caused by duplicate prometheus-kubelet services in the kube-system namespace; there should be only one. Note that uninstalling the Helm chart does not remove this service, so it has to be deleted manually.
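One way to spot the duplicates is to list the services in kube-system and filter on the name. The following is a minimal sketch, not a definitive check: the helper names `kubelet_services` and `fetch_services` are hypothetical, and matching on the substring `kubelet` is an assumption about how the services are named in your cluster.

```python
import json
import subprocess


def kubelet_services(service_list: dict) -> list[str]:
    """Return the sorted names of Services whose name contains 'kubelet'.

    `service_list` is the parsed output of
    `kubectl get services -n kube-system -o json`.
    The substring match is an assumption; adjust it to your release names.
    """
    return sorted(
        item["metadata"]["name"]
        for item in service_list.get("items", [])
        if "kubelet" in item["metadata"]["name"]
    )


def fetch_services(namespace: str = "kube-system") -> dict:
    """Fetch the Service list of a namespace as parsed JSON via kubectl."""
    result = subprocess.run(
        ["kubectl", "get", "services", "-n", namespace, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)
```

Calling `kubelet_services(fetch_services())` against a live cluster returns the matching names. If more than one comes back, the extra one can be removed with `kubectl delete service <name> -n kube-system`.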

pods stuck in init

Describe the stuck pod (`kubectl describe pod`) and check its events for a message from the CNI saying it is running out of IPs.
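When many pods are stuck, scanning the namespace events can be quicker than describing each pod. This sketch filters the parsed output of `kubectl get events -n <namespace> -o json`. The function name and the message fragments in `IP_EXHAUSTION_HINTS` are assumptions: the exact wording depends on the CNI plugin in use, so adjust them to what your cluster actually reports.

```python
# Assumed message fragments (lowercase); the real wording depends on the CNI plugin.
IP_EXHAUSTION_HINTS = (
    "no ip addresses available",
    "failed to assign an ip address",
)


def ip_exhaustion_events(event_list: dict) -> list[tuple[str, str]]:
    """Return (object name, message) for events that look like IP exhaustion.

    `event_list` is the parsed output of `kubectl get events -o json`.
    """
    hits = []
    for event in event_list.get("items", []):
        message = event.get("message") or ""
        # Case-insensitive substring match against the assumed hints.
        if any(hint in message.lower() for hint in IP_EXHAUSTION_HINTS):
            name = (event.get("involvedObject") or {}).get("name", "")
            hits.append((name, message))
    return hits
```

An empty result does not rule out IP exhaustion; it only means none of the assumed fragments matched.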

cni out of IPs

Use the clear_cni.sh script to clear out the CNI state. This restarts the CNI pods and frees up the IPs.

multus crashloopbackoff

See k8snetworkplumbingwg/multus-cni#710 (comment).
Multus can get OOMKilled. Increase the memory resources (requests and limits) for the Multus pods.
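To confirm that a crash loop really is an OOM kill, look at the termination reason in the container statuses. The sketch below works on the parsed output of `kubectl get pods -n <namespace> -o json`; the function name is hypothetical, while the `containerStatuses`, `state`, `lastState` and `terminated.reason` fields are part of the standard pod status.

```python
def oom_killed_containers(pod_list: dict) -> list[tuple[str, str]]:
    """Return (pod name, container name) pairs whose current or last state
    was terminated with reason 'OOMKilled'.

    `pod_list` is the parsed output of `kubectl get pods -o json`.
    """
    hits = []
    for pod in pod_list.get("items", []):
        pod_name = pod["metadata"]["name"]
        statuses = (pod.get("status") or {}).get("containerStatuses") or []
        for status in statuses:
            # A crash-looping container usually carries the reason in lastState.
            for key in ("state", "lastState"):
                terminated = (status.get(key) or {}).get("terminated") or {}
                if terminated.get("reason") == "OOMKilled":
                    hits.append((pod_name, status["name"]))
                    break
    return hits
```

The same check applies to any pod in CrashLoopBackOff, not only Multus.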

prometheus crashloopbackoff

Prometheus can get OOMKilled if it is collecting too many series.

(Image: prometheus-tsdb)

Reduce high-cardinality metrics or increase the memory resources for the Prometheus pods.
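To find which metrics are worth reducing, the Prometheus HTTP API exposes TSDB statistics at `/api/v1/status/tsdb`, including a `seriesCountByMetricName` list. The following sketch ranks that list; the function name is hypothetical, and it assumes you have already fetched and parsed the JSON response (for example with `curl` against a port-forwarded Prometheus, where the address is whatever your setup uses).

```python
def top_series_by_metric(tsdb_status: dict, limit: int = 5) -> list[tuple[str, int]]:
    """Return the `limit` metric names with the most series, largest first.

    `tsdb_status` is the parsed JSON body of GET /api/v1/status/tsdb.
    """
    entries = (tsdb_status.get("data") or {}).get("seriesCountByMetricName") or []
    ranked = sorted(entries, key=lambda entry: int(entry["value"]), reverse=True)
    return [(entry["name"], int(entry["value"])) for entry in ranked[:limit]]
```

The metrics at the top of the result are the first candidates for dropping or relabeling.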