How to Diagnose a Crashed API Server

The API server pod won't come back up - HELP 😱 😱 😱

Perhaps you've edited the manifest, or perhaps a lab question has dropped you into an environment where the API server is already broken. You run docker ps or crictl ps and see the API server container appear briefly, then disappear. The container doesn't last long enough for you to grab an ID to pull logs from. Or maybe it never appears in the ps output at all.

Note that these techniques also work for the other static pods, such as etcd, kube-controller-manager and kube-scheduler. Just look for the corresponding name instead of apiserver in the commands below.

Steps to take

Restart kubelet so you don't have to wait too long in the following steps
```
systemctl restart kubelet
```
Determine if the kubelet can even start the API server

If there is a syntax error in the YAML manifest, kubelet will not be able to parse it and will eventually complain. Run the following and watch the output for up to 60 seconds. Note that kubelet does not report manifest errors the same way kubectl does, so the messages may look unfamiliar.
```
journalctl -fu kubelet | grep apiserver
```
There are normally three classes of error you will see here:
1. Could not process manifest file
  
  This indicates an error in the YAML of /etc/kubernetes/manifests/kube-apiserver.yaml. Edit that file directly and correct the issue.
  Note that YAML parsers only report the first error they find. If you have more than one error, i.e. the apiserver doesn't come up after fixing whatever you've found, then repeat this diagnostic process from the top.
2. Structure or argument error
  
  The YAML has been parsed successfully, but kubelet logs a less obvious error about the pod itself. This means that although there are no syntax errors, what you've put in the manifest does not correctly represent the spec for the API server pod, or an argument for something like a volume mount is an invalid path. You need to find and fix this in the manifest file. See also YAML - Dealing with errors.
3. CrashLoopBackOff
  
  This means that the YAML was successfully parsed, the pod started and then exited with an error. To fix this, continue reading.
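To make the three classes easier to spot in a busy log, here is a minimal sketch of a filter that labels kubelet log lines read from stdin. The function name `classify_kubelet_errors` and the labels are hypothetical. The first two patterns come from the messages named above, while the third (any line containing "error" or "failed") is only a rough assumption for catching structure or argument errors.

```shell
# Hypothetical helper: label each kubelet log line from stdin with one of the
# three error classes described above. Lines that match nothing are dropped.
classify_kubelet_errors() {
  while IFS= read -r line; do
    case "$line" in
      # Class 1: kubelet could not parse the manifest YAML.
      *"Could not process manifest file"*) echo "manifest-syntax: $line" ;;
      # Class 3: the pod started and then exited with an error.
      *CrashLoopBackOff*)                  echo "crashloop: $line" ;;
      # Class 2 (assumed heuristic): any other line that mentions an error.
      *[Ee]rror*|*[Ff]ailed*)              echo "other-error: $line" ;;
    esac
  done
}

# Usage on a control plane node (reads recent history instead of following):
#   journalctl -u kubelet --since "2 minutes ago" --no-pager | grep apiserver | classify_kubelet_errors
```

The order of the patterns matters: a CrashLoopBackOff line usually also mentions an error, so it has to be tested before the catch-all.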

Kubelet does launch the API server, but it crashes immediately

This means there is likely an issue with one or more arguments to the kube-apiserver command. The last output of the pod is stored under /var/log/pods, which can tell you why the pod is not starting.
```
cd /var/log/pods
ls -ld *apiserver*
```
This should return something like
```
drwxr-xr-x 3 root root 4096 Oct 26 04:29 kube-system_kube-apiserver-controlplane_02d13ddeddf8e935ec2407132767aeaa
```
If there's more than one match, choose the one with the most recent timestamp.

NOTE: This directory can change name frequently. If you have to repeat the diagnostic process, don't assume it is the same as last time you did this in the same session. Repeat this step from the top.
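Because the directory name changes, it can help to look it up fresh each time rather than retyping it. The following is a minimal sketch. The function name `newest_pod_log_dir` is hypothetical, and the optional second argument exists only so you can try it against a directory other than /var/log/pods.

```shell
# Hypothetical helper: print the most recently modified pod log directory
# whose name contains the given component (default: apiserver).
newest_pod_log_dir() {
  component="${1:-apiserver}"
  base="${2:-/var/log/pods}"
  # -d lists the directories themselves, -t sorts newest first.
  ls -1dt "$base"/*"$component"* 2>/dev/null | head -n 1
}

# Usage on a control plane node:
#   cd "$(newest_pod_log_dir apiserver)"
#   cd "$(newest_pod_log_dir etcd)"      # same idea for the other static pods
```

If nothing matches, the function prints nothing, so check the output before passing it to cd.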

Next, cd into the given directory
```
cd kube-system_kube-apiserver-controlplane_02d13ddeddf8e935ec2407132767aeaa
ls -l
```
You should see
```
drwxr-xr-x 2 root root 4096 Oct 26 04:29 kube-apiserver
```
Then cd into that directory
```
cd kube-apiserver
ls -l
```
There will be one or more .log files. Examine the content of the most recent log, e.g.
```
cat 1.log
```
The issue should be revealed here.
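If the log is long, the last few lines are usually the interesting ones. Here is a minimal sketch that prints the tail of the newest .log file in a directory. The function name `tail_newest_log` is hypothetical. It assumes each line starts with the "timestamp stream tag" prefix written by CRI runtimes such as containerd, and strips it. If your log lines are JSON instead (as on older Docker-based setups), remove the cut step.

```shell
# Hypothetical helper: show the last N lines (default 20) of the newest .log
# file in the given directory (default: current directory).
tail_newest_log() {
  dir="${1:-.}"
  count="${2:-20}"
  # ls -t sorts newest first; take the first match.
  newest=$(ls -1t "$dir"/*.log 2>/dev/null | head -n 1)
  [ -n "$newest" ] || { echo "no .log files in $dir" >&2; return 1; }
  # Assumption: lines look like "<timestamp> <stdout|stderr> <F|P> <message>",
  # so the message starts at the fourth space-separated field.
  tail -n "$count" "$newest" | cut -d' ' -f4-
}

# Usage from inside the kube-apiserver log directory:
#   tail_newest_log . 20
```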

See all of the above demonstrated live in our Office Hours with Community session from March 2023.

You can use one of our Kubernetes playgrounds, or any Kubernetes lab (just ignore the questions).

Use this repo to get some scenarios to practice with. This is the repo used in the above video.

Return to main FAQ
