What are liveness, readiness, and startup probes, and how do they differ?

A **liveness** probe checks whether a container is still working correctly; if it fails, the kubelet kills and restarts the container. A **readiness** probe checks whether a container is currently able to serve traffic; if it fails, the Pod is removed from Service endpoints (traffic stops being routed to it) but the container is *not* restarted. A **startup** probe protects slow-starting containers by disabling liveness/readiness checks until it succeeds, preventing a liveness probe from prematurely killing a container that's still legitimately warming up.

How do you debug a Pod stuck in CrashLoopBackOff?

CrashLoopBackOff means the container keeps starting and then exiting or crashing, and Kubernetes is applying an exponential backoff delay between restart attempts. Start with `kubectl logs --previous` to see the crashed container's actual output (not the new attempt's, which may not have logged anything yet), then run `kubectl describe pod <pod-name>` for exit codes and recent events. Check specifically for a misconfigured liveness probe killing an otherwise-healthy container, an application error on startup, or a missing dependency/configuration.

How do you debug a Pod stuck in ImagePullBackOff or Pending?

`ImagePullBackOff` means the kubelet can't pull the specified container image — check for a typo in the image name/tag, a private registry requiring credentials that aren't configured (`imagePullSecrets`), or a network issue reaching the registry. A Pod stuck `Pending` (with no container-level errors) usually means the scheduler can't find a node satisfying its requirements — `kubectl describe pod` shows the specific reason directly in its Events section, most commonly insufficient resources, an unsatisfied affinity/taint rule, or a volume that can't be provisioned/attached.

What tools and commands do you use to investigate a failing pod?

The core toolkit: `kubectl describe pod` (events, conditions, container states; the first stop for almost any problem), `kubectl logs` (and `--previous` for a crashed container's last output), `kubectl exec -it <pod-name> -- sh` (an interactive shell inside a running container), `kubectl get events --sort-by=.lastTimestamp` (cluster-wide recent events, useful when you're not sure which object is actually at fault), and `kubectl top pod`/`kubectl top node` (current CPU/memory usage, when metrics-server is installed).

What causes a pod to be OOMKilled, and how do you diagnose and fix it?

A Pod is OOMKilled when a container's memory usage exceeds its configured memory limit (or, less commonly, the node itself runs out of memory entirely, even for containers under their individual limits) — the kernel's OOM killer terminates the process, and Kubernetes reports `OOMKilled` as the termination reason. Diagnose by confirming the reason via `kubectl describe pod` and checking actual memory usage trends (via `kubectl top` or a metrics/monitoring system) against the configured limit, then either fix a genuine memory leak in the application or raise the limit to a realistic, measured value if the usage is legitimate.

How does centralized logging typically work in Kubernetes, given pods are ephemeral?

Container stdout/stderr is written to a file on the node by the container runtime, and a log-collection agent — almost always deployed as a DaemonSet so it runs on every node — tails these log files and forwards them to a centralized, durable logging backend (Elasticsearch, Loki, a cloud logging service). This decouples log durability from any individual Pod's lifetime, since a Pod's local log files disappear when it's deleted, but the already-shipped copies in the centralized backend persist and remain searchable.

How does Kubernetes expose metrics, and what's the role of metrics-server vs. Prometheus?

**metrics-server** collects lightweight, real-time CPU/memory resource metrics from every node's kubelet and exposes them through the Kubernetes Metrics API — used specifically to power `kubectl top` and the Horizontal Pod Autoscaler's resource-based scaling, but it stores no history at all (only the current snapshot). **Prometheus** is a full-featured, general-purpose monitoring system that scrapes and stores detailed, historical time-series metrics from many sources (application-level custom metrics, node-level metrics, Kubernetes object states), enabling dashboards, alerting, and long-term trend analysis that metrics-server was never designed to provide.

Why might a pod be Running but not receiving any traffic?

The most common cause is a failing readiness probe: the Pod shows phase `Running`, but its `Ready` condition is `False`, so it's excluded from the Service's endpoints and receives no traffic even though the container itself is executing normally. Other causes: the Service's label selector doesn't match the Pod's labels (a typo, or a label that was changed), the Service's targetPort doesn't match the port the application actually listens on, or a NetworkPolicy is unexpectedly blocking the traffic path.

Observability and Troubleshooting

Probes, common failure states, debugging tools, logging, and metrics for diagnosing problems in a running cluster.

Why three different probes exist

"Is this container healthy" turns out to have several distinct meanings, and conflating them causes real production problems — a container that's alive but not yet ready to serve traffic (still loading a large cache) shouldn't be killed, and a slow-starting container shouldn't be judged against the same timing as a fully-warmed-up one.

Liveness probe — is this container still working?

livenessProbe:
  httpGet:
    path: /healthz
    port: 8080
  initialDelaySeconds: 15
  periodSeconds: 10
  failureThreshold: 3

If this probe fails failureThreshold times in a row, the kubelet kills the container and restarts it (subject to the Pod's restartPolicy). This is meant to catch situations where a process is technically still running but has gotten into a genuinely broken state it can't recover from on its own (deadlocked, stuck in an infinite loop) — a restart is the appropriate remedy. A liveness probe should only fail for problems a restart would actually fix — a liveness probe that checks a downstream dependency (like a database connection) is a common and dangerous misconfiguration, since it causes the container to be endlessly restarted for a problem restarting it can't solve at all (the database being down), rather than just marking it not-ready.

Readiness probe — is this container currently able to serve traffic?

readinessProbe:
  httpGet:
    path: /ready
    port: 8080
  periodSeconds: 5

If this probe fails, the Pod is removed from the Service's endpoints (see the networking topic) — traffic stops being routed to it — but the container is left running, not restarted. This is the correct mechanism for temporary, self-resolving unavailability: warming up a cache on startup, briefly reconnecting to a dependency, or gracefully draining in-flight requests before shutdown. This is also exactly the mechanism that makes rolling updates safe (see the workload controllers topic) — a new Pod only starts receiving traffic once its readiness probe passes.
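
To make the liveness/readiness split concrete, here is a minimal sketch of how an application might back the two endpoints used in the manifests above. The `APP_STATE` flags, `probe_status`, and `serve` are hypothetical names chosen for illustration; the only assumption about Kubernetes is that an HTTP probe treats a 2xx/3xx response as success.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical application state; a real service would flip these flags itself.
APP_STATE = {"cache_warm": False, "db_connected": False}


def probe_status(path: str, state: dict) -> int:
    """Return the HTTP status a probe endpoint should answer with."""
    if path == "/healthz":
        # Liveness: the process can answer at all, so it is alive.
        # Deliberately ignores downstream dependencies.
        return 200
    if path == "/ready":
        # Readiness: depends on warm-up and dependency state.
        ready = state["cache_warm"] and state["db_connected"]
        return 200 if ready else 503
    return 404


class ProbeHandler(BaseHTTPRequestHandler):
    def do_GET(self) -> None:
        self.send_response(probe_status(self.path, APP_STATE))
        self.end_headers()


def serve(port: int = 8080) -> None:
    """Blocks forever; port 8080 matches the probe manifests above."""
    HTTPServer(("", port), ProbeHandler).serve_forever()
```

With the database down, `/ready` answers 503 (the Pod leaves the Service endpoints) while `/healthz` keeps answering 200, so the container is not restarted for a problem a restart cannot fix.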

Startup probe — protects slow-starting containers

startupProbe:
  httpGet:
    path: /healthz
    port: 8080
  failureThreshold: 30
  periodSeconds: 10       # allows up to 300 seconds (30 x 10) for startup
livenessProbe:
  httpGet:
    path: /healthz
    port: 8080
  periodSeconds: 10

While a configured startup probe has not yet succeeded, liveness and readiness probes are not run at all. This exists for applications with a slow, variable startup time (a large in-memory cache warm-up, a JVM application with a long class-loading phase), where a liveness probe's tighter timing (tuned for steady-state health checking) would otherwise kill the container while it is still in its legitimate startup phase. If the startup probe itself never succeeds within failureThreshold × periodSeconds, the kubelet kills the container and applies the Pod's restartPolicy, just as it would for a liveness failure.
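
The timing difference can be worked out from the numbers in the manifests above. The helper below is an illustrative approximation, not kubelet logic: it treats the budget as initialDelaySeconds plus failureThreshold × periodSeconds and ignores timeoutSeconds and the exact moment the first probe fires.

```python
def probe_budget_seconds(failure_threshold: int, period_seconds: int,
                         initial_delay_seconds: int = 0) -> int:
    """Approximate time a container gets before consecutive failures trip the probe."""
    return initial_delay_seconds + failure_threshold * period_seconds


# Startup probe from the manifest above: failureThreshold 30, periodSeconds 10.
startup_budget = probe_budget_seconds(failure_threshold=30, period_seconds=10)

# Liveness probe from the earlier example: initialDelaySeconds 15, 3 failures, 10s apart.
liveness_budget = probe_budget_seconds(failure_threshold=3, period_seconds=10,
                                       initial_delay_seconds=15)

print(startup_budget)   # 300
print(liveness_budget)  # 45
```

An application that needs two minutes to warm up would be killed by the liveness settings alone, but fits comfortably inside the startup probe's budget.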

Side-by-side summary

Probe	Checks	On failure	Typical use
Liveness	Is the process still functioning	Kill and restart the container	Detecting deadlocks/unrecoverable internal states
Readiness	Can it currently serve traffic	Remove from Service endpoints (no restart)	Temporary unavailability, warm-up, graceful shutdown
Startup	Has the (slow) startup completed	Liveness/readiness checks stay disabled until it succeeds; if it never succeeds within failureThreshold × periodSeconds, the container is killed and restarted	Slow-starting applications, avoiding premature liveness kills

Why getting this wrong causes real incidents

The single most common probe misconfiguration is a liveness probe that's too strict, or checks the wrong thing (a downstream dependency instead of the process's own health) — this produces exactly the symptom covered in the CrashLoopBackOff troubleshooting question: a container repeatedly killed and restarted for a condition restarting can never actually fix, often making an already-degraded situation (a slow dependency) actively worse by adding restart churn on top of it.

Related Resources

Kubernetes: Configure Liveness, Readiness and Startup Probes

What CrashLoopBackOff actually means

The container is starting, then exiting (crashing, or being killed) repeatedly, and Kubernetes is deliberately backing off between restart attempts (waiting progressively longer — 10s, 20s, 40s, up to a cap around 5 minutes) rather than restarting instantly and indefinitely, which would otherwise hammer the node with a tight restart loop.

kubectl get pods
# NAME        READY   STATUS             RESTARTS   AGE
# my-app-xyz  0/1     CrashLoopBackOff   7          12m
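
The backoff schedule described above can be sketched as a doubling delay with a ceiling. The base of 10 seconds and the cap of 300 seconds come from the description in this section; the function name is illustrative, and the real kubelet also resets the delay after a container has run cleanly for a while, which this sketch leaves out.

```python
def restart_delays(restarts: int, base: int = 10, cap: int = 300) -> list[int]:
    """Delay in seconds before each restart attempt, doubling up to a cap."""
    return [min(base * 2 ** attempt, cap) for attempt in range(restarts)]


# Seven restarts, matching the RESTARTS column in the output above.
print(restart_delays(7))  # [10, 20, 40, 80, 160, 300, 300]
```

This is why a Pod deep into a crash loop can sit for minutes between attempts even after the underlying problem has been fixed.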

Step 1: check the previous (crashed) container's logs

kubectl logs my-app-xyz --previous

This is the single most important first command: --previous retrieves the logs of the last terminated instance of the container, which usually contain the actual error message explaining why it crashed (a stack trace, a missing environment variable error, a failed database connection). Without --previous, kubectl logs shows the current attempt, which may still be starting and may not have logged anything yet, so its output is often empty or unhelpful.

Step 2: check describe for exit code and events

kubectl describe pod my-app-xyz

Look specifically at:

Last State: Terminated, Reason, Exit Code: a specific exit code narrows down the cause considerably. 0 is a clean exit (odd for a container that's supposed to run forever; it usually means the main process finished normally when it shouldn't have). 1 is a generic application error. 137 is 128 + 9 (SIGKILL): often an OOMKill (see that question), or a container that was force-killed after not shutting down within its termination grace period, for example following a liveness probe failure. 143 is 128 + 15 (SIGTERM): a graceful termination request, possibly from a liveness probe failure or a manual action.
Events at the bottom — often shows directly whether a liveness probe is failing and killing the container (Liveness probe failed: ...), which points straight at a probe misconfiguration rather than the application itself being broken.
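
The exit-code arithmetic above follows the usual shell convention that a code above 128 means "killed by signal (code minus 128)". A small sketch of that decoding, with a hand-written signal table limited to the two signals discussed here (the function name and wording of the results are illustrative):

```python
# Only the signals discussed in this section; other numbers fall through to a generic label.
SIGNAL_NAMES = {9: "SIGKILL", 15: "SIGTERM"}


def describe_exit_code(code: int) -> str:
    """Translate a container exit code into a rough first hypothesis."""
    if code == 0:
        return "clean exit"
    if code > 128:
        signum = code - 128
        name = SIGNAL_NAMES.get(signum, f"signal {signum}")
        return f"terminated by {name}"
    return "application error"


for code in (0, 1, 137, 143):
    print(code, describe_exit_code(code))
```

The decoded signal is only a starting point: 137 still needs the Reason field (OOMKilled or not) to tell a memory kill from a forced shutdown.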

Common root causes, roughly in order of frequency

Application error on startup — a missing environment variable, a bad configuration file, an unhandled exception during initialization. The --previous logs should show this directly.
Misconfigured liveness probe — the probe is checking something that isn't actually indicative of a fatal problem (e.g., a downstream dependency being briefly unavailable), causing an otherwise-healthy container to be killed repeatedly (see the probes question).
Missing dependency/resource — the application can't reach a required database, another service, or a mounted ConfigMap/Secret that doesn't exist or is misnamed.
OOMKilled (see that question) — the container is repeatedly exceeding its memory limit; describe pod will show OOMKilled as the termination reason distinctly from a generic crash.
Immediate exit due to incorrect container command/entrypoint — e.g., a container built to run a one-shot script rather than a long-running server process, used incorrectly in a Deployment (which expects the main process to keep running).

When logs alone aren't enough

kubectl exec -it my-app-xyz -- /bin/sh    # only works while the container is actually running

If the crash happens too fast to exec into the container, consider temporarily overriding the Pod's command to something that keeps it alive long enough to investigate interactively (command: ["sleep", "3600"], in a debug copy of the manifest — never in the real production manifest), or use kubectl debug (a newer, purpose-built command for attaching an ephemeral debug container to a running or crashing Pod) to get a shell alongside the problematic container without needing to modify its spec at all.

Naming --previous specifically (rather than just "check the logs") is a strong, concrete signal of hands-on debugging experience — it's the detail that trips up people who've only read about Kubernetes without actually having debugged a real crash loop.

Related Resources

Kubernetes: Debug Running Pods

ImagePullBackOff — the kubelet can't pull the image

kubectl describe pod my-app-xyz
# Events:
#   Warning  Failed     kubelet  Failed to pull image "myapp:1.0.": rpc error: ...
#   Warning  BackOff    kubelet  Back-off pulling image "myapp:1.0."

Common causes, in rough order of frequency:

Typo in the image name or tag — a trailing period (as in the example above — myapp:1.0. instead of myapp:1.0), a misspelled repository name, or a tag that was never actually pushed. kubectl describe pod shows the exact image string the kubelet tried to pull, and the exact error the registry returned — often enough to spot the typo immediately.
Private registry requiring authentication — if the image is in a private registry and no credentials are configured, the pull fails with an authentication/authorization error. Fix by creating a docker-registry Secret and referencing it via imagePullSecrets in the Pod spec (or the associated ServiceAccount, so every Pod using that ServiceAccount picks it up automatically).

spec:
  imagePullSecrets:
    - name: my-registry-credentials
  containers:
    - name: app
      image: private-registry.example.com/myapp:1.0

Network connectivity issue — the node genuinely can't reach the registry (a firewall rule, a DNS resolution problem, the registry being down) — worth checking directly from a node or a debug Pod if credentials and image name both check out.
Rate limiting — some public registries (notably Docker Hub) impose pull rate limits per IP/account; a burst of Pod creations across many nodes can occasionally hit this, especially without an authenticated account configured for higher limits.

Pending — the scheduler can't place the Pod anywhere

kubectl describe pod my-app-xyz
# Events:
#   Warning  FailedScheduling  default-scheduler  0/5 nodes are available:
#     3 Insufficient memory, 2 node(s) had taint {dedicated: gpu}, that the pod didn't tolerate.

The Events section of describe pod is the essential first stop — it states, in plain language, exactly why every node was rejected during scheduling's filtering phase (see the scheduling question). Common reasons:

Insufficient resources — no node has enough unreserved CPU/memory to satisfy the Pod's requests; either the cluster genuinely needs more capacity (Cluster Autoscaler should address this automatically if configured — see that question), or the Pod's requests are set unrealistically high.
Unsatisfied taints/tolerations or node affinity — the Pod requires something (a specific node label, tolerance for a taint) that no current node provides.
Volume/topology issues — a PersistentVolumeClaim can't be bound or provisioned (e.g., a StorageClass misconfiguration, or a zone-topology mismatch between where the volume was created and where the Pod could be scheduled — see the StorageClass question).
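Admission rejection (a ResourceQuota or an admission webhook): strictly speaking this does not produce a Pending Pod, because a Pod rejected at admission is never created. If expected Pods are missing altogether rather than Pending, check the events of the owning ReplicaSet or Deployment for quota or webhook errors. A PodDisruptionBudget, by contrast, only limits voluntary evictions and does not stop new Pods from being scheduled.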
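
The "insufficient resources" case is easier to reason about once it is clear that the scheduler compares the Pod's requests against what is still unreserved on each node, not against live usage. The sketch below models only that one memory check with made-up node figures; the real scheduler applies many more filters (CPU, taints, affinity, volumes).

```python
def fits(allocatable_mib: int, requested_mib: int, pod_request_mib: int) -> bool:
    """True if the node's unreserved memory covers the Pod's memory request."""
    return allocatable_mib - requested_mib >= pod_request_mib


def nodes_with_room(nodes: dict, pod_request_mib: int) -> list:
    """Names of nodes that pass the memory check, in insertion order."""
    return [name for name, (allocatable, requested) in nodes.items()
            if fits(allocatable, requested, pod_request_mib)]


# Hypothetical cluster: node name -> (allocatable MiB, MiB already requested by other Pods).
nodes = {"node-a": (8192, 7900), "node-b": (8192, 6000), "node-c": (4096, 4000)}

print(nodes_with_room(nodes, 512))   # ['node-b']
print(nodes_with_room(nodes, 4096))  # []
```

The second call is the Pending case: every node is rejected, even if actual memory usage on those nodes happens to be low.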

The universal first diagnostic command

kubectl describe pod <pod-name>

For both ImagePullBackOff and Pending, this single command's Events section is almost always where the actual, specific, human-readable reason lives — the general debugging instinct should always be "read the events before guessing," rather than jumping straight to speculation about what might be wrong.

Related Resources

Kubernetes: Debug Pods

kubectl describe — the essential first command

kubectl describe pod my-app-xyz

Shows the Pod's full spec, its current status/conditions (see the pod-lifecycle question), each container's current and last state (with exit codes/reasons), and — critically — the Events section at the bottom, which is a chronological log of everything that's happened to this specific object recently (scheduling decisions, probe failures, image pull attempts). This should almost always be the very first command run when investigating any Pod problem.

kubectl logs — application-level output

kubectl logs my-app-xyz                      # current container's stdout/stderr
kubectl logs my-app-xyz --previous            # the LAST TERMINATED instance's logs (essential for crash loops)
kubectl logs my-app-xyz -c sidecar-container   # a specific container, for multi-container Pods
kubectl logs my-app-xyz --since=10m            # only recent logs, useful on a noisy long-running app
kubectl logs -f my-app-xyz                     # follow/stream logs live

kubectl exec — an interactive shell inside the container

kubectl exec -it my-app-xyz -- /bin/sh

Lets you poke around inside a currently running container directly — check environment variables, test network connectivity to a dependency, inspect mounted config files. Only useful if the container stays up long enough to attach to (not helpful for a container that crashes within milliseconds of starting).

kubectl debug — attaching to a Pod without modifying it

kubectl debug -it my-app-xyz --image=busybox --target=app    # --target names a container in the Pod, not the Pod itself

A more modern alternative for cases where exec isn't sufficient — e.g., the container image itself has no shell at all (common for minimal/distroless production images), or the Pod is crashing too fast to exec into. This attaches an ephemeral debug container to the existing Pod (sharing its network/process namespace, depending on flags), letting you investigate using a full-featured debug image without altering the original Pod's spec.

kubectl get events — cluster/namespace-wide recent activity

kubectl get events --sort-by=.lastTimestamp -n production

Useful when you're not yet sure which specific object is actually at fault — shows recent events across the whole namespace (or cluster, with -A), which can reveal a problem at a different layer than the one you started investigating (e.g., you're looking at a Pod, but the real root cause event was a failed PVC provisioning or a node becoming NotReady).

kubectl top — current resource usage

kubectl top pod my-app-xyz
kubectl top node

Requires metrics-server (or an equivalent) to be running in the cluster (see the HPA question) — shows current, real-time CPU/memory usage, useful for confirming whether a Pod is actually approaching its resource limits (a lead-in to investigating OOMKills or CPU throttling — see those questions) without needing a full metrics/monitoring stack for a quick, immediate check.

The general debugging workflow this toolkit supports

kubectl get pods — spot which Pod(s) are unhealthy and their current status/phase.
kubectl describe pod — get the specific reason (events, conditions, container states).
kubectl logs (with --previous if relevant) — get the application's own explanation, if it logged one.
kubectl exec/kubectl debug — interactively investigate further if logs/describe aren't sufficient.
kubectl get events/kubectl top — widen the investigation if the root cause seems to be somewhere other than the Pod itself.

Being fluent with this sequence — not just knowing the commands exist, but knowing the order and reason to reach for each — is what separates real hands-on troubleshooting experience from surface familiarity with kubectl's command list.

Related Resources

Kubernetes: Troubleshoot Applications

Confirming OOMKilled is actually the cause

kubectl describe pod my-app-xyz
# Last State:  Terminated
#   Reason:    OOMKilled
#   Exit Code: 137

Exit Code: 137 (128 + 9, where 9 is SIGKILL) combined with Reason: OOMKilled confirms this specific cause definitively — distinguishing it from a generic application crash, which would show a different exit code and reason. This distinction matters because the fix is completely different depending on which one occurred.

The two different scenarios that both produce OOMKilled

Container exceeded its own memory limit — the most common case; the container's cgroup memory usage crossed the configured resources.limits.memory value, and the kernel killed it specifically for exceeding that container-level boundary.
The node itself ran out of memory — less common, but possible if requests/limits across the node are poorly configured (heavy overcommitment) or the kubelet's node-level memory-pressure eviction didn't act quickly enough; in this case even a container technically under its own individual limit can be killed as part of the node trying to reclaim memory generally.

Diagnosing whether this is a real leak or just an under-provisioned limit

kubectl top pod my-app-xyz --containers

Or, better, look at the actual memory usage trend over time in a real monitoring system (Prometheus/Grafana) rather than a single snapshot. A memory usage graph that climbs steadily and never plateaus, correlating with time since the container started (not with request volume), strongly suggests a genuine memory leak in the application: something that will eventually hit any limit you set, no matter how generous, and needs an actual code fix. A graph that climbs with load and levels off toward a value at or just above the configured limit (so the container is killed near its steady state rather than after unbounded growth) suggests the limit is simply set too low for the application's legitimate working set. The fix there is raising the limit to a value with reasonable headroom above observed real usage, not a code change.
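
As a rough illustration of the "still climbing versus levelled off" distinction, the sketch below looks only at the second half of a series of evenly spaced memory samples. The function name, the 2% tolerance, and the sample values are all assumptions made for the example; a real assessment would use the monitoring system's graphs and also compare against request volume, which this heuristic ignores.

```python
def classify_memory_trend(samples_mib: list, flat_tolerance: float = 0.02) -> str:
    """Label a series of evenly spaced memory samples by its late-stage growth."""
    if len(samples_mib) < 4:
        raise ValueError("need at least 4 samples")
    late = samples_mib[len(samples_mib) // 2:]
    if late[0] <= 0:
        raise ValueError("samples must be positive")
    # Relative growth across the second half of the observation window.
    late_growth = (late[-1] - late[0]) / late[0]
    return "plateaued" if late_growth <= flat_tolerance else "still climbing"


print(classify_memory_trend([100, 200, 300, 400, 500, 600]))  # still climbing
print(classify_memory_trend([100, 300, 450, 500, 505, 506]))  # plateaued
```

"Still climbing" is a prompt to look for a leak; "plateaued" near the limit points toward an under-provisioned limit instead.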

Fixing a genuine memory leak

This requires actual application-level investigation — heap dumps/profiling tools appropriate to the language runtime (e.g., a Java heap dump analyzed with a profiler, Node.js's --inspect and Chrome DevTools memory profiling, Python's tracemalloc) to identify what's actually accumulating and never being released. Kubernetes-level tooling can tell you that memory is growing unboundedly and when the kill happens, but not why the application's own code is holding onto memory it should have freed — that's an application-level debugging problem layered on top of the Kubernetes-level symptom.

Fixing an under-provisioned limit

resources:
  requests:
    memory: "512Mi"    # raised to reflect realistic steady-state usage
  limits:
    memory: "768Mi"    # some headroom above typical peak, not unlimited

Raise the limit based on actual measured usage data, not a guess — and consider whether a Vertical Pod Autoscaler (see the scheduling topic), run in recommendation mode, could help right-size this automatically based on real historical usage rather than manual tuning each time.

A subtlety worth knowing: OOMKilled doesn't always mean CrashLoopBackOff

A single OOMKill, if the container then starts fine and runs stably afterward, just shows up as one restart with OOMKilled as the previous state — it only becomes a CrashLoopBackOff if the container keeps hitting the same memory ceiling repeatedly, shortly after each restart. Seeing one isolated OOMKill in history is a signal worth investigating but not necessarily an active incident; a repeating pattern of OOMKills is the more urgent case demanding immediate action.

Related Resources

Kubernetes: Resource Management for Pods and Containers

Why Pod ephemerality makes centralized logging necessary

kubectl logs reads directly from a log file the container runtime maintains on the node the Pod is (or was recently) running on — the moment a Pod is deleted (rescheduled, scaled down, or replaced during a rollout), those local log files are eventually cleaned up too, and kubectl logs for that specific Pod name no longer works at all. For anything beyond quick, live debugging of a still-running Pod, relying on kubectl logs alone is insufficient — you need logs collected and stored somewhere durable, independent of any individual Pod's lifetime.

The standard architecture: a log-shipping DaemonSet

Node                            Node
┌──────────────────────────┐    ┌──────────────────────────┐
│ Pod A -> stdout/stderr   │    │ Pod C -> stdout/stderr   │
│   -> log file on node    │    │   -> log file on node    │
│ Pod B -> stdout/stderr   │    │                          │
│   -> log file on node    │    │ Fluent Bit (DaemonSet)   │
│                          │    │   tails log files        │
│ Fluent Bit (DaemonSet)   │    │   ships them out         │
│   tails log files        │    └────────────┬─────────────┘
│   ships them out         │                 │
└────────────┬─────────────┘                 │
             │                               │
             └───────────────┬───────────────┘
                             ▼
                Centralized logging backend
          (Elasticsearch, Loki, CloudWatch, etc.)

A log-shipping agent (Fluentd, Fluent Bit, Vector, or a cloud provider's own agent) runs as a DaemonSet (see the workload controllers topic — this is exactly the canonical DaemonSet use case) so exactly one copy runs on every node, continuously tailing every container's log files on that node and forwarding them to a centralized backend, often enriching each log line with useful Kubernetes metadata (Pod name, namespace, labels) along the way.
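
What the agent does to each line can be sketched in a few lines. This assumes the node log files use the CRI layout of `timestamp stream tag message` (where the tag is `F` for a full line and `P` for a partial one); the function names and the shape of the `kubernetes` metadata block are illustrative, not any particular agent's schema.

```python
def parse_cri_line(line: str) -> dict:
    """Split one CRI-formatted log line into its four fields."""
    parts = line.rstrip("\n").split(" ", 3)
    timestamp, stream, tag = parts[0], parts[1], parts[2]
    message = parts[3] if len(parts) > 3 else ""  # tolerate empty messages
    return {"time": timestamp, "stream": stream, "partial": tag == "P", "message": message}


def enrich(record: dict, pod: str, namespace: str, labels: dict) -> dict:
    """Attach Kubernetes metadata so the line stays searchable after the Pod is gone."""
    return {**record, "kubernetes": {"pod": pod, "namespace": namespace, "labels": dict(labels)}}


raw = "2024-01-01T12:00:00.123456789Z stderr F connection refused: db:5432"
record = enrich(parse_cri_line(raw), pod="my-app-xyz", namespace="production",
                labels={"app": "my-app"})
print(record["stream"], record["message"])
print(record["kubernetes"]["pod"])
```

The enriched record is what gets forwarded to the backend, which is why a deleted Pod's logs can still be filtered by Pod name, namespace, or label later.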

Common centralized logging backends

Elasticsearch (or OpenSearch) + Kibana — the classic "ELK/EFK stack" (Elasticsearch, Fluentd, Kibana), offering rich full-text search and dashboarding.
Grafana Loki — a more lightweight, cost-efficient alternative that indexes only metadata/labels (not full log text), often paired with Grafana for visualization, appealing when Elasticsearch's resource cost and operational complexity are undesirable.
Cloud-native logging services — AWS CloudWatch Logs, GCP Cloud Logging, Azure Monitor Logs — convenient when already running on that cloud provider, with less operational overhead than self-hosting a logging stack.

An alternative pattern: sidecar-based log shipping

Instead of a node-level DaemonSet, some setups use a sidecar container per Pod (see the sidecar pattern question) specifically to ship that one Pod's logs — useful when an application writes logs to a file rather than stdout/stderr (requiring a sidecar with a shared volume to tail that specific file), but generally higher overhead (one shipping agent per Pod rather than one per node) than the DaemonSet approach, and used more selectively for that reason.

Best practice: log to stdout/stderr, not files

Applications running in containers should generally write logs to stdout/stderr rather than to files within the container — this is what the container runtime and standard log-shipping DaemonSets are built to capture automatically, without needing any sidecar or special per-application configuration. Writing to internal files instead requires extra plumbing (a sidecar, or a shared volume) to get those logs collected at all — one of the "twelve-factor app" principles that maps directly onto how Kubernetes logging infrastructure expects applications to behave.
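
On the application side, following this practice usually takes a few lines of configuration. A minimal sketch using Python's standard `logging` module; the logger name and format string are arbitrary choices for the example.

```python
import logging
import sys


def build_logger(name: str = "app") -> logging.Logger:
    """Return a logger that writes to stdout, where the container runtime captures it."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid stacking duplicate handlers on repeated calls
        handler = logging.StreamHandler(sys.stdout)
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s %(message)s"))
        logger.addHandler(handler)
    return logger


logger = build_logger()
logger.info("service started")
```

No file paths, rotation settings, or shared volumes are involved; the node-level agent picks the output up without any per-application setup.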

Related Resources

Kubernetes: Logging Architecture

metrics-server — minimal, real-time, no history

kubectl top pod
kubectl top node

metrics-server collects basic CPU/memory usage from every node's kubelet (which itself gets this data from cAdvisor, embedded in the kubelet) at a regular interval, and exposes it through the standard Kubernetes Metrics API (metrics.k8s.io). This is deliberately minimal by design: it holds only the current/most recent snapshot in memory — no historical data, no long-term storage, no querying capability beyond "what's the current usage." Its entire purpose is powering exactly two things: kubectl top (for quick, ad-hoc human inspection) and the Horizontal Pod Autoscaler's resource-based scaling decisions (see that question), both of which only need the current value, not history.

Prometheus — full-featured monitoring and alerting

Prometheus is a general-purpose time-series monitoring system, not Kubernetes-specific, but with excellent native Kubernetes integration (via service discovery that automatically finds Pods/Services to scrape based on annotations or the Kubernetes API). It scrapes metrics endpoints (applications, and cluster components, expose metrics in Prometheus's text format at an HTTP endpoint, typically /metrics) at a configured interval and stores the resulting time series durably, supporting rich querying (via PromQL), dashboards (commonly via Grafana), and alerting (via Alertmanager) based on arbitrary conditions over time (e.g., "alert if error rate exceeds 5% for more than 5 minutes").

Prometheus scrapes:
  - kube-state-metrics (Kubernetes object states: Deployment replica counts, Pod phases, etc.)
  - node-exporter (node-level OS metrics: disk, network, detailed CPU/memory)
  - application /metrics endpoints (custom, application-specific metrics)
  - cAdvisor (container-level resource usage, more detailed than metrics-server's snapshot)
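
For a sense of what an application /metrics endpoint returns, here is a hand-rolled sketch of the text exposition format for a single counter. The metric name and labels are invented for the example, label values are not escaped, and a real service would normally use an official Prometheus client library rather than formatting lines itself.

```python
from typing import Optional


def render_counter(name: str, help_text: str, value: float,
                   labels: Optional[dict] = None) -> str:
    """Render one counter sample in Prometheus text exposition format."""
    label_part = ""
    if labels:
        # Sorted for stable output; values are assumed to need no escaping.
        pairs = ",".join(f'{key}="{val}"' for key, val in sorted(labels.items()))
        label_part = "{" + pairs + "}"
    return (
        f"# HELP {name} {help_text}\n"
        f"# TYPE {name} counter\n"
        f"{name}{label_part} {value}\n"
    )


print(render_counter("http_requests_total", "Total HTTP requests handled.",
                     1027, {"method": "get", "code": "200"}), end="")
```

Prometheus scrapes this text at each interval and stores the value with a timestamp, which is what turns a current counter into a queryable time series.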

Why you typically need both, not one or the other

metrics-server and Prometheus solve genuinely different problems and commonly coexist in the same cluster: metrics-server is the lightweight, always-on dependency that HPA and kubectl top specifically require (and Prometheus doesn't natively plug into the HPA's resource-metric mode without an additional adapter); Prometheus is the tool for everything else — dashboards, alerting, capacity planning, debugging a specific incident by looking at historical trends, and scaling on custom application-level metrics (which does require a Prometheus adapter to feed into the Custom Metrics API for HPA to consume — see that question).

kube-state-metrics — a commonly paired component

Distinct from both of the above: kube-state-metrics exposes the state of Kubernetes objects themselves as Prometheus-scrapeable metrics (how many replicas a Deployment currently has vs. desires, how many Pods are in each phase, node conditions) — this is object-state information, not resource-usage information, and is what lets Prometheus/Grafana dashboards show things like "how many Deployments currently have fewer ready replicas than desired" across a whole cluster.

Every cluster running an HPA needs metrics-server (or, more rarely, an equivalent custom metrics pipeline) as a baseline dependency; any cluster that cares about historical trends, alerting, or custom application metrics needs a full monitoring stack like Prometheus (often bundled as the "kube-prometheus-stack" Helm chart, including Prometheus, Grafana, Alertmanager, and kube-state-metrics together) layered on top — treating metrics-server as a substitute for real monitoring is a common early-stage mistake, since it was never designed to serve that broader purpose.

Related Resources

Kubernetes Metrics Server

Prometheus: Kubernetes Monitoring

This is a genuinely common real-world confusion: kubectl get pods shows Running, everything looks healthy at a glance, yet requests to the Service time out or fail. The phase: Running field alone is not sufficient evidence the Pod is actually reachable — several other conditions have to also be true.

Cause 1: Failing readiness probe (the most common cause)

kubectl get pods
# NAME        READY   STATUS    RESTARTS   AGE
# my-app-xyz  0/1     Running   0          5m     <- Running, but READY shows 0/1

READY: 0/1 alongside STATUS: Running is the telltale sign — the container is executing, but its readiness probe (see that question) is currently failing, so the Pod has been removed from (or never added to) the Service's endpoints. Confirm directly:

kubectl describe pod my-app-xyz
# Look for: Readiness probe failed: ... in the Events section

Fix by understanding why the readiness check is failing (a slow-to-warm-up dependency, an incorrect readiness endpoint path, a genuine application problem) — this is functioning exactly as designed (the whole point of readiness probes is to prevent traffic from reaching a Pod that isn't actually able to serve it), so the fix is addressing the underlying readiness condition, not disabling the probe.

Cause 2: Service selector doesn't match the Pod's labels

kubectl get endpoints my-service
# NAME         ENDPOINTS   AGE
# my-service   <none>      10m      <- no endpoints at all, even though pods exist and are Ready

If a Service's selector doesn't match any Pod's actual labels (a typo, or the Pod's labels were changed without updating the Service, or vice versa), the Service has zero endpoints, regardless of how many perfectly healthy, Ready Pods exist elsewhere in the namespace. Compare the Service's selector directly against the Pod's actual labels:

kubectl get service my-service -o jsonpath='{.spec.selector}'
kubectl get pod my-app-xyz --show-labels
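
The comparison those two commands support is a simple subset check: every key/value pair in the Service's selector must appear in the Pod's labels, and extra Pod labels are ignored. A minimal sketch with an illustrative function name and made-up labels:

```python
def selector_matches(selector: dict, labels: dict) -> bool:
    """True if every selector pair is present, with the same value, in the Pod's labels."""
    # A Service without a selector selects no Pods, so an empty selector never matches.
    return bool(selector) and all(labels.get(key) == value
                                  for key, value in selector.items())


pod_labels = {"app": "my-app", "tier": "backend", "pod-template-hash": "xyz"}

print(selector_matches({"app": "my-app"}, pod_labels))                  # True
print(selector_matches({"app": "myapp"}, pod_labels))                   # False (typo)
print(selector_matches({"app": "my-app", "tier": "web"}, pod_labels))   # False
```

A single mismatched value is enough to leave the Service with no endpoints, which is why the typo case is so common.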

Cause 3: Port mismatch between Service and container

# Service expects the container to listen on 8080...
spec:
  ports:
    - port: 80
      targetPort: 8080
# ...but the container actually listens on a different port
containers:
  - name: app
    ports:
      - containerPort: 3000    # mismatch!

Even with correct labels and a passing readiness probe, if the actual application inside the container listens on a different port than the Service's targetPort expects, connections will fail once they reach the Pod — this is a configuration mismatch between the Service spec and the actual application, not something Kubernetes can detect or warn about on its own.

Cause 4: NetworkPolicy blocking the traffic path

If NetworkPolicies are enforced in the cluster (see the networking topic), a policy scoped to the target Pod might be blocking ingress from the specific source attempting to connect — worth checking explicitly (kubectl get networkpolicy -n <namespace>) if labels, readiness, and ports all check out but traffic still isn't getting through, especially in a namespace with a default-deny posture where a needed allow rule might simply be missing.

The systematic debugging order

Check READY column in kubectl get pods — is the Pod actually marked Ready, not just Running?
Check kubectl get endpoints <service> — does the Service actually have any endpoints listed at all?
Compare the Service's selector against the Pod's actual labels directly.
Verify the Service's targetPort matches what the container actually listens on.
Check for NetworkPolicies that might be blocking the specific traffic path, if steps 1-4 all check out.

Working through these in order, rather than guessing, resolves the overwhelming majority of "Running but unreachable" cases quickly.
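
The order above can be written down as a small checklist function. Everything here is a sketch over values you would gather by hand from the kubectl commands in this section; the `Snapshot` fields and check names are hypothetical, and nothing in it talks to a cluster.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    """Facts collected manually about one Pod and the Service in front of it."""
    pod_ready: bool
    endpoint_count: int
    selector: dict
    pod_labels: dict
    target_port: int
    listening_port: int
    network_policy_allows: bool = True


def failing_checks(s: Snapshot) -> list:
    """Return the failed checks in the same order as the debugging steps."""
    selector_ok = bool(s.selector) and all(
        s.pod_labels.get(key) == value for key, value in s.selector.items())
    checks = [
        ("1. Pod is Ready", s.pod_ready),
        ("2. Service has endpoints", s.endpoint_count > 0),
        ("3. selector matches Pod labels", selector_ok),
        ("4. targetPort matches listening port", s.target_port == s.listening_port),
        ("5. NetworkPolicy allows the path", s.network_policy_allows),
    ]
    return [name for name, ok in checks if not ok]


port_mismatch = Snapshot(pod_ready=True, endpoint_count=1, selector={"app": "my-app"},
                         pod_labels={"app": "my-app"}, target_port=8080, listening_port=3000)
print(failing_checks(port_mismatch))  # ['4. targetPort matches listening port']
```

An empty result means all five checks passed and the problem lies outside what this list covers.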

Related Resources

Kubernetes: Debug Services