What is a ConfigMap, and how do you use it to inject configuration into a Pod?

A ConfigMap stores non-sensitive configuration data (key-value pairs or whole config files) as a Kubernetes object, decoupled from the container image itself, so the same image can be deployed with different configuration across environments. It can be consumed by a Pod as environment variables, as command-line arguments, or as files mounted into the container's filesystem via a volume.

What is a Secret, and how does it differ from a ConfigMap?

A Secret is structurally almost identical to a ConfigMap — the same key-value/file-based consumption model — but is intended specifically for sensitive data (passwords, tokens, TLS certificates), gets some additional handling by Kubernetes (excluded from `kubectl describe` output by default, held in memory-backed storage on nodes rather than written to disk), and can be encrypted at rest in etcd if explicitly configured. Critically, Secret values are only **base64-encoded**, not encrypted, by default — encryption at rest, tighter RBAC restriction, and often an external secrets manager are needed for genuinely strong protection.

What is a Volume in Kubernetes, and how does it differ from a Docker volume?

A Kubernetes Volume is storage attached to a Pod (not an individual container), whose lifecycle can be tied to the Pod (surviving individual container restarts within that Pod, but not the Pod's own deletion) or, for persistent volume types, can outlive the Pod entirely. This is broader than Docker's own volume concept — Kubernetes Volumes are an abstraction over many different underlying storage backends (local disk, cloud block storage, NFS, ConfigMaps/Secrets presented as files) unified under one consistent Pod-level API.

What's the difference between a PersistentVolume and a PersistentVolumeClaim?

A PersistentVolume (PV) represents an actual piece of storage in the cluster (a cloud disk, an NFS share) — provisioned either manually by an administrator or dynamically via a StorageClass. A PersistentVolumeClaim (PVC) is a request for storage made by a Pod's owner/developer, specifying how much space and what access mode is needed, without needing to know or care about the underlying storage implementation. Kubernetes binds a PVC to a matching PV, and the Pod simply references the PVC — this separation lets application developers request storage abstractly, while cluster operators (or dynamic provisioning) handle the actual storage infrastructure.

What is a StorageClass, and how does dynamic provisioning work?

A StorageClass defines a category of storage a cluster can provision on demand — it names a specific provisioner (e.g., the AWS EBS CSI driver) and parameters (disk type, IOPS, filesystem type) describing *how* to create matching storage. When a PersistentVolumeClaim references a StorageClass and no existing PersistentVolume already satisfies it, the StorageClass's provisioner automatically creates both a new PersistentVolume object and the real underlying storage resource (e.g., actually calling the cloud provider's API to create a disk) — eliminating the need for an administrator to manually pre-provision storage ahead of time.

What's the difference between emptyDir, hostPath, and a persistent volume-backed volume?

`emptyDir` is empty, node-local, ephemeral scratch space created fresh when a Pod starts and permanently deleted when the Pod is removed — useful for temporary data or sharing files between containers in the same Pod, never for anything that must survive. `hostPath` mounts a specific path from the underlying node's own filesystem directly into the Pod — powerful but tightly (and often problematically) coupled to whichever specific node the Pod happens to land on. A PersistentVolumeClaim-backed volume is genuinely durable, independent of any specific node or Pod's lifecycle, backed by real persistent storage (a cloud disk, network share) that survives Pod deletion and rescheduling.

How do StatefulSets handle persistent storage differently from Deployments?

A Deployment's Pod template, if it references a PersistentVolumeClaim at all, has every replica sharing the same single PVC (or, more commonly, Deployments simply aren't used with per-replica persistent storage at all) — there's no built-in mechanism for giving each replica its own distinct, durable volume. A StatefulSet's `volumeClaimTemplates` automatically creates a distinct PVC for each replica (`data-web-0`, `data-web-1`, ...), and — critically — that specific PVC stays bound to that specific replica's stable identity across restarts and rescheduling, giving each stateful instance its own persistent, individually-tracked storage.

What access modes can a PersistentVolume have, and why does this matter?

**ReadWriteOnce (RWO)** allows the volume to be mounted read-write by a single node at a time. The restriction is per node, not per Pod, so multiple Pods on the same node can share it; the separate ReadWriteOncePod mode, introduced in Kubernetes 1.22, limits access to a single Pod. **ReadOnlyMany (ROX)** allows many nodes to mount it simultaneously, but only read-only. **ReadWriteMany (RWX)** allows many nodes to mount it simultaneously with read-write access, but it is only supported by certain storage backends (typically network file systems like NFS or cloud file-share services), not by most block-storage-backed volumes (like standard cloud disks). This is a common source of confusion when a Deployment with multiple replicas tries to share one volume.

Configuration and Storage

Injecting configuration into Pods, and managing persistent data with Volumes, PersistentVolumes, and StorageClasses.

Why decouple configuration from the image

Baking environment-specific configuration (database hostnames, feature flags, log levels) directly into a container image means building a separate image per environment — defeating the whole point of a container image being an immutable, promotable artifact that's identical from dev through production. A ConfigMap lets the same image run in every environment, with only the ConfigMap's contents differing.

Creating a ConfigMap

apiVersion: v1
kind: ConfigMap
metadata:
  name: app-config
data:
  LOG_LEVEL: "info"
  MAX_CONNECTIONS: "100"
  app.properties: |
    feature.new_checkout=true
    cache.ttl_seconds=300

Consuming it as environment variables

spec:
  containers:
    - name: app
      image: myapp:1.0
      envFrom:
        - configMapRef:
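            name: app-config    # keys become env vars (LOG_LEVEL, MAX_CONNECTIONS); keys that are not valid env var names, such as app.properties, may be skipped depending on the Kubernetes version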
      env:
        - name: LOG_LEVEL       # or reference just one specific key
          valueFrom:
            configMapKeyRef:
              name: app-config
              key: LOG_LEVEL

Consuming it as mounted files

spec:
  containers:
    - name: app
      image: myapp:1.0
      volumeMounts:
        - name: config-volume
          mountPath: /etc/app-config
  volumes:
    - name: config-volume
      configMap:
        name: app-config

Each key in the ConfigMap becomes a separate file inside /etc/app-config (e.g., /etc/app-config/app.properties), which is the natural approach for configuration formats applications expect to read as a whole file, rather than individual environment variables.
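When only some keys should appear as files, the configMap volume source also accepts an items list that maps individual keys to file paths. The following is a minimal sketch reusing the app-config ConfigMap above; the target filename application.properties is an illustrative assumption, not something the original example defines.

```yaml
spec:
  containers:
    - name: app
      image: myapp:1.0
      volumeMounts:
        - name: config-volume
          mountPath: /etc/app-config
          readOnly: true
  volumes:
    - name: config-volume
      configMap:
        name: app-config
        items:                              # only the keys listed here are projected as files
          - key: app.properties
            path: application.properties    # hypothetical filename; lands at /etc/app-config/application.properties
```

With items set, LOG_LEVEL and MAX_CONNECTIONS are not written as files, which keeps the mounted directory limited to what the application reads.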

A key behavioral difference between the two consumption methods

Environment variables are read only once, at container start. Updating the underlying ConfigMap has no effect on an already-running container's environment variables; the Pod must be restarted (e.g., via a rolling update) to pick up the change. Mounted ConfigMap volumes, by contrast, are updated automatically without restarting the Pod: the kubelet periodically syncs the mounted files to match the ConfigMap's current content, after a propagation delay of typically up to a minute or so. The exception is a key mounted via subPath, which does not receive updates. This distinction matters when deciding how a config change should roll out. As environment variables, a change requires an explicit rollout to take effect (which some teams prefer, for predictability); as mounted files, the application needs its own file-watching or reload logic to notice and react to the change live.

What ConfigMaps are not for

ConfigMaps are stored as plain, unencrypted data in etcd (readable by anyone with appropriate RBAC access to view the object) — they're explicitly not meant for passwords, API keys, or other sensitive values. That's what Secrets exist for (see that question), even though a Secret's data is only base64-encoded, not strongly encrypted, by default — the distinction between the two objects is more about signaling intent and enabling separate handling than about ConfigMaps being insecure and Secrets being inherently safe.

Related Resources

Kubernetes: ConfigMaps

Creating and consuming a Secret

apiVersion: v1
kind: Secret
metadata:
  name: db-credentials
type: Opaque
data:
  username: YWRtaW4=          # base64("admin")
  password: c3VwZXJzZWNyZXQ=  # base64("supersecret")

spec:
  containers:
    - name: app
      envFrom:
        - secretRef:
            name: db-credentials

Consumption (env vars or mounted volumes) works identically to ConfigMaps — the key practical difference is in how Kubernetes handles the data, not in the mechanics of using it in a Pod spec.
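For comparison with the envFrom form, here is a sketch of the same db-credentials Secret consumed as a single environment variable and as a read-only mounted volume. The variable name DB_PASSWORD, the volume name, and the mount path are assumptions chosen for illustration.

```yaml
spec:
  containers:
    - name: app
      image: myapp:1.0
      env:
        - name: DB_PASSWORD              # hypothetical variable name
          valueFrom:
            secretKeyRef:
              name: db-credentials
              key: password
      volumeMounts:
        - name: db-creds
          mountPath: /etc/db-credentials # hypothetical path; one file per key (username, password)
          readOnly: true
  volumes:
    - name: db-creds
      secret:
        secretName: db-credentials       # note: secretName here, not name as in configMap volumes
        defaultMode: 0400                # files readable only by the owning user
```

In practice a Pod would usually pick one of the two forms; both are shown together only to contrast the syntax.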

The critical misconception: base64 is not encryption

echo "c3VwZXJzZWNyZXQ=" | base64 -d
# supersecret

Base64 is a reversible encoding, not encryption — anyone with read access to the Secret object (via kubectl get secret db-credentials -o yaml, or direct etcd access) can trivially decode it back to plaintext. Base64 exists purely so arbitrary binary data (like a TLS private key) can be represented safely inside YAML/JSON text, not to provide any confidentiality. A Secret's actual security comes entirely from who is authorized to read it (RBAC) and whether the underlying storage is encrypted — not from the encoding itself.
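Because base64 is only a transport encoding, the Secret API also accepts plaintext input through the stringData field and encodes it on write. A minimal sketch of the same Secret written that way:

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: db-credentials
type: Opaque
stringData:                # write-only convenience field; the API server stores the values base64-encoded under data
  username: admin
  password: supersecret
```

Reading the object back returns the values under data in base64 form. Either way, a manifest like this is exactly as sensitive as the plaintext it contains, so it should not be committed to version control unprotected.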

What Kubernetes does provide, beyond plain ConfigMaps

Excluded from some default output: kubectl describe secret lists only each key's name and size in bytes, whereas kubectl describe configmap prints the data itself (though kubectl get secret -o yaml still shows the base64-encoded values to anyone with permission to run that command).
tmpfs (memory-backed) storage on nodes — a mounted Secret volume is typically backed by tmpfs (RAM), not written to the node's actual disk, reducing the risk of leftover sensitive data on a decommissioned or compromised node's filesystem.
Encryption at rest, if explicitly configured — by default, Secrets are stored in etcd with no encryption beyond base64 (i.e., effectively plaintext to anyone with etcd access); enabling encryption at rest (a control-plane configuration, not automatic out of the box) actually encrypts Secret data before it's written to etcd.
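As a rough illustration of the third point, encryption at rest is configured through a file passed to the API server with the --encryption-provider-config flag. The sketch below assumes the aescbc provider and uses placeholder key material; managed Kubernetes services often expose this as a provider-specific setting (frequently KMS-backed) instead.

```yaml
apiVersion: apiserver.config.k8s.io/v1
kind: EncryptionConfiguration
resources:
  - resources:
      - secrets
    providers:
      - aescbc:                      # first provider in the list is used to encrypt new writes
          keys:
            - name: key1
              secret: <BASE64_ENCODED_32_BYTE_KEY>   # placeholder, not a real key
      - identity: {}                 # allows reading Secrets that were stored before encryption was enabled
```

Existing Secrets are only encrypted when they are next written, so enabling this is usually followed by rewriting all Secrets once.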

What Secrets still don't solve on their own

Even with encryption at rest enabled, anyone with sufficient RBAC permission to read the Secret object through the API server gets the plaintext value back. Kubernetes-native Secrets also don't provide fine-grained audit trails of secret access the way a dedicated secrets manager typically does, and rotating a Secret's value still requires updating the object and getting consuming Pods to pick up the change (the ConfigMap question's note on env vars vs. mounted volumes applies equally here). For production systems with genuinely sensitive data, many teams layer an external secrets manager such as HashiCorp Vault or AWS Secrets Manager on top of, or instead of, native Kubernetes Secrets. The manager is typically paired with a tool like External Secrets Operator, which syncs values into native Kubernetes Secrets, or with an agent that injects them directly at runtime.

Treat a Kubernetes Secret as "slightly better protected than a ConfigMap, but not encrypted by default and not a substitute for tight RBAC" — always enable encryption at rest for any cluster handling real sensitive data, restrict who can read Secret objects via RBAC as tightly as possible (see the security topic's least-privilege question), and consider an external secrets manager for the most sensitive values, audit requirements, or rotation needs.
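A least-privilege RBAC rule for this Secret might look like the sketch below. The Role and RoleBinding names, the default namespace, and the app ServiceAccount are all hypothetical, chosen only to make the example complete.

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: read-db-credentials          # hypothetical name
  namespace: default
rules:
  - apiGroups: [""]
    resources: ["secrets"]
    resourceNames: ["db-credentials"] # only this one Secret
    verbs: ["get"]                    # no list or watch, which would expose other Secrets in the namespace
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: app-reads-db-credentials     # hypothetical name
  namespace: default
subjects:
  - kind: ServiceAccount
    name: app                        # hypothetical ServiceAccount
    namespace: default
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: read-db-credentials
```

This governs reads through the API server. A Pod that merely mounts the Secret or references it in env does not need such a Role, since the kubelet fetches the data on the Pod's behalf.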

Related Resources

Kubernetes: Secrets

The problem Volumes solve

A container's own filesystem is ephemeral — anything written to it is lost the moment the container restarts, even within the same Pod (a crash and restart of a single container gets a fresh filesystem). This is fine for a purely stateless process, but breaks anything needing to persist data across a restart, or needing to share data between multiple containers in the same Pod.

The Volume abstraction: attached to the Pod, not the container

apiVersion: v1
kind: Pod
metadata:
  name: app
spec:
  containers:
    - name: app
      image: myapp:1.0
      volumeMounts:
        - name: cache-volume
          mountPath: /app/cache
    - name: sidecar
      image: sidecar:1.0
      volumeMounts:
        - name: cache-volume        # same volume, mounted into a second container
          mountPath: /shared/cache
  volumes:
    - name: cache-volume
      emptyDir: {}

Because the Volume is defined at the Pod level and mounted into whichever containers declare it, both containers in this Pod see the same underlying storage — and, depending on volume type, data written to it can survive an individual container within the Pod restarting, even though the Volume's own lifetime is still ultimately tied to the Pod (an emptyDir, specifically, is deleted when the Pod itself is deleted, regardless of how many times its containers individually restarted along the way).

Many volume "types," one consistent Pod-level interface

Kubernetes Volumes aren't one storage mechanism — the volumes field supports many distinct types, each backed by different underlying storage: emptyDir (ephemeral, node-local scratch space), hostPath (a path on the node's own filesystem), configMap/secret (presenting Kubernetes objects as files — see the ConfigMap question), and persistent types backed by a PersistentVolumeClaim (network/cloud-backed storage that can outlive the Pod entirely — see that question). All of these are mounted into containers using the exact same volumeMounts syntax, regardless of what's actually backing them.

How this differs from a plain Docker volume

A Docker volume is scoped to a single host. It can be mounted into several containers on that host, but Docker has no native notion of "shared across a group of co-located containers with different lifecycle rules per volume type," and its core volume model is not a single abstraction spanning many different network/cloud storage backends. Kubernetes's Volume model is a broader, Pod-centric abstraction specifically designed to unify wildly different storage backends (local ephemeral scratch space, cloud block storage, network file shares, Kubernetes-object-as-file) under one API, so a Pod spec doesn't need different mounting logic depending on what's providing the storage underneath.

Ephemeral vs. persistent, at a glance

Volume type	Survives container restart (within the Pod)	Survives Pod deletion
`emptyDir`	Yes	No
`hostPath`	Yes	Yes, but tied to that specific node
`configMap` / `secret`	Yes (read-only)	No (recreated from the object if the Pod is recreated)
PersistentVolumeClaim-backed	Yes	Yes — this is the actual "durable data" story (see that question)

Understanding this table is the key to answering "will my data survive X" questions correctly — the answer depends entirely on which volume type is in play, not on some single universal Kubernetes storage guarantee.

Related Resources

Kubernetes: Volumes

The separation of concerns: PV (supply) vs. PVC (demand)

# PersistentVolume: an actual piece of storage, typically created by an admin
# or dynamically by a StorageClass provisioner -- represents SUPPLY
apiVersion: v1
kind: PersistentVolume
metadata:
  name: pv-database-1
spec:
  capacity:
    storage: 20Gi
  accessModes:
    - ReadWriteOnce
  csi:                            # the in-tree awsElasticBlockStore source was removed in Kubernetes 1.27
    driver: ebs.csi.aws.com
    volumeHandle: vol-0abc123

# PersistentVolumeClaim: a request for storage, created by an application
# developer -- represents DEMAND, with no knowledge of the underlying implementation
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: database-pvc
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 20Gi

# The Pod only references the PVC by name -- never the PV directly
spec:
  containers:
    - name: db
      volumeMounts:
        - name: data
          mountPath: /var/lib/data
  volumes:
    - name: data
      persistentVolumeClaim:
        claimName: database-pvc

Kubernetes's PV controller binds the PVC to a PV whose capacity, access modes, and storage class satisfy the request (in this example, pv-database-1 matches database-pvc's request exactly, provided the cluster has no default StorageClass that would be assigned to the claim). Binding is one-to-one: once bound, that specific PV is reserved exclusively for that PVC and can't be claimed by any other PVC.
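On clusters that do have a default StorageClass, a claim meant for a manually created PV usually needs to opt out of dynamic provisioning explicitly. A sketch of the same claim written that way, with the optional volumeName pin added as an illustration:

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: database-pvc
spec:
  storageClassName: ""          # empty string: do not apply the default class; bind only to PVs with no class
  volumeName: pv-database-1     # optional: pin the claim to this specific PV
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 20Gi
```

Omitting storageClassName entirely is different from setting it to an empty string: the omitted form lets the cluster fill in its default class, if one exists.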

Why this indirection is genuinely useful

An application developer writing a Deployment or StatefulSet manifest shouldn't need to know (or care) whether the underlying storage is an AWS EBS volume, a GCP persistent disk, or an on-prem NFS share — they just declare "I need 20Gi, ReadWriteOnce" via a PVC, and the actual storage implementation detail is handled entirely by whoever manages PVs (or, far more commonly today, by dynamic provisioning via a StorageClass — see that question). This mirrors the same "declare what you want, let something else figure out how" philosophy that runs through Kubernetes generally (see the reconciliation-loop question).

Static vs. dynamic provisioning

Static provisioning: a cluster administrator manually creates PV objects ahead of time, representing real storage they've already set up, and PVCs bind to whichever pre-existing PV matches.
Dynamic provisioning (the overwhelmingly more common approach today): a PVC references a StorageClass, and if no existing PV matches, the StorageClass's provisioner automatically creates a brand-new PV (and the underlying real storage — e.g., actually calling the cloud API to create an EBS volume) on demand, with no administrator needing to pre-provision anything.

The lifecycle and reclaim policy

When a PVC is deleted, what happens to its bound PV (and the real underlying storage) depends on the PV's reclaim policy. With Delete, the PV and underlying storage are deleted too. With Retain, the PV and its data survive, but the PV moves to a "Released" state in which it is no longer available for automatic binding and needs manual admin action to reuse or clean up. Manually created PVs default to Retain, while dynamically provisioned PVs inherit their StorageClass's reclaimPolicy, which defaults to Delete. That makes the policy worth checking deliberately for any PV holding important data, since Delete can be a nasty surprise if a PVC is removed accidentally.
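The policy lives on the PV itself, in spec.persistentVolumeReclaimPolicy. A sketch of the earlier example PV with the policy stated explicitly; the csi volume source and its driver name are assumptions about how the disk is exposed.

```yaml
apiVersion: v1
kind: PersistentVolume
metadata:
  name: pv-database-1
spec:
  capacity:
    storage: 20Gi
  accessModes:
    - ReadWriteOnce
  persistentVolumeReclaimPolicy: Retain   # keep the PV and its data when the bound PVC is deleted
  csi:                                    # assumed volume source for illustration
    driver: ebs.csi.aws.com
    volumeHandle: vol-0abc123
```

The field can also be changed on an existing PV (for example with kubectl patch), which is a common precaution before deleting a PVC whose data should be kept.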

A strong answer emphasizes the separation of concerns this design achieves — developers declare storage needs abstractly via PVCs, while the actual storage implementation (PVs, and typically dynamic provisioning via StorageClasses) is a cluster/infrastructure-level concern — rather than just describing PV and PVC as two similarly-named objects without explaining why the split exists.

Related Resources

Kubernetes: Persistent Volumes

Defining a StorageClass

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: fast-ssd
provisioner: ebs.csi.aws.com     # which CSI driver actually creates the storage
parameters:
  type: gp3
  iopsPerGB: "50"
reclaimPolicy: Delete             # what happens to the PV when its PVC is deleted
volumeBindingMode: WaitForFirstConsumer

Requesting storage from it

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: database-pvc
spec:
  storageClassName: fast-ssd      # references the StorageClass above
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 100Gi

What happens when this PVC is created

The PV controller checks for an existing, unbound PV matching the request. If none exists (the common case with dynamic provisioning, since PVs aren't pre-created), it proceeds to provisioning; because fast-ssd uses volumeBindingMode: WaitForFirstConsumer, that step waits until a Pod using the PVC has been scheduled.
The fast-ssd StorageClass's provisioner (ebs.csi.aws.com, a Container Storage Interface (CSI) driver) is invoked and calls the cloud provider API to create a real 100Gi gp3 EBS volume.
A new PersistentVolume object is automatically created, representing this newly-created real disk.
The PVC is bound to this new PV — the whole process happening automatically, with no administrator needing to have anticipated this specific request ahead of time.

The Container Storage Interface (CSI) — another standard plugin interface

Similar in spirit to CRI (for container runtimes) and CNI (for networking), CSI standardizes how Kubernetes talks to storage systems, so any storage vendor can write a CSI driver that Kubernetes can use without storage-vendor-specific code baked into Kubernetes core. This is what lets the same StorageClass mechanism work uniformly whether the underlying provisioner is AWS EBS, GCP Persistent Disk, Azure Disk, or an on-prem storage system's CSI driver.

volumeBindingMode — an important, easy-to-miss setting

WaitForFirstConsumer (the commonly recommended setting for topology-constrained storage) delays provisioning and binding the volume until a Pod that uses the PVC is scheduled. This matters because the volume might have topology constraints (e.g., an EBS volume can only be attached to a node in the same availability zone it was created in), and provisioning it before knowing which node and zone the Pod will run in could create the volume in the wrong zone, leaving the Pod unschedulable. Immediate, the default when volumeBindingMode is omitted, provisions the volume as soon as the PVC is created, without waiting to see where the consuming Pod lands. It is a real source of "PVC bound to a volume in the wrong availability zone" issues if not configured carefully.

Default StorageClass

A cluster can designate one StorageClass as the default (via the storageclass.kubernetes.io/is-default-class: "true" annotation), used automatically for any PVC that doesn't explicitly specify storageClassName. This is worth being aware of when a PVC's storage behavior seems to "just work" without an explicit StorageClass reference; it's still using one, just implicitly.
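A sketch of a StorageClass marked as the cluster default. The class name standard and the gp3 parameter are illustrative assumptions; the annotation key is the one Kubernetes recognizes.

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: standard                                          # hypothetical class name
  annotations:
    storageclass.kubernetes.io/is-default-class: "true"   # marks this class as the cluster default
provisioner: ebs.csi.aws.com
parameters:
  type: gp3
volumeBindingMode: WaitForFirstConsumer
```

Running kubectl get storageclass shows the default class with a "(default)" marker next to its name, which is a quick way to find out what an unqualified PVC will receive.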

Cluster operators typically define a small number of StorageClasses representing meaningful tiers (e.g., fast-ssd for databases needing high IOPS, standard for general-purpose storage, cold-archive for infrequently accessed data), and application teams simply reference the appropriate one by name in their PVCs — keeping the "what kind of storage, from which provider, with which performance characteristics" decision centralized and consistent across the cluster.

Related Resources

Kubernetes: Storage Classes

emptyDir — ephemeral, Pod-scoped scratch space

volumes:
  - name: scratch-space
    emptyDir: {}

Created empty when the Pod starts, exists only as long as the Pod does on that node, and is permanently deleted the moment the Pod itself is deleted — a container within the Pod restarting does not lose an emptyDir's contents (it survives individual container restarts, just not Pod deletion). Good for: a temporary cache, scratch space for a batch computation, or as the shared medium between a main container and a sidecar/init container within the same Pod (see the sidecar and init container questions). Can optionally be backed by RAM (emptyDir: {medium: Memory}) for even faster, tmpfs-based scratch space, at the cost of counting against the Pod's memory usage.
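The RAM-backed variant can be sketched as a complete Pod. The Pod name, mount path, and sizes below are assumptions for illustration.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: scratch-demo                # hypothetical name
spec:
  containers:
    - name: app
      image: myapp:1.0
      resources:
        limits:
          memory: 512Mi             # files written to the tmpfs volume count toward this limit
      volumeMounts:
        - name: scratch-space
          mountPath: /tmp/scratch
  volumes:
    - name: scratch-space
      emptyDir:
        medium: Memory              # tmpfs instead of node disk
        sizeLimit: 128Mi            # cap on how much the volume may hold
```

Setting sizeLimit is a reasonable precaution with medium: Memory, since an unbounded tmpfs volume can otherwise grow until the container hits its memory limit.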

hostPath — a specific node's own filesystem, mounted in

volumes:
  - name: node-logs
    hostPath:
      path: /var/log/containers
      type: Directory

Mounts an actual path from the specific node the Pod happens to be scheduled onto. This is powerful: it gives direct access to node-level resources, which certain infrastructure DaemonSets genuinely need (for example, a log collector reading /var/log). It comes with real caveats, though. The data is tied to that one node and is not portable if the Pod is rescheduled elsewhere. Different nodes might have different content or permissions at that path. And it's a meaningful security risk if used carelessly, since it gives a Pod direct access to the underlying host's filesystem, potentially including sensitive host-level files. Most clusters restrict hostPath usage via Pod Security Admission policies (see the security topic) specifically because of this risk.
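The log-collector case mentioned above might look like the following sketch, with the host path mounted read-only to limit the exposure. The DaemonSet name, image, and in-container mount path are hypothetical.

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: log-collector               # hypothetical name
spec:
  selector:
    matchLabels:
      app: log-collector
  template:
    metadata:
      labels:
        app: log-collector
    spec:
      containers:
        - name: collector
          image: log-collector:1.0  # hypothetical image
          volumeMounts:
            - name: node-logs
              mountPath: /host/var/log
              readOnly: true        # the collector only needs to read the node's logs
      volumes:
        - name: node-logs
          hostPath:
            path: /var/log
            type: Directory         # fail if the path does not already exist as a directory
```

A DaemonSet fits this pattern because it runs one Pod per node, so "the data is tied to this node" is the intended behavior rather than a portability problem.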

PersistentVolumeClaim-backed volume — genuinely durable, node-independent storage

volumes:
  - name: data
    persistentVolumeClaim:
      claimName: my-pvc

Backed by real persistent storage (a cloud block volume, an NFS share) that exists independently of any specific node or Pod — if the Pod is deleted and recreated (even on a completely different node, assuming the storage backend/access mode supports it), it reattaches to the same underlying data. This is the only one of the three that provides genuine durability across Pod rescheduling, which is why it's the correct choice for anything that must survive beyond a single Pod's lifetime (databases, uploaded files, any real application data).

Side-by-side summary

	emptyDir	hostPath	PVC-backed
Survives container restart (same Pod)	Yes	Yes	Yes
Survives Pod deletion	No	Yes, but tied to that node	Yes, node-independent
Tied to a specific node	Yes, but moot (it's deleted with the Pod, which never moves nodes)	Yes	No (typically; depends on the storage backend/access mode)
Typical use	Scratch space, inter-container sharing within a Pod	Node-level infrastructure access (logs, host metrics)	Real application/database data

Default to a PVC-backed volume for anything that must actually persist; use emptyDir for genuinely temporary, disposable data; and treat hostPath as a specialized tool reserved for infrastructure-level DaemonSets that have a real, deliberate need to access the host filesystem — not a general-purpose storage option for application workloads.

Related Resources

Kubernetes: Volumes

Why a Deployment can't give each replica its own volume

A Deployment's Pod template defines exactly one PVC reference (if any), shared identically across every replica it creates — there's no per-replica templating mechanism. In practice, this means a Deployment either has all replicas sharing one single PVC (only viable for storage backends supporting ReadWriteMany — see the access modes question — and only sensible for genuinely shared data, not per-instance data), or, far more commonly, Deployments simply aren't used at all when each replica needs its own distinct persistent data.

volumeClaimTemplates — per-replica PVCs, automatically

apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: web
spec:
  serviceName: "web-headless"
  replicas: 3
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: myapp:1.0
          volumeMounts:
            - name: data
              mountPath: /var/lib/data
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 10Gi

Instead of one shared PVC, Kubernetes creates a separate PVC for each replica, named by combining the template name and the Pod's ordinal: data-web-0, data-web-1, data-web-2. Each replica's Pod mounts only its own correspondingly-named PVC.

The critical guarantee: identity-to-storage binding survives rescheduling

If web-1's Pod is deleted and recreated — whether due to a crash, a node failure and rescheduling elsewhere, or a rolling update — the replacement Pod, still named web-1, reattaches to the exact same data-web-1 PVC, not a fresh empty volume. This is what makes StatefulSets suitable for databases and other stateful applications: each instance's identity and its data are durably linked together, regardless of which physical node it currently happens to be running on.

What happens when you scale a StatefulSet down, then back up

Scaling a StatefulSet from 3 replicas down to 1 does not delete data-web-1 or data-web-2's PVCs by default; they're retained. Scaling back up to 3 later re-creates web-1 and web-2, which reattach to their original, still-existing PVCs, so previously removed replicas come back with their old data intact. This is a deliberate safety choice, since silently deleting a stateful replica's data just because it was temporarily scaled down would be dangerous default behavior. The behavior is configurable through the StatefulSet's persistentVolumeClaimRetentionPolicy field (a more recent addition to the API), whose whenScaled and whenDeleted settings both default to Retain.
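The retention behavior can be stated explicitly on the StatefulSet. The sketch below repeats the web example with the policy field added; it assumes a Kubernetes version recent enough to support persistentVolumeClaimRetentionPolicy, and the chosen values are only one possible combination.

```yaml
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: web
spec:
  serviceName: "web-headless"
  replicas: 3
  persistentVolumeClaimRetentionPolicy:
    whenScaled: Retain      # keep PVCs of replicas removed by a scale-down (the default)
    whenDeleted: Delete     # illustrative choice: remove PVCs when the StatefulSet itself is deleted
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
        - name: web
          image: myapp:1.0
          volumeMounts:
            - name: data
              mountPath: /var/lib/data
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 10Gi
```

Setting either value to Delete trades the safety default for automatic cleanup, so it is best reserved for data that can be rebuilt.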

Manual cleanup responsibility

Because PVCs created via volumeClaimTemplates are not automatically deleted when the StatefulSet itself is deleted (again, a deliberate safety default — protecting against accidental data loss), cleaning up a StatefulSet's storage after you genuinely want it gone requires explicitly deleting the underlying PVCs yourself (kubectl delete pvc -l app=web) — an easy step to forget, and a common source of "why is this old data/cost still around" surprises after decommissioning a StatefulSet-based application.

The key distinguishing fact to articulate clearly: it's not just that StatefulSets can use persistent storage (Deployments technically can reference PVCs too) — it's that StatefulSets provide per-replica storage that's durably bound to that replica's stable identity, which is precisely the capability a Deployment's architecture has no mechanism to express at all.

Related Resources

Kubernetes: StatefulSets - Volume Claim Templates

The three access modes

Mode	Meaning
ReadWriteOnce (RWO)	Can be mounted read-write, but only by one node at a time (multiple Pods on that same node can still share it)
ReadOnlyMany (ROX)	Can be mounted read-only, simultaneously, by many nodes
ReadWriteMany (RWX)	Can be mounted read-write, simultaneously, by many nodes

(A newer, more granular ReadWriteOncePod mode restricts read-write access to exactly one Pod in the whole cluster, not just one node, for use cases needing that stricter guarantee. It is supported only for CSI volumes.)
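Requesting the single-Pod mode is just a different value in the claim's accessModes list. In this sketch the claim name is hypothetical, and fast-ssd is assumed to be a StorageClass whose CSI driver supports ReadWriteOncePod.

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: single-writer-pvc       # hypothetical name
spec:
  storageClassName: fast-ssd    # assumed CSI-backed class that supports this mode
  accessModes:
    - ReadWriteOncePod          # at most one Pod in the cluster may use the volume
  resources:
    requests:
      storage: 10Gi
```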

Why most cloud block storage only supports RWO

Standard cloud block storage (AWS EBS, GCP Persistent Disk, Azure Disk) is fundamentally built like an attached hard drive — it can only be attached to one virtual machine (node) at a time, which is why these are backed by PVs that only support ReadWriteOnce. This is perfectly fine for a single-replica database, or for a StatefulSet where each replica gets its own separate RWO volume via volumeClaimTemplates (see that question) — the "many replicas" case is handled by giving each replica its own distinct RWO volume, not by sharing one.

The common mistake this causes

# A Deployment with 3 replicas, all referencing the SAME PVC
spec:
  replicas: 3
  template:
    spec:
      volumes:
        - name: shared-data
          persistentVolumeClaim:
            claimName: shared-pvc   # backed by a standard cloud disk (RWO)

If shared-pvc is backed by a typical cloud block-storage PV (RWO only), and the 3 replicas get scheduled across 3 different nodes, at most one node will be able to attach the volume. The Pods on the other two nodes stay stuck in Pending (typically shown as ContainerCreating, with a Multi-Attach error in their events), unable to attach a volume that's already attached read-write elsewhere. This is a frequent source of confusion for engineers newer to Kubernetes storage, who reasonably assume "a PVC is just shared storage" without realizing that the access mode of the underlying storage backend fundamentally limits this.

What actually supports ReadWriteMany

Network file systems and cloud file-share services are what genuinely support simultaneous multi-node read-write access: NFS, AWS EFS, Azure Files, GCP Filestore, and certain distributed storage systems (Ceph, GlusterFS) configured appropriately. If a workload genuinely needs multiple Pods across multiple nodes to read and write the same shared storage concurrently (a shared upload directory, a shared cache multiple services write to), the StorageClass/PV must specifically be backed by one of these RWX-capable systems — not standard cloud block storage.
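A claim for genuinely shared storage could be sketched as follows. The shared-files StorageClass is hypothetical and stands for any class backed by an RWX-capable system such as NFS or a cloud file-share service.

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: shared-pvc
spec:
  storageClassName: shared-files   # hypothetical class backed by an RWX-capable file system
  accessModes:
    - ReadWriteMany                # many nodes may mount read-write at the same time
  resources:
    requests:
      storage: 50Gi
```

With a claim like this, the three-replica Deployment shown earlier can mount shared-pvc from different nodes. Requesting ReadWriteMany from a block-storage class, by contrast, typically leaves the PVC unbound or fails provisioning.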

Before designing a multi-replica workload that needs shared storage, explicitly check: does each replica actually need its own independent storage (→ use a StatefulSet with volumeClaimTemplates, RWO is fine), or do they genuinely need to concurrently read/write the same files (→ requires an RWX-capable storage backend, a real architectural decision with real performance/consistency tradeoffs of its own, not just a checkbox to enable). Assuming "any PVC can just be shared across replicas" without checking the access mode and underlying storage backend's actual capability is one of the more common Kubernetes storage misconceptions that only surfaces once Pods start failing to schedule or attach.

Related Resources

Kubernetes: Access Modes