What is RBAC in Kubernetes, and what are Roles, ClusterRoles, RoleBindings, and ClusterRoleBindings?

RBAC (Role-Based Access Control) governs what actions a given identity can perform against the Kubernetes API. A **Role** defines a set of permissions (verbs on resources) scoped to one namespace; a **ClusterRole** defines the same kind of permissions but cluster-wide (or for cluster-scoped resources). A **RoleBinding** grants a Role's (or a ClusterRole's) permissions to specific users/groups/ServiceAccounts, scoped to one namespace; a **ClusterRoleBinding** grants a ClusterRole's permissions cluster-wide. The permissions and the grant are deliberately separate objects, so the same Role can be reused across many bindings.

What is a ServiceAccount, and how do pods use them to authenticate to the API server?

A ServiceAccount is an identity for processes running inside Pods to use when talking to the Kubernetes API server — distinct from a human User account, which Kubernetes doesn't manage as an object at all (it's handled by an external authentication mechanism). Every Pod runs with a ServiceAccount (defaulting to a namespace's `default` ServiceAccount if none is specified), and Kubernetes automatically mounts that ServiceAccount's credentials (a token) into the Pod, letting any code inside the Pod authenticate to the API server as that identity, subject to whatever RBAC permissions are bound to it.

What is a SecurityContext, and what does it control?

A SecurityContext (settable at the Pod level, applying to all its containers, and/or overridden per-container) configures Linux-level security settings for how a container actually runs — whether it runs as root or a specific non-root user ID, whether it can escalate privileges, which Linux capabilities it has beyond the container default set, and whether its root filesystem is read-only. These settings are a core part of hardening containers against exploitation, since a container running as root with unnecessary privileges gives an attacker who compromises it a much larger blast radius.

What replaced PodSecurityPolicy, and what does Pod Security Admission do?

PodSecurityPolicy (PSP) was deprecated in Kubernetes 1.21 and removed in 1.25, replaced by **Pod Security Admission** — a built-in admission controller that enforces one of three predefined, non-customizable Pod Security Standards (`privileged`, `baseline`, `restricted`) at the namespace level, via a simple label on the namespace, rather than PSP's more complex and notoriously hard-to-reason-about custom policy objects and RBAC-based binding model.

What is an admission controller, and what's the difference between a validating and mutating webhook?

Admission controllers are the last stage of the API server's request pipeline (after authentication and authorization) — they can inspect, modify, or reject a request before it's persisted to etcd. **Mutating** admission webhooks run first and can modify the object (e.g., automatically injecting a sidecar container, or setting a default label); **validating** admission webhooks run after mutation and can only accept or reject the (possibly now-modified) object, never change it further — this ordering guarantees validation always sees the final, fully-mutated version of an object.

How should Secrets be managed securely in a production cluster?

Enable **encryption at rest** for Secrets in etcd (not on by default), restrict who can read Secret objects as tightly as possible via RBAC (least privilege), avoid committing raw Secret manifests to version control (use sealed/encrypted representations, like Sealed Secrets or SOPS, that are safe to commit), and for the most sensitive values, consider an external secrets manager (HashiCorp Vault, AWS/Azure/GCP's secrets services) integrated via a tool like External Secrets Operator, which syncs values in without them ever needing to be hand-authored as plain Kubernetes Secret manifests.

How do you secure the Kubernetes API server itself?

Restrict network exposure (the API server should generally not be reachable from the public internet without restriction — private networking, IP allowlisting, or a VPN/bastion is standard), require strong authentication (client certificates, OIDC/SSO integration, or cloud-provider IAM — never rely on static tokens alone for humans), enforce tight RBAC so authenticated identities only have the minimum permissions they need, enable audit logging to record who did what, and keep the Kubernetes version patched, since the API server is the single most security-critical component in the entire cluster.

What is the principle of least privilege as applied to Kubernetes RBAC design?

Least privilege in a Kubernetes context means every user, ServiceAccount, and automated process should hold only the specific verbs on the specific resources (and only in the specific namespaces) it genuinely needs to do its job — never broad, cluster-wide, or wildcard permissions granted for convenience. This limits the damage any single compromised credential, buggy controller, or over-permissioned CI pipeline can do, exactly mirroring the same principle applied to database access, applied here to the Kubernetes API's own permission model.

Security

RBAC, ServiceAccounts, Pod-level security controls, admission control, and securing the API server.

Difficulty

Open as page

Role — namespace-scoped permissions

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: production
  name: pod-reader
rules:
  - apiGroups: [""]
    resources: ["pods"]
    verbs: ["get", "list", "watch"]

Defines a set of permitted actions (verbs) on specific resource types, scoped to exist only within the production namespace — this Role has no effect on Pods in any other namespace.

RoleBinding — granting a Role to an identity, within a namespace

apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: read-pods-binding
  namespace: production
subjects:
  - kind: User
    name: alice
    apiGroup: rbac.authorization.k8s.io
roleRef:
  kind: Role
  name: pod-reader
  apiGroup: rbac.authorization.k8s.io

Grants the pod-reader Role's permissions specifically to the user alice, and specifically within the production namespace. A RoleBinding can also reference a ClusterRole (not just a namespace-scoped Role) — in that case, it grants that ClusterRole's permissions, but still only within the binding's own namespace, which is a useful pattern for reusing one common set of permissions (defined once as a ClusterRole) across many different namespace-scoped bindings.

ClusterRole — cluster-wide (or cluster-scoped-resource) permissions

apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: node-reader
rules:
  - apiGroups: [""]
    resources: ["nodes"]     # Nodes are a cluster-scoped resource -- no namespace applies
    verbs: ["get", "list", "watch"]

Needed for any permission concerning cluster-scoped resources (Nodes, PersistentVolumes, ClusterRoles/ClusterRoleBindings themselves) since these don't belong to any namespace at all — a namespaced Role simply has no way to grant access to them.

ClusterRoleBinding — granting a ClusterRole cluster-wide

apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: read-nodes-global
subjects:
  - kind: Group
    name: platform-team
    apiGroup: rbac.authorization.k8s.io
roleRef:
  kind: ClusterRole
  name: node-reader
  apiGroup: rbac.authorization.k8s.io

Grants the node-reader ClusterRole's permissions across the entire cluster, to everyone in the platform-team group — the broadest possible scope of grant.

The four-way combination, summarized

	Permissions defined	Grant scoped to
Role + RoleBinding	One namespace	That same namespace
ClusterRole + RoleBinding	Cluster-wide definition, reused	That RoleBinding's namespace only
ClusterRole + ClusterRoleBinding	Cluster-wide definition	The whole cluster

Why the separation between "permission definition" and "grant" exists

Defining permissions (Role/ClusterRole) separately from granting them (RoleBinding/ClusterRoleBinding) to a specific subject lets one reusable permission set be bound to many different users/teams/namespaces without redefining the underlying rules each time — e.g., a single pod-reader ClusterRole can be bound via separate RoleBindings in team-a's namespace and team-b's namespace, each granting the same read-only pod access, scoped independently to each team's own namespace.

Default to the most narrowly-scoped combination that satisfies the real need — a namespaced Role + RoleBinding for anything that can be namespace-scoped, reserving ClusterRole + ClusterRoleBinding for genuinely cluster-wide needs (platform/infrastructure teams, cluster-scoped resources) — this is a direct application of the least-privilege principle to Kubernetes's own access model.

Related Resources

Kubernetes: Using RBAC Authorization

Open as page

Why Pods need their own identity

Some applications running inside a Pod need to talk to the Kubernetes API server themselves — a custom controller watching for changes to a Custom Resource, a CI/CD tool creating new Deployments, or simply an application that needs to look up its own Pod's metadata. This requires an identity to authenticate as, distinct from any human user's own credentials — that's exactly what a ServiceAccount provides.

Creating and using a ServiceAccount

apiVersion: v1
kind: ServiceAccount
metadata:
  name: pod-manager
  namespace: production

apiVersion: v1
kind: Pod
metadata:
  name: my-controller
spec:
  serviceAccountName: pod-manager   # explicitly assign this ServiceAccount
  containers:
    - name: controller
      image: my-controller:1.0

If serviceAccountName isn't specified, the Pod automatically uses that namespace's default ServiceAccount — a detail worth knowing, since it means every Pod always authenticates as some identity, even if you never explicitly thought about which one.

How the credential actually gets into the Pod

Kubernetes automatically mounts a projected volume into every Pod at /var/run/secrets/kubernetes.io/serviceaccount/, containing a short-lived, auto-rotating bound service account token (a JWT), along with the cluster's CA certificate and the current namespace — any code inside the container can read this token and present it as a bearer token when calling the API server directly.

# Inside a Pod, this is how application code (or a Kubernetes client library)
# authenticates to the API server as the Pod's ServiceAccount:
TOKEN=$(cat /var/run/secrets/kubernetes.io/serviceaccount/token)
curl -H "Authorization: Bearer $TOKEN" \
     --cacert /var/run/secrets/kubernetes.io/serviceaccount/ca.crt \
     https://kubernetes.default.svc/api/v1/namespaces/production/pods

Most Kubernetes client libraries (used when building custom controllers/Operators — see the extensibility topic) handle this automatically via "in-cluster config" detection, so application code rarely constructs these requests by hand.

Binding permissions to a ServiceAccount

A bare ServiceAccount has no permissions by default — it must be granted permissions the same way any other RBAC subject is, via a RoleBinding or ClusterRoleBinding:

apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: pod-manager-binding
  namespace: production
subjects:
  - kind: ServiceAccount
    name: pod-manager
    namespace: production
roleRef:
  kind: Role
  name: pod-editor
  apiGroup: rbac.authorization.k8s.io

Why the default ServiceAccount should almost never be granted broad permissions

Because every Pod that doesn't explicitly specify a ServiceAccount silently uses default, granting broad permissions to a namespace's default ServiceAccount effectively grants those permissions to every Pod in that namespace, including ones that never intended to need API access at all — a common, easy-to-introduce security misconfiguration. Best practice is to leave default unprivileged, and create dedicated, narrowly-scoped ServiceAccounts (with correspondingly narrow RoleBindings) for the specific Pods that genuinely need API access.

disabling auto-mounting when not needed

spec:
  automountServiceAccountToken: false

For Pods that don't need to talk to the API server at all (the majority of ordinary application workloads), explicitly disabling the automatic token mount removes an unnecessary credential from the Pod's filesystem entirely — a small but meaningful hardening step, reducing what an attacker could exfiltrate from a compromised container that had no legitimate need for API access in the first place.

Related Resources

Kubernetes: Service Accounts

Open as page

A hardened example

apiVersion: v1
kind: Pod
metadata:
  name: app
spec:
  securityContext:              # Pod-level: applies to all containers by default
    runAsNonRoot: true
    runAsUser: 1000
    fsGroup: 2000
  containers:
    - name: app
      image: myapp:1.0
      securityContext:          # container-level: can override the Pod-level settings
        allowPrivilegeEscalation: false
        readOnlyRootFilesystem: true
        capabilities:
          drop:
            - ALL
          add:
            - NET_BIND_SERVICE   # only add back the specific capability actually needed

Key settings and what each one hardens against

runAsNonRoot / runAsUser — many container images default to running as root (UID 0) inside the container unless told otherwise; forcing a non-root UID means that even if an attacker achieves code execution inside the container, they don't automatically have root-level privileges within that container's own namespace, limiting what they can further tamper with (though container root is still not equivalent to host root, given proper isolation — this is defense in depth, not the only layer).
allowPrivilegeEscalation: false — prevents a process from gaining more privileges than its parent process had (blocking, among other things, setuid binaries from escalating privilege inside the container) — a meaningful hardening step against a specific class of container escape/privilege-escalation technique.
readOnlyRootFilesystem: true — makes the container's own root filesystem immutable at runtime; an attacker who achieves code execution can't write a persistent backdoor or modify application binaries on disk, though the application must then explicitly mount a writable volume (like emptyDir) for any directory it legitimately needs to write to (temp files, caches).
capabilities: drop: [ALL], then selectively add — Linux capabilities are fine-grained permissions that break up what used to be the monolithic "root" privilege (e.g., NET_BIND_SERVICE for binding to ports below 1024, SYS_ADMIN for a wide range of administrative operations). Dropping all capabilities and adding back only the specific ones a container genuinely needs is a direct application of least privilege at the kernel-capability level — most containers need zero or very few capabilities beyond the default set the runtime already restricts.
fsGroup — sets the group ownership of mounted volumes, letting a non-root user still have appropriate write access to volume-backed storage without needing to run as root.

Why this matters: limiting the blast radius of a compromised container

Container isolation (namespaces, cgroups) already provides real separation from the host, but it's not an absolute security boundary — container escape vulnerabilities do periodically get discovered, and a poorly-hardened container (running as root, with unnecessary Linux capabilities, a writable root filesystem, and unrestricted privilege escalation) gives an attacker who achieves code execution inside it a much larger set of tools to work with than a properly hardened one. SecurityContext settings are exactly the mechanism for closing off unnecessary privilege a container was never going to legitimately need.

Enforcing this cluster-wide, not just per-Pod

Rather than relying on every team to remember to set these fields correctly in every Pod spec, most clusters enforce baseline SecurityContext requirements cluster-wide (or per-namespace) via Pod Security Admission (see that question) — rejecting or flagging Pods that don't meet a minimum security bar (e.g., the restricted Pod Security Standard requires most of the settings shown above) rather than trusting every individual manifest to have gotten it right voluntarily.

Related Resources

Kubernetes: Configure a Security Context for a Pod or Container

Open as page

Why PodSecurityPolicy was deprecated and removed

PSP let you define arbitrarily customizable security policies (allowed capabilities, allowed volume types, required user IDs, and much more) — but applying a PSP to a Pod worked through an unusually indirect mechanism: a PSP had to be granted via RBAC to a user or ServiceAccount, and which PSP actually applied to a given Pod depended on RBAC evaluation order in ways that were widely regarded as confusing and error-prone in practice. This complexity was cited as the primary reason for deprecating it in favor of something simpler.

Pod Security Admission — the replacement

apiVersion: v1
kind: Namespace
metadata:
  name: production
  labels:
    pod-security.kubernetes.io/enforce: restricted
    pod-security.kubernetes.io/audit: restricted
    pod-security.kubernetes.io/warn: restricted

Rather than custom policy objects, Pod Security Admission enforces one of three predefined, standardized levels (the Pod Security Standards), simply by labeling a namespace:

Level	Behavior
privileged	Unrestricted — no security requirements enforced at all
baseline	Blocks known privilege-escalation paths (e.g., disallows privileged containers, host namespaces) while remaining broadly compatible with common workloads
restricted	The most hardened standard — requires non-root, disallows privilege escalation, requires dropping all Linux capabilities except a small allowed set, requires `seccompProfile`, and more

Three separate label keys control three independent modes: enforce (actually reject non-compliant Pods), audit (allow them, but log a warning in the audit log), and warn (allow them, but return a warning to the user submitting the Pod) — commonly used together to warn/audit at a stricter level while only enforce-ing a more lenient one, giving teams visibility into what would be rejected under a stricter policy before actually flipping enforcement on.

Why this is a meaningful simplification

Pod Security Admission is deliberately less flexible than PSP was — you choose from three fixed levels rather than defining arbitrary custom rules — but this tradeoff was a deliberate design choice: most of PSP's complexity came from unlimited customizability that few teams actually needed and many implemented incorrectly. The three-level model covers the vast majority of real-world security posture needs with drastically simpler, namespace-label-based configuration that's much easier to reason about and audit.

What if you need more customization than the three standard levels offer

For genuinely custom policy needs beyond the three built-in levels, the ecosystem has moved toward general-purpose policy engines — OPA Gatekeeper and Kyverno are the two most widely adopted — which use admission webhooks (see that question) to enforce arbitrary custom policies, not limited to Pod security specifically (they can validate/mutate any Kubernetes object type against custom rules). Many clusters run Pod Security Admission for the baseline hardening it provides simply and natively, layered with a policy engine like Kyverno or Gatekeeper for anything requiring finer-grained or organization-specific rules.

Knowing that PSP was removed (not just deprecated) as of 1.25, and that Pod Security Admission trades PSP's flexibility for three simple, standardized levels specifically to fix PSP's usability/complexity problems, demonstrates awareness of a real, relatively recent, and commonly-tested Kubernetes ecosystem shift — not just familiarity with security concepts in the abstract.

Related Resources

Kubernetes: Pod Security Admission

Open as page

Where admission control fits in the request pipeline

Recall the API server's request pipeline (see the API server question): authentication (who are you) → authorization (are you allowed to do this action) → admission control (should this specific request actually be allowed/modified, given business/policy rules) → persist to etcd. Even a fully authenticated and authorized request can still be rejected or altered at the admission stage — this is where cluster-specific policy enforcement lives, distinct from the more general "is this identity allowed to do this kind of thing at all" question RBAC answers.

Built-in admission controllers

Kubernetes ships with several built-in admission controllers compiled into the API server (enabled via a startup flag) — examples include NamespaceLifecycle (prevents creating objects in a namespace that's being deleted), LimitRanger (enforces LimitRange defaults/constraints), ResourceQuota (enforces namespace resource quotas), and, notably, PodSecurity (implementing Pod Security Admission — see that question).

Custom admission via webhooks

Beyond the built-in set, the API server can call out to external webhook services for custom admission logic — this is how tools like OPA Gatekeeper, Kyverno, Istio's sidecar injector, and cert-manager all plug into the cluster's request pipeline without needing to be built into Kubernetes itself.

MutatingAdmissionWebhook — can modify the object

Incoming request: create a Pod
   → Mutating webhook (e.g., Istio's injector) intercepts it
   → Modifies the Pod spec to add an Envoy sidecar container
   → The MODIFIED Pod spec continues through the pipeline

A mutating webhook receives the incoming object and can return a modified version of it (typically expressed as a JSON patch) — this is exactly the mechanism a service mesh uses to automatically inject a sidecar proxy into every Pod without the Pod's original author needing to include it themselves, and how tools might automatically inject default resource requests/limits, labels, or annotations onto objects that don't specify them.

ValidatingAdmissionWebhook — can only accept or reject

Incoming request: create a Pod (now possibly already mutated above)
   → Validating webhook (e.g., a policy engine) checks it against custom rules
   → Either allows it through unchanged, or rejects it with an error

A validating webhook cannot modify the object at all — it can only inspect the (already fully mutated) object and return an allow/deny decision, optionally with an explanatory message shown back to whoever submitted the request. This is how organization-specific policies get enforced — e.g., "every Deployment must have resource limits set," "container images must come from our approved internal registry," "no Pod may run as root" (for organizations wanting rules beyond what the built-in Pod Security Standards cover).

Why mutating webhooks run before validating ones

This ordering is deliberate and important: mutation happens first, so that by the time validation runs, it's evaluating the final state of the object — including anything automatically added by mutating webhooks — rather than validating an intermediate, incomplete version that's about to change. If this order were reversed, a validating webhook might approve an object that a subsequent mutation then changes into something that would have failed validation, silently undermining the policy enforcement's whole purpose.

Why this matters operationally

Admission webhooks are themselves a critical-path dependency for the entire cluster's ability to create/modify objects — if a webhook service is down, misconfigured, or slow, it can (depending on its configured failurePolicy) either fail open (allow requests through, defeating its purpose) or fail closed (block all matching requests cluster-wide, including entirely legitimate ones, until the webhook service recovers). Deploying webhook services with high availability and sensible timeouts is a genuine, non-trivial operational responsibility for any cluster relying on them for critical policy enforcement.

Related Resources

Kubernetes: Admission Controllers

Open as page

This builds directly on the earlier Secret question's core point: base64 encoding provides no confidentiality, so real security depends entirely on these additional layers.

1. Enable encryption at rest

By default, Secret data sits in etcd effectively as plaintext (base64 is trivially reversible) — anyone with direct etcd access, or an etcd backup file, can read every Secret in the cluster. Encryption at rest (a control-plane API server configuration, specifying an EncryptionConfiguration with a chosen provider — e.g., using a KMS-backed envelope encryption provider for production) ensures Secret data is actually encrypted before being written to etcd, closing this specific gap.

# Simplified EncryptionConfiguration concept (actual setup requires
# API server flags pointing at this config file)
apiVersion: apiserver.config.k8s.io/v1
kind: EncryptionConfiguration
resources:
  - resources: ["secrets"]
    providers:
      - kms:
          name: my-kms-provider
          endpoint: unix:///var/run/kmsplugin/socket.sock
      - identity: {}   # fallback, unencrypted -- should generally come last, not first

Managed Kubernetes services often provide this as a simple toggle (e.g., integrating with the cloud provider's own KMS) rather than requiring you to hand-configure it.

2. Apply least privilege via RBAC

Restrict get/list/watch access on secrets as tightly as possible — most application code needs to consume a Secret (by having it injected as env vars/volumes at Pod creation, which the kubelet handles) without ever needing direct RBAC permission to read arbitrary Secret objects via the API itself. Broad get secrets access granted casually (e.g., to an overly permissive default ServiceAccount, or a CI/CD pipeline's ServiceAccount) is one of the most common real ways Secret data actually leaks in practice.

3. Never commit raw Secret manifests to version control

A plain Secret YAML file, even with base64-encoded values, is not safe to commit to git — anyone with repository access (and, for public repos, the entire internet, plus automated credential-scanning bots) can decode it instantly. Tools like Sealed Secrets (Bitnami) or SOPS encrypt Secret values with a key only the cluster (or a designated key holder) can decrypt, producing a genuinely safe-to-commit encrypted representation that's only decrypted back into a real Secret object inside the cluster itself.

# A "SealedSecret" -- safe to commit, since only the cluster's private key can decrypt it
apiVersion: bitnami.com/v1alpha1
kind: SealedSecret
metadata:
  name: db-credentials
spec:
  encryptedData:
    password: AgBy8hCi8vY2Kj...    # ciphertext, useless without the cluster's decryption key

4. Consider an external secrets manager for the most sensitive data

For genuinely sensitive values (production database credentials, payment processor API keys), many teams keep the actual secret value in a dedicated external secrets manager (HashiCorp Vault, AWS Secrets Manager) — which provides stronger audit trails, automated rotation, and centralized access control than native Kubernetes Secrets alone — and use a tool like the External Secrets Operator to sync those values into native Kubernetes Secrets automatically, or have applications fetch them directly at runtime via a sidecar/init pattern, never requiring the raw secret value to be manually authored as a Kubernetes manifest at all.

Practical guidance, layered

Not every cluster needs every layer here — a reasonable baseline for most production clusters is encryption at rest plus tight RBAC plus never committing raw Secrets to git; reaching for a full external secrets manager integration is justified once the sensitivity of the data, compliance requirements, or the scale/complexity of secret rotation needs genuinely warrant the additional operational complexity.

Related Resources

Kubernetes: Encrypting Confidential Data at Rest

Open as page

Since every single interaction with a cluster goes through the API server (see the fundamentals topic), it's the single highest-value target for an attacker — compromising it means compromising the entire cluster's ability to read and modify anything.

Restrict network exposure

Most managed Kubernetes offerings let you configure the API server's endpoint as private (reachable only from within a VPC/private network) or restrict access via an IP allowlist, rather than leaving it open to the public internet by default. For self-managed clusters, placing the API server behind a firewall, requiring VPN access, or routing access through a bastion host are standard practices — an unrestricted, publicly-reachable API server is a direct, high-value attack surface that should essentially never be left wide open in a production environment.

Require strong authentication

Kubernetes supports several authentication methods, and choosing well matters:

Client certificates — strong, but require careful certificate lifecycle/rotation management.
OIDC integration (tying cluster authentication to an existing identity provider — Okta, Google Workspace, Azure AD) — lets human user authentication piggyback on an organization's existing SSO, MFA, and offboarding processes, rather than managing a separate credential set just for cluster access.
Cloud provider IAM integration (e.g., AWS IAM authenticator for EKS) — similarly ties cluster access to infrastructure the organization already manages centrally.
Static tokens — simple, but lack rotation, revocation granularity, and audit integration comparable to the above; generally discouraged for anything beyond narrow, well-controlled use cases.

Enforce least-privilege RBAC

Authentication only establishes who — RBAC (see that question) determines what they're allowed to do once authenticated, and should be scoped as narrowly as the real need allows for every user, group, and ServiceAccount, exactly mirroring the least-privilege principle covered for database access in other stacks (the same underlying security principle, applied to Kubernetes's own access model).

Enable and monitor audit logging

# Simplified audit policy concept -- what to log, and at what detail level
apiVersion: audit.k8s.io/v1
kind: Policy
rules:
  - level: Metadata
    resources:
      - group: ""
        resources: ["secrets", "configmaps"]
  - level: RequestResponse
    resources:
      - group: "rbac.authorization.k8s.io"

The API server can be configured to emit a detailed audit log of every request — who made it, what they did, what the result was — which is essential both for after-the-fact incident investigation and for detecting suspicious activity (unusual access patterns, unexpected privilege escalation attempts) in something closer to real time when paired with log monitoring/alerting.

Keep the control plane patched and current

Kubernetes releases security patches regularly, and running a significantly outdated version means missing fixes for known vulnerabilities — managed Kubernetes services typically handle control-plane patching largely automatically (though worker node upgrades usually still require explicit action), while self-managed clusters require a deliberate, disciplined upgrade cadence.

Limit what runs with cluster-admin-equivalent power

cluster-admin (a built-in ClusterRole with unrestricted access to everything) should be granted extremely sparingly — to a small number of genuinely trusted human operators or automation processes, never as a default convenience for application teams or CI pipelines that only need narrow access to specific namespaces/resources.

A strong answer treats API server security as a defense-in-depth problem spanning network exposure, authentication strength, authorization scope, and observability — not a single setting to flip, reflecting that the API server's centrality to the whole cluster's security model means weaknesses in any one of these layers can undermine the others.

Related Resources

Kubernetes: Controlling Access to the Kubernetes API

Open as page

The anti-pattern: broad, convenient permissions

apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: way-too-broad
rules:
  - apiGroups: ["*"]
    resources: ["*"]
    verbs: ["*"]     # everything, on everything, cluster-wide

Granting this to a ServiceAccount used by, say, a CI/CD pipeline that only actually needs to deploy Deployments and Services into one specific namespace means that if the CI pipeline's credentials are ever compromised (a leaked token, a supply-chain attack on a pipeline dependency), the attacker gets complete control of the entire cluster — every namespace, every Secret, every Node — vastly beyond what the pipeline ever legitimately needed.

The least-privilege alternative

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: staging
  name: ci-deployer
rules:
  - apiGroups: ["apps"]
    resources: ["deployments"]
    verbs: ["get", "list", "create", "update", "patch"]
  - apiGroups: [""]
    resources: ["services"]
    verbs: ["get", "list", "create", "update", "patch"]

Scoped to exactly the resource types, exactly the verbs, and exactly the namespace the CI pipeline actually needs — a compromise of this specific credential is now bounded to "can mess with Deployments and Services in staging," a vastly smaller blast radius than full cluster-admin.

Practical principles for applying this

Namespace-scope by default — prefer Role + RoleBinding over ClusterRole + ClusterRoleBinding whenever the need genuinely doesn't span the whole cluster (which is most of the time).
Enumerate specific resources and verbs, avoid wildcards — resources: ["pods"] and verbs: ["get", "list"] rather than resources: ["*"] and verbs: ["*"], even when it's more tedious to write out.
Distinct ServiceAccounts per distinct responsibility — don't reuse one broadly-permissioned ServiceAccount across multiple unrelated controllers/pipelines; each should have its own narrowly-scoped identity, so a compromise of one doesn't cascade into unrelated systems' access.
Regularly audit granted permissions against what's actually used — kubectl auth can-i --list --as=system:serviceaccount:<namespace>:<name> and similar tooling can reveal permissions granted "just in case" long ago that were never actually needed and were never revoked.
Avoid cluster-admin except for a small number of genuinely trusted humans/processes — this built-in role should be treated as an exceptional, rarely-granted escape hatch, not a convenient default for anyone who occasionally needs broader access.

Why this is worth stating as a deliberate design principle, not just a checklist

The value of articulating least privilege explicitly (rather than just listing RBAC syntax) is recognizing that RBAC's flexibility to grant broad permissions doesn't mean broad permissions are ever the right default — every grant should be justified by an actual, specific need, and the discipline of scoping down to exactly that need (even when a broader grant would technically "just work" with less upfront effort) is what limits real-world damage when — not if — some credential eventually leaks or some component is compromised.

Related Resources

Kubernetes: Using RBAC Authorization