Kubernetes Resource Requests and Limits

Container-level CPU and memory controls that let the scheduler fit Pods onto nodes and protect nodes from runaway workloads. Synthesized from CKA Day 16 - Kubernetes Requests and Limits.

What Are Requests and Limits?

Kubernetes schedules Pods by evaluating many filters: node health, taints, tolerations, node affinity, selectors, and available resources. Resource requests and limits are the CPU/memory side of that decision.

Field	Meaning	When It Matters
`resources.requests.cpu`	CPU capacity reserved for scheduling	Before the Pod is placed
`resources.requests.memory`	Memory capacity reserved for scheduling	Before the Pod is placed
`resources.limits.cpu`	Maximum CPU the container may consume	While the container runs
`resources.limits.memory`	Maximum memory the container may consume	While the container runs

Request = scheduler promise. The kube-scheduler only places a Pod on a node if the node has enough remaining allocatable capacity for the Pod’s requests. Limit = runtime guardrail. If a container exceeds its memory limit, Kubernetes kills the container with OOMKilled rather than allowing it to exhaust the node. Source: CKA Day 16

YAML Anatomy

Resource settings are container fields because CPU and memory are consumed by containers, not by the Pod object itself:

apiVersion: v1
kind: Pod
metadata:
  name: memory-demo
  namespace: mem-example
spec:
  containers:
  - name: memory-demo-ctr
    image: polinux/stress
    resources:
      requests:
        memory: "100Mi"
        cpu: "250m"
      limits:
        memory: "200Mi"
        cpu: "500m"
    command: ["stress"]
    args: ["--vm", "1", "--vm-bytes", "150M", "--vm-hang", "1"]

CKA syntax memory: spec -> containers[] -> resources -> requests/limits -> cpu/memory.

Scheduler Behaviour

Requests participate in scheduling alongside the placement primitives from Manual Scheduling:

Scheduler sees an unscheduled Pod.
It filters nodes that cannot fit the Pod’s requested CPU/memory.
It also filters by taints/tolerations, node selectors, node affinity, and node conditions.
If at least one node fits, the scheduler binds the Pod.
If no node fits, the Pod remains Pending and kubectl describe pod shows events such as Insufficient memory or Insufficient cpu.

This is why a Pod requesting 1000Gi memory stays Pending even if its command would only try to use 150M: scheduling uses declared requests, not future actual usage. Source: CKA Day 16

Runtime Behaviour

Limits govern what happens after the Pod starts:

Runtime Condition	Result
Usage stays between request and limit	Pod continues running
Memory usage exceeds limit	Container is killed with `OOMKilled`
Request exceeds node allocatable capacity	Pod does not schedule; remains `Pending`
CPU demand exceeds CPU limit	CPU is throttled rather than immediately killed

The lesson’s memory stress demo uses polinux/stress to show the difference between running within the limit, exceeding the limit, and requesting impossible capacity. The key operational idea is blast-radius control: prefer killing one over-consuming Pod to letting it exhaust the whole node. Source: CKA Day 16

Metrics Server and `kubectl top`

Metrics Server exposes CPU and memory usage for nodes and Pods. The lesson installs a Metrics Server manifest, verifies the Pod in the kube-system Namespace, and then uses:

kubectl top node
kubectl top pod memory-demo -n mem-example

Metrics Server is also the data source for autoscaling flows such as HPA and VPA, which the course treats as later topics. For Day 16, the immediate value is visibility: you can verify whether a stress-test Pod is consuming the memory you expected. Source: CKA Day 16

Namespace Governance Connection

Requests and limits become more powerful when combined with Namespace-level policy:

ResourceQuota caps aggregate requested and limited CPU/memory for a Namespace.
LimitRange can define default requests/limits so users cannot create unconstrained Pods by accident.
A demo Namespace like mem-example isolates stress tests from other workloads.

In production, this is how platform teams prevent one team, app, or environment from consuming the whole shared cluster.

Troubleshooting Matrix

Symptom	Likely Cause	Command	Fix
Pod stuck `Pending`	Request cannot fit on any node	`kubectl describe pod <pod>`	Lower requests or add capacity
Event says `Insufficient memory`	`requests.memory` exceeds available allocatable memory	`kubectl describe pod <pod>`	Reduce request or schedule to larger node
Container repeatedly restarts with `OOMKilled`	Actual memory usage exceeds `limits.memory`	`kubectl describe pod <pod>`	Fix leak, reduce load, or raise memory limit
`kubectl top` has no data	Metrics Server not installed or not ready	`kubectl get pods -n kube-system`	Install/fix Metrics Server
Node pressure after workload deploy	Missing/too-high limits allow runaway consumption	`kubectl top node`	Add limits and validate workload profile

CKA Exam Speed Patterns

# Create namespace for resource demos
kubectl create ns mem-example
 
# Apply a Pod with resource settings
kubectl apply -f mem-request.yaml
 
# Inspect scheduling failures and OOMKilled states
kubectl describe pod <pod> -n mem-example
 
# Observe live resource usage
kubectl top node
kubectl top pod <pod> -n mem-example
 
# Generate a Pod manifest quickly, then add resources manually
kubectl run stress --image=polinux/stress --restart=Never \
  --dry-run=client -o yaml > pod.yaml

Pod Fundamentals - the object that carries container resource specs
Kubernetes Architecture - kube-scheduler, kubelet, and Metrics Server context
Kubernetes Manual Scheduling - resource fitting alongside node selectors, taints, and affinity
Kubernetes Node Affinity - placement constraints that combine with resource requests
Kubernetes Taints and Tolerations - node filters evaluated with resource availability
Kubernetes Namespaces - ResourceQuota and LimitRange governance boundary
Deployment, ReplicaSet & Replication Controller - controllers recreate Pods killed by limits
Kubernetes DaemonSet - node agents and Metrics Server-style add-ons need budgets
Multi-Container Pods - each container can have distinct requests and limits
CKA Certification - exam domains and troubleshooting relevance
CKA Study Roadmap - Day 16 in the 40-day plan
Tech Tutorials with Piyush - course source
Kubernetes Autoscaling - requests/limits are the prerequisite for HPA and VPA
Horizontal Pod Autoscaler (HPA) - utilization calculations depend on declared requests

Tags: kubernetes resource-requests resource-limits metrics-server scheduling cka devops troubleshooting

Rakesh's Brain

Explorer

Kubernetes Resource Requests and Limits

Kubernetes Resource Requests and Limits

What Are Requests and Limits?

YAML Anatomy

Scheduler Behaviour

Runtime Behaviour

Metrics Server and `kubectl top`

Namespace Governance Connection

Troubleshooting Matrix

CKA Exam Speed Patterns

Table of Contents

Graph View

Latest Blog Posts

Backlinks

Rakesh's Brain

Explorer

Kubernetes Resource Requests and Limits

Kubernetes Resource Requests and Limits

What Are Requests and Limits?

YAML Anatomy

Scheduler Behaviour

Runtime Behaviour

Metrics Server and kubectl top

Namespace Governance Connection

Troubleshooting Matrix

CKA Exam Speed Patterns

Related Pages

Table of Contents

Graph View

Latest Blog Posts

Backlinks

Metrics Server and `kubectl top`