Question 1

What are resource requests?

Accepted Answer

The amount of CPU/memory Kubernetes guarantees to your container. The scheduler uses requests to decide which node to place your pod on. Always set requests.

Question 2

What are resource limits?

Accepted Answer

The maximum CPU/memory your container can use. Exceeding memory limit = OOMKilled. Exceeding CPU limit = throttled. Some teams don't set CPU limits (controversial).

Question 3

What does "m" mean in CPU?

Accepted Answer

Millicores. 1000m = 1 full CPU core. 250m = 25% of a core. 100m = 10% of a core. You can use decimals too: 0.25 = 250m.

Question 4

What does "Mi" mean in memory?

Accepted Answer

Mebibytes (1 Mi = 1,048,576 bytes). Also: Ki (kibibytes), Gi (gibibytes). These are base-2 units. Don't confuse with MB (base-10).

Question 5

Should I always set limits?

Accepted Answer

Always set memory limits (prevents OOMKilled of other pods). CPU limits are debated — some teams prefer no CPU limits to avoid throttling. Always set requests.

Question 6

How do I find actual usage?

Accepted Answer

Use kubectl top pods, metrics-server, or Prometheus. Look at P95 actual usage over a week. Set requests to P95 usage + 20% buffer.

Question 7

Why does Java need more memory?

Accepted Answer

The JVM has heap memory, metaspace, thread stacks, GC overhead, and native memory. A "256MB" Java app often needs 512MB-1GB total. Set -Xmx to 75% of memory limit.

Question 8

What's a good request:limit ratio?

Accepted Answer

1:2 for production (safe). 1:4 for development (efficient). 1:1 for critical services (Guaranteed QoS). Never set requests higher than limits.

Question 9

How do I handle memory leaks?

Accepted Answer

Set memory limits to catch leaks early (container gets OOMKilled and restarted). Use readiness probes so traffic shifts before the restart.

Question 10

Should I use Vertical Pod Autoscaler?

Accepted Answer

VPA automatically adjusts requests based on actual usage. Good for right-sizing but can cause pod restarts. Use in "recommend" mode first.

Question 11

Why is my pod OOMKilled?

Accepted Answer

Container exceeded its memory limit. Increase memory limit, fix memory leaks, or reduce in-process caching. Check: kubectl describe pod for OOMKilled status.

Question 12

Why is my pod stuck in Pending?

Accepted Answer

Insufficient cluster resources to satisfy requests. Either reduce requests, add nodes, or check resource quotas in the namespace.

Question 13

Why is my app slow but CPU usage is low?

Accepted Answer

CPU throttling. Kubernetes limits CPU in 100ms windows. Your app may spike to limit in bursts. Increase CPU limit or remove it.

Question 14

Why does my pod keep restarting?

Accepted Answer

OOMKilled (memory), failed liveness probe (too slow to respond), or application crash. Check kubectl describe pod and kubectl logs.

Question 15

How does Warden help with container issues?

Accepted Answer

Warden monitors your services externally. When pods are OOMKilled or throttled, Warden detects the degraded response times and alerts you before users complain.

Workload	CPU Req	CPU Limit	Mem Req	Mem Limit
Web Server	100m	500m	128Mi	256Mi
API Service	250m	1000m	256Mi	512Mi
Background Worker	500m	2000m	512Mi	1Gi
Database	500m	2000m	1Gi	2Gi
Cache (Redis)	100m	500m	256Mi	512Mi
Message Queue	250m	1000m	512Mi	1Gi

Container Size Calculator

Recommended Resources

Kubernetes YAML

Common Resource Profiles by Workload

How to Use This Calculator

The Essentials

Frequently Asked Questions

Sources & References

Related Tools

K8s Cost Estimator

Error Budget Calculator

Uptime SLA Calculator

Running containers in production?