Scaling & Capacity Planning
Plan and configure resources based on your concurrent scanning requirements.
Understanding Resource Units
Kubernetes uses specific units for CPU and memory:
| Unit | Meaning | Equivalent |
|---|---|---|
| m (millicores) | CPU measurement | 1000m = 1 CPU core |
| Mi (mebibytes) | Memory measurement | 1Mi = 1024 × 1024 bytes ≈ 1 MB |
| Gi (gibibytes) | Memory measurement | 1Gi = 1024Mi ≈ 1 GB |
For example, 200m CPU means 0.2 CPU cores (20% of one core), and 256Mi means approximately 256 MB of memory.
How Scanning Works
When a scan is triggered:
- Scan Scheduler receives the request and queues it in Redis
- Scan Manager picks up the job and creates an ephemeral Scan Worker pod
- For JavaScript-heavy applications, a Chrome Instance pod is also created
- Once the scan completes, the ephemeral pods are automatically cleaned up
Each Scan Manager pod handles up to 5 concurrent scans by default.
Key formula: scan_manager_replicas × 5 = max concurrent scans
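A minimal Terraform sketch of that relationship, reusing the scan_manager_replicas value that the configuration examples below set (the local name here is purely illustrative):

variable "scan_manager_replicas" {
  type    = number
  default = 4
}

locals {
  max_concurrent_scans = var.scan_manager_replicas * 5   # 4 × 5 = 20 concurrent scans
}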
Component Resources
Static Components (Always Running)
These components run continuously regardless of scan activity:
| Component | CPU Request | Memory Request | CPU Limit | Memory Limit | Default Replicas |
|---|---|---|---|---|---|
| Scan Scheduler | 200m | 256Mi | 1000m | 1Gi | 2 |
| Scan Manager | 200m | 256Mi | 1000m | 1Gi | 1 |
| Chrome Controller | 200m | 512Mi | 1000m | 2Gi | 1 |
| Redis | 100m | 128Mi | 500m | 512Mi | 1 |
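With the default replica counts above, the always-on baseline works out to roughly 0.9 CPU cores and about 1.4Gi of memory in requests. A quick sketch of that arithmetic as illustrative Terraform locals (not module inputs):

locals {
  static_cpu_requests_m     = 2 * 200 + 200 + 200 + 100   # Scheduler ×2 + Manager + Chrome Controller + Redis = 900m (~0.9 cores)
  static_memory_requests_mi = 2 * 256 + 256 + 512 + 128   # = 1408Mi (~1.4Gi)
}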
Dynamic Components (Per Active Scan)
These pods are created on-demand for each running scan:
| Component | CPU Request | Memory Request | CPU Limit | Memory Limit | Notes |
|---|---|---|---|---|---|
| Scan Worker | 100m | 256Mi | 1000m | 1Gi | 1 pod per scan |
| Chrome Instance | 100m | 256Mi | 1000m | 1Gi | 1 pod per scan (JavaScript-heavy apps), plus 2Gi /dev/shm |
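The deployment creates these ephemeral pods for you, so nothing here needs to be written by hand; the sketch below only illustrates, in Terraform kubernetes-provider syntax, how the requests, limits, and memory-backed /dev/shm from the table map onto a pod spec (the pod name and image are placeholders, not the product's actual manifest):

resource "kubernetes_pod" "chrome_instance_example" {
  metadata {
    name = "chrome-instance-example"   # placeholder name, for illustration only
  }

  spec {
    container {
      name  = "chrome"
      image = "example/chrome:latest"  # placeholder image

      resources {
        requests = {
          cpu    = "100m"    # 0.1 core
          memory = "256Mi"
        }
        limits = {
          cpu    = "1000m"   # 1 full core
          memory = "1Gi"
        }
      }

      volume_mount {
        name       = "dshm"
        mount_path = "/dev/shm"
      }
    }

    volume {
      name = "dshm"
      empty_dir {
        medium     = "Memory"
        size_limit = "2Gi"   # the extra 2Gi of shared memory noted in the table
      }
    }
  }
}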
Capacity Planning Table
Use this table to plan resources based on your concurrent scan requirements:
| Concurrent Scans | Scan Manager Replicas | Scan Scheduler Replicas | Chrome Controller | Total CPU | Total Memory |
|---|---|---|---|---|---|
| 5 | 1 | 1 | 1 | ~2 vCPU | ~8 Gi |
| 10 | 2 | 2 | 1 | ~4 vCPU | ~16 Gi |
| 20 | 4 | 2 | 1 | ~8 vCPU | ~32 Gi |
| 50 | 10 | 3 | 2 | ~16 vCPU | ~64 Gi |
| 100 | 20 | 3 | 2 | ~32 vCPU | ~128 Gi |
Total CPU/Memory = aggregate across all nodes. Your cluster’s node autoscaler distributes this across multiple nodes automatically.
Calculation formula:
- Scan Manager replicas = ceil(concurrent_scans / 5)
- Per scan memory (peak) = 1Gi (worker) + 3Gi (Chrome + /dev/shm) = 4Gi
- Per scan CPU (peak) = 1 core (worker) + 1 core (Chrome) = 2 cores
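If you prefer to derive these numbers in Terraform itself, a sketch like the following works (the variable and local names are illustrative, not module inputs):

variable "concurrent_scans" {
  type    = number
  default = 20
}

locals {
  scan_manager_replicas = ceil(var.concurrent_scans / 5)   # ceil(20 / 5) = 4

  peak_scan_memory_gi = var.concurrent_scans * 4   # 1Gi (worker) + 3Gi (Chrome + shm) per scan
  peak_scan_cpu_cores = var.concurrent_scans * 2   # 1 core (worker) + 1 core (Chrome) per scan
}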
No hard limit: There is no maximum concurrent scan limit enforced by the software. Scale as high as your cluster resources allow.
Node Autoscaling
Most managed Kubernetes services (EKS, AKS, GKE) support automatic node scaling. When enabled:
- Nodes are created on demand - When pods are scheduled, the autoscaler provisions nodes
- Right-sized instances - Selects appropriate instance types based on pod resource requests
- Horizontal scaling - Creates multiple smaller nodes rather than one large node
- Scale to zero - Terminates idle scan nodes when scans complete; only the nodes hosting the always-running components remain
You don’t need to pre-provision large nodes. The autoscaler handles it automatically.
Example: For 20 concurrent scans needing ~8 vCPU / ~32 Gi total, the autoscaler might create:
- 4× small nodes (2 vCPU / 8 Gi each), or
- 2× medium nodes (4 vCPU / 16 Gi each)
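On EKS, for example, this usually means giving the autoscaler a bounded node group (or a Karpenter provisioner) and letting it adjust the size. A rough sketch of such a node group, assuming the cluster, node IAM role, and subnets are defined elsewhere:

resource "aws_eks_node_group" "scan_nodes" {
  cluster_name    = "example-cluster"        # placeholder cluster name
  node_group_name = "scan-workers"
  node_role_arn   = aws_iam_role.nodes.arn   # assumes a node IAM role defined elsewhere
  subnet_ids      = var.private_subnet_ids   # assumes existing private subnets
  instance_types  = ["m5.xlarge"]            # 4 vCPU / 16 GiB per node

  scaling_config {
    min_size     = 1
    max_size     = 4    # up to ~16 vCPU / ~64 GiB for scan bursts
    desired_size = 1
  }
}

Cluster Autoscaler or Karpenter then adds and removes nodes within these bounds as scan pods are scheduled and cleaned up.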
Configuring Replicas
Static Configuration
Set replica counts directly in your Terraform configuration. Example for 20 concurrent scans:
scan_scheduler_replicas = 2
scan_manager_replicas = 4
chrome_controller_replicas = 1
Enabling Autoscaling
For dynamic workloads, enable Horizontal Pod Autoscaler (HPA) to automatically scale based on CPU utilization:
scan_scheduler_autoscaling = {
  enabled = true
  min_replicas = 2
  max_replicas = 10
  target_cpu_utilization_percentage = 70
}

scan_manager_autoscaling = {
  enabled = true
  min_replicas = 1
  max_replicas = 20
  target_cpu_utilization_percentage = 80
}

With HPA enabled, the system automatically scales based on CPU utilization. The default maximum of 20 scan-manager replicas supports 100 concurrent scans, but you can increase max_replicas to scale further; the only limit is your cluster’s available resources.
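For example, to allow up to 200 concurrent scans, raise the Scan Manager ceiling accordingly:

scan_manager_autoscaling = {
  enabled = true
  min_replicas = 1
  max_replicas = 40   # 40 × 5 = 200 concurrent scans
  target_cpu_utilization_percentage = 80
}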
Cost Considerations
Costs vary by cloud provider. See your deployment guide for provider-specific estimates:
- AWS Costs
- Azure Costs (coming soon)
- GCP Costs (coming soon)
General guidance:
- With node autoscaling, you only pay for scan-time compute while scans are running
- More concurrent scans = higher compute costs
- Static components have a baseline cost regardless of scan activity
Next Steps
- Troubleshooting - Monitoring, maintenance, and common issues