Horizontal Pod Autoscaler Operations
Create an Autoscaler
CPU-Based Autoscaler
For the autoscaler to work, the pods subject to autoscaling must have their CPU requests set.
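The arithmetic behind this requirement can be sketched in shell: CPU utilization is the pod's usage expressed as a percentage of its CPU request, so a pod without a request has no defined utilization. The function name `cpu_utilization_percent` and the millicore figures are illustrative assumptions, not part of Kubernetes.

```shell
# Hypothetical helper: CPU utilization as a percentage of the CPU request.
# Both arguments are in millicores (for example, 250 means 250m).
cpu_utilization_percent() {
  local usage_m=$1
  local request_m=${2:-0}
  if [ "$request_m" -le 0 ]; then
    # Without a request there is nothing to divide by.
    echo "no CPU request set; utilization is undefined" >&2
    return 1
  fi
  # Integer percentage, rounded down.
  echo $(( usage_m * 100 / request_m ))
}

cpu_utilization_percent 250 500   # prints 50
```

On an existing deployment, a request can be added with `kubectl set resources deployment <deployment-name> --requests=cpu=200m`; the 200m value is only an example.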
With CLI
kubectl autoscale deployment <deployment-name> --cpu-percent=50 --min=1 --max=10
With Metadata
apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: example
spec:
  minReplicas: 1
  maxReplicas: 5
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: httpd
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50
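To illustrate how a utilization target of 50 translates into a replica count, the following is a minimal sketch of the documented scaling rule, desired = ceil(current replicas × current utilization / target utilization), clamped to the configured minimum and maximum. The function name `desired_replicas` is hypothetical, and the sketch deliberately ignores the controller's tolerance band and its handling of unready pods.

```shell
# Hypothetical helper: approximate the replica count the autoscaler aims for.
# Arguments: current replicas, current average utilization (%),
#            target average utilization (%), minReplicas, maxReplicas.
desired_replicas() {
  local current=$1 current_util=$2 target_util=$3 min=$4 max=$5
  # Integer ceiling of current * current_util / target_util.
  local desired=$(( (current * current_util + target_util - 1) / target_util ))
  # Clamp the result to the configured bounds.
  if [ "$desired" -lt "$min" ]; then desired=$min; fi
  if [ "$desired" -gt "$max" ]; then desired=$max; fi
  echo "$desired"
}

desired_replicas 2 100 50 1 5   # prints 4
```

With the bounds from the manifest (1 and 5), four replicas at 100 percent utilization would call for eight, which the sketch caps at 5.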
Horizontal Pod Autoscaler Controller Configuration
Sets the period of the control loop. By default, the value is 15 seconds.
The value for this option is a duration that specifies how long the autoscaler must wait after a downscale operation completes before it can perform another one. The default value is 5 minutes. Also see:
Set this to true, or leave it unset, to have the autoscaler read metrics from the metrics APIs. Setting it to false switches to Heapster-based autoscaling, which is deprecated. Also see:
Horizontal Pod Autoscaler Controller Troubleshooting
Horizontal Pod Autoscaler operations are logged in the controller-manager logs. Failures, such as being unable to pull a specific metric from a metrics API, are logged there as well.
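As a starting point for narrowing those logs down, here is a small sketch of a filter that keeps only lines mentioning the horizontal autoscaler. The function name `filter_hpa_logs` is hypothetical, and the search pattern is an assumption about how the relevant lines are worded; adjust it to what your logs actually contain.

```shell
# Hypothetical helper: keep only log lines that mention the horizontal
# autoscaler. The pattern 'horizontal' is an assumption, not a guaranteed
# marker of every relevant line.
filter_hpa_logs() {
  grep -i 'horizontal'
}

# Example usage (requires cluster access; the pod name is a placeholder
# and depends on how the controller manager is deployed):
#   kubectl -n kube-system logs <controller-manager-pod> | filter_hpa_logs
#
# The autoscaler's own conditions and events can also be inspected with:
#   kubectl describe hpa example
```

On clusters where the controller manager is not exposed as a pod, for example some managed offerings, the logs have to be obtained through the provider's own tooling instead.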