Kubernetes Horizontal Pod Autoscaler
External
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale-walkthrough/
- https://towardsdatascience.com/kubernetes-hpa-with-custom-metrics-from-prometheus-9ffc201991e
- https://github.com/kubernetes/community/blob/master/contributors/design-proposals/instrumentation/custom-metrics-api.md
Internal
Overview
Horizontal pod autoscaling is the automatic increase or decrease of the number of pod replicas managed by a scalable higher-level controller. The scaling is performed by a horizontal controller and is configured through a horizontal pod autoscaler Kubernetes API resource.
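To make the scaling decision concrete, the following is a minimal Python sketch of the replica calculation described in the upstream Kubernetes documentation linked above: desired replicas = ceil(current replicas × current metric / target metric). The function name, parameter names and default bounds are hypothetical and chosen for illustration; the 10% tolerance reflects the documented default and is an assumption about the cluster's configuration. The real controller also handles unready pods, missing metrics and stabilization windows, which this sketch omits.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Simplified sketch of the horizontal controller's replica calculation.

    All names and defaults are illustrative, not part of any Kubernetes API.
    """
    ratio = current_metric / target_metric
    # Skip scaling when the metric is close enough to the target.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Keep the result within the autoscaler's configured bounds.
    return max(min_replicas, min(max_replicas, desired))


# 4 replicas at 90% average CPU utilization, with a 50% target.
print(desired_replicas(4, 90, 50))
```

With these inputs the ratio is 1.8, so the sketch asks for 8 replicas; a ratio within the tolerance band leaves the replica count unchanged.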
As a prerequisite, if the autoscaler is to monitor resource metrics, a source of resource metrics must be deployed. The simplest such source is the metrics server.
A horizontal pod autoscaler works with a previously deployed higher-level controller that supports scaling, such as a deployment, replica set or stateful set.
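As an illustration of how the autoscaler resource refers to its target controller, the sketch below builds an autoscaling/v2 HorizontalPodAutoscaler manifest in Python and prints it as JSON. The helper name hpa_manifest and the resource names example-hpa and example-app are hypothetical; the sketch assumes a deployment named example-app already exists and that a resource metrics source such as the metrics server is running.

```python
import json


def hpa_manifest(name, target_kind, target_name,
                 min_replicas, max_replicas, cpu_utilization):
    """Build a HorizontalPodAutoscaler manifest as a plain dict.

    The helper and its arguments are illustrative; only the manifest
    structure follows the autoscaling/v2 API.
    """
    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": name},
        "spec": {
            # The previously deployed scalable controller to act on.
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": target_kind,
                "name": target_name,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            # Scale on average CPU utilization across the target's pods.
            "metrics": [{
                "type": "Resource",
                "resource": {
                    "name": "cpu",
                    "target": {
                        "type": "Utilization",
                        "averageUtilization": cpu_utilization,
                    },
                },
            }],
        },
    }


# Hypothetical names: a deployment "example-app" scaled between 1 and 5 pods.
manifest = hpa_manifest("example-hpa", "Deployment", "example-app", 1, 5, 50)
print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with kubectl apply -f, since kubectl accepts JSON as well as YAML manifests.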