Kubernetes Horizontal Pod Autoscaler
Revision as of 02:50, 7 October 2020
External
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale-walkthrough/
- https://towardsdatascience.com/kubernetes-hpa-with-custom-metrics-from-prometheus-9ffc201991e
- https://github.com/kubernetes/community/blob/master/contributors/design-proposals/instrumentation/custom-metrics-api.md
Internal
Overview
Horizontal pod autoscaling is the automatic increase or decrease of the number of pod replicas managed by a scalable higher-level controller. The scaling is performed by a horizontal controller, which is controlled by a horizontal pod autoscaler Kubernetes API resource.
As a prerequisite, if the autoscaler is to monitor resource metrics, a source of resource metrics must be deployed. The simplest source of resource metrics is the metrics server.
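To illustrate how an observed metric can translate into a replica count, here is a minimal sketch of the proportional rule described in the upstream Kubernetes documentation: the desired count is the current count scaled by the ratio of the observed metric to the target, rounded up. The function name, its parameters, and the clamping to a minimum and maximum are illustrative assumptions; the real controller also applies a tolerance band and stabilization behavior, which this sketch omits.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas, max_replicas):
    """Hypothetical helper: approximate the replica count an autoscaler would request.

    Applies ceil(current_replicas * current_metric / target_metric) and then
    clamps the result to [min_replicas, max_replicas]. Tolerance and
    stabilization windows are intentionally left out of this sketch.
    """
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    raw = math.ceil(current_replicas * current_metric / target_metric)
    return max(min_replicas, min(max_replicas, raw))


# Example values (made up): 4 replicas averaging 90% CPU against a 50% target.
scaled_up = desired_replicas(4, 90, 50, min_replicas=1, max_replicas=10)
# The same 4 replicas averaging 25% CPU against the same target.
scaled_down = desired_replicas(4, 25, 50, min_replicas=1, max_replicas=10)
print(scaled_up, scaled_down)
```

With these sample numbers, the load above the target leads to more replicas and the load below it leads to fewer, always within the configured bounds.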
A horizontal pod autoscaler works with a previously deployed higher-level controller that supports scaling (a deployment, replica set, or stateful set).
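The relationship between an autoscaler and its target controller can be sketched by building a horizontal pod autoscaler manifest that points at an existing deployment. This is a sketch under stated assumptions: it uses the stable `autoscaling/v1` API with a CPU utilization target, and the helper name, the resource names ("web", "web-hpa"), and the numeric values are hypothetical placeholders.

```python
import json


def build_hpa_manifest(name, target_kind, target_name,
                       min_replicas, max_replicas, cpu_percent):
    """Hypothetical helper: return an autoscaling/v1 HorizontalPodAutoscaler manifest.

    target_kind is expected to be a scalable controller kind from the apps/v1
    group, such as "Deployment", "ReplicaSet", or "StatefulSet".
    """
    return {
        "apiVersion": "autoscaling/v1",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": name},
        "spec": {
            # The previously deployed controller whose replica count is adjusted.
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": target_kind,
                "name": target_name,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            # Average CPU utilization, as a percentage of the pods' CPU requests.
            "targetCPUUtilizationPercentage": cpu_percent,
        },
    }


# Assumed example: autoscale a deployment named "web" between 2 and 10 replicas.
manifest = build_hpa_manifest("web-hpa", "Deployment", "web", 2, 10, 70)
print(json.dumps(manifest, indent=2))
```

Because kubectl accepts JSON as well as YAML, the printed document could be saved to a file and applied with `kubectl apply -f`, assuming the target deployment already exists and a resource metrics source is running.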