Kubernetes Horizontal Pod Autoscaler: Difference between revisions
Revision as of 02:54, 7 October 2020
External
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
- https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale-walkthrough/
- https://towardsdatascience.com/kubernetes-hpa-with-custom-metrics-from-prometheus-9ffc201991e
- https://github.com/kubernetes/community/blob/master/contributors/design-proposals/instrumentation/custom-metrics-api.md
Internal
Overview
Horizontal pod autoscaling is the automatic increase or decrease in the number of pod replicas managed by a higher-level controller that supports scaling (deployments, replica sets, stateful sets). The scaling is performed by a horizontal controller, which is controlled by a horizontal pod autoscaler Kubernetes API resource. For a horizontal pod autoscaler to work correctly, a source of metrics, in particular resource metrics, must be deployed. The simplest source of resource metrics is the metrics server.
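As a rough illustration of how a metric reading could translate into a replica count, the sketch below follows the ratio-based calculation described in the upstream Kubernetes documentation. The function name, the parameter names, and the default bounds are hypothetical and chosen only for this example. The 10% tolerance is assumed to match the upstream default. This is a simplified model, not the controller's actual implementation.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Simplified model of the replica calculation (hypothetical helper).

    desired = ceil(current_replicas * current_metric / target_metric),
    clamped to [min_replicas, max_replicas].
    """
    ratio = current_metric / target_metric
    # Skip scaling when the metric is already close enough to the target
    # (assumed tolerance of 10%).
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Never scale outside the configured bounds.
    return max(min_replicas, min(max_replicas, desired))


# Example: 4 replicas averaging 90% CPU against a 60% target.
print(desired_replicas(4, 90, 60))
```

In this model, a reading above the target increases the replica count and a reading below it decreases the count, which is why a working source of resource metrics is a prerequisite.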