Resource metrics pipeline

    The HorizontalPodAutoscaler (HPA) and (VPA) use data from the metrics API to adjust workload replicas and resources to meet customer demand.

    You can also view the resource metrics using the kubectl top command.

    Note: The Metrics API, and the metrics pipeline that it enables, only offers the minimum CPU and memory metrics to enable automatic scaling using HPA and / or VPA. If you would like to provide a more complete set of metrics, you can complement the simpler Metrics API by deploying a second that uses the Custom Metrics API.

    Figure 1 illustrates the architecture of the resource metrics pipeline.

    flowchart RL subgraph cluster[Cluster] direction RL S[

    ] A[Metrics-
    Server] subgraph B[Nodes] direction TB D[cAdvisor] —> C[kubelet] E[Container
    runtime] —> D E1[Container
    runtime] —> D P[pod data] -.- C end L[API
    server] W[HPA] C ——>|Summary
    API| A —>|metrics
    API| L —> W end L —-> K[kubectl
    top] classDef box fill:#fff,stroke:#000,stroke-width:1px,color:#000; class W,B,P,K,cluster,D,E,E1 box classDef spacewhite fill:#ffffff,stroke:#fff,stroke-width:0px,color:#000 class S spacewhite classDef k8s fill:#326ce5,stroke:#fff,stroke-width:1px,color:#fff; class A,L,C k8s

    JavaScript must be enabled to view this content

    Figure 1. Resource Metrics Pipeline

    The architecture components, from right to left in the figure, consist of the following:

    • : Daemon for collecting, aggregating and exposing container metrics included in Kubelet.

    • kubelet: Node agent for managing container resources. Resource metrics are accessible using the and /stats kubelet API endpoints.

    • : API provided by the kubelet for discovering and retrieving per-node summarized stats available through the /stats endpoint.

    • metrics-server: Cluster addon component that collects and aggregates resource metrics pulled from each kubelet. The API server serves Metrics API for use by HPA, VPA, and by the kubectl top command. Metrics Server is a reference implementation of the Metrics API.

    • : Kubernetes API supporting access to CPU and memory used for workload autoscaling. To make this work in your cluster, you need an API extension server that provides the Metrics API.

    FEATURE STATE: Kubernetes 1.8 [beta]

    The metrics-server implements the Metrics API. This API allows you to access CPU and memory usage for the nodes and pods in your cluster. Its primary role is to feed resource usage metrics to K8s autoscaler components.

    Here is an example of the Metrics API request for a minikube node piped through jq for easier reading:

    Here is the same API call using curl:

    1. curl http://localhost:8080/apis/metrics.k8s.io/v1beta1/nodes/minikube

    Sample response:

    Here is an example of the Metrics API request for a kube-scheduler-minikube pod contained in the kube-system namespace and piped through jq for easier reading:

    1. kubectl get --raw "/apis/metrics.k8s.io/v1beta1/namespaces/kube-system/pods/kube-scheduler-minikube" | jq '.'

    Here is the same API call using curl:

    Sample response:

    1. "kind": "PodMetrics",
    2. "apiVersion": "metrics.k8s.io/v1beta1",
    3. "metadata": {
    4. "name": "kube-scheduler-minikube",
    5. "namespace": "kube-system",
    6. "selfLink": "/apis/metrics.k8s.io/v1beta1/namespaces/kube-system/pods/kube-scheduler-minikube",
    7. "creationTimestamp": "2022-01-27T19:25:00Z"
    8. },
    9. "timestamp": "2022-01-27T19:24:31Z",
    10. "window": "30s",
    11. "containers": [
    12. {
    13. "name": "kube-scheduler",
    14. "cpu": "9559630n",
    15. "memory": "22244Ki"
    16. }
    17. }
    18. ]
    19. }

    The Metrics API is defined in the k8s.io/metrics repository. You must enable the and register an APIService for the metrics.k8s.io API.

    To learn more about the Metrics API, see , the metrics-server repository and the .

    Note: You must deploy the metrics-server or alternative adapter that serves the Metrics API to be able to access it.

    CPU is reported as the average core usage measured in cpu units. One cpu, in Kubernetes, is equivalent to 1 vCPU/Core for cloud providers, and 1 hyper-thread on bare-metal Intel processors.

    This value is derived by taking a rate over a cumulative CPU counter provided by the kernel (in both Linux and Windows kernels). The time window used to calculate CPU is shown under window field in Metrics API.

    To learn more about how Kubernetes allocates and measures CPU resources, see meaning of CPU.

    In an ideal world, the “working set” is the amount of memory in-use that cannot be freed under memory pressure. However, calculation of the working set varies by host OS, and generally makes heavy use of heuristics to produce an estimate.

    The Kubernetes model for a container’s working set expects that the container runtime counts anonymous memory associated with the container in question. The working set metric typically also includes some cached (file-backed) memory, because the host OS cannot always reclaim pages.

    To learn more about how Kubernetes allocates and measures memory resources, see .

    The metrics-server fetches resource metrics from the kubelets and exposes them in the Kubernetes API server through the Metrics API for use by the HPA and VPA. You can also view these metrics using the kubectl top command.

    The metrics-server uses the Kubernetes API to track nodes and pods in your cluster. The metrics-server queries each node over HTTP to fetch metrics. The metrics-server also builds an internal view of pod metadata, and keeps a cache of pod health. That cached pod health information is available via the extension API that the metrics-server makes available.

    For example with an HPA query, the metrics-server needs to identify which pods fulfill the label selectors in the deployment.

    The metrics-server calls the kubelet API to collect metrics from each node. Depending on the metrics-server version it uses:

    • Metrics resource endpoint /metrics/resource in version v0.6.0+ or
    • Summary API endpoint /stats/summary in older versions

    To learn more about the metrics-server, see the .

    You can also check out the following:

    The gathers stats at the node, volume, pod and container level, and emits this information in the Summary API for consumers to read.

    Here is an example of a Summary API request for a minikube node:

    Here is the same API call using curl:

      Note: The summary API endpoint will be replaced by the /metrics/resource endpoint beginning with metrics-server 0.6.x.

      Last modified April 29, 2022 at 6:43 PM PST: