How do I set up Kubernetes Metrics Server and Horizontal Pod Autoscaler on Amazon EKS?


I want to set up Kubernetes Metrics Server and Horizontal Pod Autoscaler (HPA) on Amazon Elastic Kubernetes Service (Amazon EKS).

Resolution

Set up your environment

Create an Amazon Elastic Compute Cloud (Amazon EC2) instance.
Install kubectl on your EC2 instance.
Install and configure the latest version of the AWS Command Line Interface (AWS CLI) on your EC2 instance.
Configure your kubeconfig file to point to the Amazon EKS cluster.
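
The kubeconfig step is commonly done with the AWS CLI `update-kubeconfig` subcommand. The following is a minimal sketch, not a required procedure. The function name `configure_kubeconfig` is hypothetical, and the cluster name and AWS Region are values that you supply.

```shell
#!/usr/bin/env bash
# Sketch: point kubectl at an Amazon EKS cluster.
# Arguments are supplied by the caller (placeholders, not values from this article):
#   $1 = cluster name, $2 = AWS Region
configure_kubeconfig() {
  local cluster_name="$1"
  local region="$2"

  if [ -z "$cluster_name" ] || [ -z "$region" ]; then
    echo "usage: configure_kubeconfig <cluster-name> <region>" >&2
    return 1
  fi

  # Adds or updates the cluster entry in the kubeconfig file.
  aws eks update-kubeconfig --name "$cluster_name" --region "$region" || return 1

  # Quick connectivity check: lists services in the default namespace.
  kubectl get svc
}
```

For example, `configure_kubeconfig my-cluster us-east-1`, where `my-cluster` and `us-east-1` are placeholder values.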

Create a Kubernetes Metrics Server

1. To install Metrics Server, run the following command:

kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml

2. To confirm that Metrics Server is running, run the following command:

kubectl get pods -n kube-system -l k8s-app=metrics-server

The output is similar to the following:

NAME                              READY   STATUS    RESTARTS   AGE
metrics-server-85cc795fbf-79d72   1/1     Running   0          22s
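
A pod in the Running state doesn't by itself show that metrics are being served. As a hedged sketch, the check above can be extended to wait for the rollout and then query the metrics API. The function name `wait_for_metrics_server` and the default timeout of 120 seconds are assumptions for illustration. The deployment name `metrics-server` in the `kube-system` namespace is what the components.yaml manifest installs.

```shell
#!/usr/bin/env bash
# Sketch: wait for Metrics Server to become ready, then confirm that
# the resource metrics API responds.
#   $1 = rollout timeout (optional; the 120s default is an assumption)
wait_for_metrics_server() {
  local timeout="${1:-120s}"

  # Blocks until the deployment finishes rolling out or the timeout expires.
  kubectl rollout status deployment/metrics-server -n kube-system --timeout="$timeout" || return 1

  # Shows whether the metrics API is registered with the API server.
  kubectl get apiservice v1beta1.metrics.k8s.io || return 1

  # Returns node CPU and memory usage once the first metrics are collected.
  kubectl top nodes
}
```

If `kubectl top nodes` reports that metrics aren't available yet, it might help to wait a short time and run the function again, because Metrics Server needs to collect its first samples.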

Create a php-apache deployment and a service

1. To create a php-apache deployment, run the following command:

kubectl create deployment php-apache --image=registry.k8s.io/hpa-example

2. To set the CPU requests, run the following command:

kubectl patch deployment php-apache -p='{"spec":{"template":{"spec":{"containers":[{"name":"hpa-example","resources":{"requests":{"cpu":"200m"}}}]}}}}'

Important: The HPA calculates CPU utilization as a percentage of the container's CPU request. If you don't set a CPU request, then the CPU utilization metric for the pod isn't defined and the HPA can't scale.
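
To make the relationship between the CPU request and the utilization metric concrete, here is a small arithmetic sketch. The function name `cpu_utilization_percent` and the usage figures are hypothetical. Only the 200m request comes from the patch command in step 2.

```shell
#!/usr/bin/env bash
# Sketch: CPU utilization expressed as a percentage of the CPU request.
# Both arguments are in millicores (for example, 200 for "200m").
#   $1 = observed CPU usage, $2 = CPU request
cpu_utilization_percent() {
  local usage_m="${1:-0}"
  local request_m="${2:-0}"

  # Without a request there is nothing to divide by, so utilization is undefined.
  if [ "$request_m" -le 0 ]; then
    echo "error: CPU request must be set to a positive value" >&2
    return 1
  fi

  # Integer arithmetic: the result is truncated to a whole percent.
  echo $(( $usage_m * 100 / $request_m ))
}

cpu_utilization_percent 100 200   # 100m used of a 200m request: prints 50
cpu_utilization_percent 300 200   # usage can exceed the request: prints 150
```

With a 200m request, the 50 percent target used later in this article corresponds to about 100m of CPU usage per pod.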

3. To expose the deployment as a service, run the following command:

kubectl create service clusterip php-apache --tcp=80

4. To create an HPA, run the following command:

kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10

5. To confirm that the HPA was created, run the following command:

kubectl get hpa

6. To create a pod to connect to the deployment that you created earlier, run the following command:

kubectl run -i --tty load-generator --image=busybox /bin/sh

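7. To generate load on the php-apache service, run the following loop from the shell of the load generator pod. The pod must run in the same namespace as the deployment that you created in step 1: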

while true; do wget -q -O- http://php-apache; done

Note: To stop the while loop, press CTRL + C. Then, to exit the tty session of the load generator pod, press CTRL + D.

8. To see how the HPA scales the pod based on CPU utilization metrics, run the following command (preferably from another terminal window):

kubectl get hpa -w

The Metrics Server is now up and running, and you can use it to get resource-based metrics.
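
The replica counts that `kubectl get hpa -w` reports follow the scaling rule described in the Kubernetes documentation: desiredReplicas = ceil(currentReplicas * currentUtilization / targetUtilization), bounded by the minimum and maximum. The following sketch reproduces that arithmetic only. The function name `desired_replicas` and the sample utilization values are hypothetical, and the sketch leaves out the tolerance band and stabilization behavior that the real controller applies.

```shell
#!/usr/bin/env bash
# Sketch of the HPA scaling arithmetic (simplified; no tolerance or stabilization).
#   $1 = current replicas
#   $2 = current average CPU utilization (percent of request)
#   $3 = target utilization (percent), for example 50 for --cpu-percent=50
#   $4 = minimum replicas, $5 = maximum replicas
desired_replicas() {
  local current="$1"
  local utilization="$2"
  local target="$3"
  local min="$4"
  local max="$5"

  # Integer ceiling division: ceil(a / b) == (a + b - 1) / b for positive b.
  local desired=$(( ($current * $utilization + $target - 1) / $target ))

  # Clamp the result to the configured bounds.
  if [ "$desired" -lt "$min" ]; then desired="$min"; fi
  if [ "$desired" -gt "$max" ]; then desired="$max"; fi

  echo "$desired"
}

# Using the HPA settings from step 4 (target 50, min 1, max 10):
desired_replicas 1 250 50 1 10   # one pod at 250 percent: prints 5
desired_replicas 4 200 50 1 10   # 16 wanted, capped at the maximum: prints 10
desired_replicas 5 10 50 1 10    # load has stopped: prints 1
```

In practice, scale-down is slower than this arithmetic suggests, because the controller waits for a stabilization period before it removes pods.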

9. To clean up the resources used for testing the HPA, run the following commands:

kubectl delete hpa,service,deployment php-apache
kubectl delete pod load-generator

Related information

Horizontal Pod Autoscaling (on the Kubernetes website)
