Autoscaling in GCP

Introduction to Autoscaling

The autoscaling feature of managed instance groups (MIGs) lets you automatically add or remove virtual machine (VM) instances from a MIG in response to changes in load. This blog on Autoscaling in GCP explains how autoscaling helps your apps handle increases in load smoothly and lowers cost when fewer resources are needed. You define an autoscaling policy, and the autoscaler scales the group automatically based on the measured load and the parameters you set.

Specification

Only managed instance groups (MIGs) can be used with Autoscaling. Support is not provided for unmanaged instance groups.
While autoscaling is enabled, you cannot create instances with specific names. You can, however, enable autoscaling after virtual machines (VMs) with specific names have been created.
An autoscaler can make scaling decisions based on multiple signals, but it can handle only one signal per metric type. The exception is Cloud Monitoring metrics, where an autoscaler can handle up to five signals. The autoscaler calculates the recommended number of VMs for each signal and then scales to the signal that calls for the largest number of VMs.
Autohealing and autoscaling operate independently. If autohealing is enabled for your group and an instance fails its health check, the managed instance group attempts to recreate that instance. While the instance is being recreated, the number of running instances in the group can be lower than the minimum configured for the group.

Fundamentals

Managed instance groups
Autoscaling is a feature of managed instance groups (MIGs). A managed instance group is a collection of virtual machine (VM) instances created from a common instance template. An autoscaler adds or removes instances from a managed instance group based on the group’s autoscaling policy. Although Compute Engine offers both managed and unmanaged instance groups, autoscalers work only with managed instance groups.

Autoscaling policy
When you define an autoscaling policy for your group, you specify one or more signals that the autoscaler uses to scale the group. When a policy contains multiple signals, the autoscaler calculates the recommended number of virtual machines (VMs) for each signal and sets the recommended size of your group to the largest of those numbers.
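The multi-signal rule can be illustrated with a minimal Python sketch. The function name and the signal names in the example are hypothetical and exist only for illustration; the real autoscaler computes the per-signal sizes internally.

```python
from typing import Mapping


def recommended_group_size(per_signal_sizes: Mapping[str, int]) -> int:
    """Pick the group size from per-signal recommendations.

    Mirrors the rule described above: with several signals in one
    policy, the largest per-signal recommendation wins.
    """
    if not per_signal_sizes:
        raise ValueError("an autoscaling policy needs at least one signal")
    return max(per_signal_sizes.values())


# Hypothetical per-signal recommendations for one evaluation cycle.
sizes = {"cpu_utilization": 4, "lb_serving_capacity": 6, "monitoring_metric": 3}
print(recommended_group_size(sizes))  # prints 6
```

Taking the maximum means the group always has enough capacity for its most demanding signal, at the price of being over-provisioned for the others.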

Target utilization metrics
An autoscaling policy can be based on one or more of the following metrics, which reflect the load on the instance group:

Average CPU utilization
HTTP load balancing serving capacity
Cloud Monitoring metrics

The autoscaler continuously collects usage data for the chosen utilization metric. It compares actual utilization with your target utilization and uses the result to decide whether the group should remove instances (scale in) or add instances (scale out).
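As a rough illustration of that comparison, the sketch below uses a simplified proportional model: the size needed to bring average utilization back to the target, clamped to the policy’s minimum and maximum. This is an assumption made for illustration, not the autoscaler’s published algorithm, and the function name and parameters are hypothetical.

```python
def target_size(current_size: int, avg_utilization_pct: int,
                target_utilization_pct: int, min_size: int, max_size: int) -> int:
    """Estimate the group size that brings average utilization to the target.

    Utilization values are whole percentages (0-100) so the arithmetic
    stays in integers and avoids floating-point rounding surprises.
    """
    if target_utilization_pct <= 0:
        raise ValueError("target utilization must be positive")
    if min_size > max_size:
        raise ValueError("min_size must not exceed max_size")
    total_load = current_size * avg_utilization_pct
    # Ceiling division: round up so the group is never under-provisioned.
    needed = -(-total_load // target_utilization_pct)
    # Clamp to the limits set in the autoscaling policy.
    return max(min_size, min(max_size, needed))


print(target_size(2, 90, 60, 1, 10))  # scale out: prints 3
print(target_size(4, 30, 60, 1, 10))  # scale in: prints 2
```

In this model, two VMs at 90% average CPU with a 60% target need three VMs, while four VMs at 30% need only two.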

Pricing
There is no additional charge for configuring an autoscaling policy. Because the autoscaler adds and removes virtual machine instances dynamically, you pay only for the resources that your MIG actually uses. You can control resource costs by setting the minimum and maximum number of instances in the autoscaling policy.

Automatic repair and autohealing
Managed instance groups maintain high application availability by proactively keeping your instances available. A MIG automatically repairs failed instances by recreating them.

You might also want to repair instances whose application crashes, freezes, or runs out of memory. Application-based autohealing improves application availability by relying on a health-checking signal that detects application-specific problems such as freezing, crashing, or overloading. If a health check determines that an application has failed on a virtual machine instance, the group automatically recreates that instance.
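The health-check decision can be sketched as follows, assuming an instance counts as unhealthy only after a number of consecutive failed probes (health checks expose this as an unhealthy threshold). The function name and the threshold value of 2 are illustrative assumptions.

```python
from typing import Sequence


def needs_recreation(probe_results: Sequence[bool], unhealthy_threshold: int = 2) -> bool:
    """Return True once `unhealthy_threshold` consecutive probes have failed.

    `probe_results` lists probe outcomes in order, True meaning the
    application answered the health check successfully.
    """
    consecutive_failures = 0
    for healthy in probe_results:
        # A successful probe resets the failure streak.
        consecutive_failures = 0 if healthy else consecutive_failures + 1
        if consecutive_failures >= unhealthy_threshold:
            return True
    return False


print(needs_recreation([True, False, True, False]))  # prints False
print(needs_recreation([True, False, False]))        # prints True
```

Requiring consecutive failures keeps a single slow response from triggering an unnecessary recreation.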

Unmanaged instance groups
Unmanaged instance groups can contain heterogeneous instances that you add and remove at will. They do not offer instance templates, autoscaling, autohealing, rolling update support, or multi-zone support, so they are not a good fit for deploying highly available and scalable workloads. Use unmanaged instance groups if you need to apply load balancing to groups of heterogeneous instances or if you need to manage the instances yourself.

Steps to perform Autoscaling

Task 1: Create Instance Template

Log in to the console and first create an instance template, which is required when creating the instance group.
Go to the menu and search for Compute Engine. Under Compute Engine, select Instance templates. Then click Create Instance Template.

Enter a name for the template, then select the following:

Location – Global
Machine configuration – N1
Firewall – Allow HTTP traffic and Allow HTTPS traffic

Keep the rest as it is.

Then click Create.
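For readers who prefer the command line, the console steps above correspond roughly to a gcloud command. The Python sketch below only assembles and prints that command; it does not run it. The template name and the n1-standard-1 machine type are assumptions. The http-server and https-server network tags only take effect if matching firewall rules already exist in the network.

```python
import shlex
from typing import List


def instance_template_command(name: str, machine_type: str = "n1-standard-1") -> List[str]:
    """Build a gcloud command that creates an instance template."""
    return [
        "gcloud", "compute", "instance-templates", "create", name,
        f"--machine-type={machine_type}",      # assumed N1 machine type
        "--tags=http-server,https-server",     # network tags used by HTTP/HTTPS firewall rules
    ]


# "autoscale-template" is a hypothetical name chosen for this example.
print(shlex.join(instance_template_command("autoscale-template")))
```

Printing the command first makes it easy to review before pasting it into Cloud Shell.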

Task 2: Create an Instance Group

Go to the menu and search for Compute Engine. Under Compute Engine, select Instance groups.

Click Create instance group.

Fill in the following details:

Name – instancegroup
Instance template – Select the instance template created in Task 1.
Location – Single zone, then select the region and zone
Autoscaling – On: add and remove instances to the group
Minimum number of instances – 1
Maximum number of instances – 3

Keep the rest as it is.
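The same group settings can be expressed as two gcloud commands, one to create the managed instance group and one to attach the autoscaling policy. The sketch below assembles them in Python without running them. The zone us-central1-a, the resource names, and the 60% CPU target are assumptions. The helper also rejects a policy whose minimum exceeds its maximum.

```python
import shlex
from typing import List


def mig_commands(group: str, template: str, zone: str, min_size: int = 1,
                 max_size: int = 3, target_cpu: float = 0.6) -> List[List[str]]:
    """Build gcloud commands that create a MIG and enable CPU-based autoscaling."""
    if min_size > max_size:
        raise ValueError("minimum number of instances must not exceed the maximum")
    base = ["gcloud", "compute", "instance-groups", "managed"]
    create = base + ["create", group, f"--template={template}",
                     f"--size={min_size}", f"--zone={zone}"]
    autoscale = base + ["set-autoscaling", group,
                        f"--min-num-replicas={min_size}",
                        f"--max-num-replicas={max_size}",
                        f"--target-cpu-utilization={target_cpu}",
                        f"--zone={zone}"]
    return [create, autoscale]


# Names and zone are hypothetical values for this example.
for command in mig_commands("instancegroup", "autoscale-template", "us-central1-a"):
    print(shlex.join(command))
```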

Now go to the Autohealing section and create a health check as follows.

Task 3: Creating a Health Check

After you click Create a health check, enter a name for the health check and keep the rest as it is. Click Save to create the health check, then select it from the health check dropdown.

After selecting the health check, click Create to create the instance group.
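A command-line counterpart to the health check step might look like the sketch below, which builds one gcloud command to create a health check and one to attach it to an existing group for autohealing. The TCP protocol on port 80, the 300-second initial delay, and all resource names are assumptions. The commands are printed, not executed.

```python
import shlex
from typing import List


def autohealing_commands(health_check: str, group: str, zone: str,
                         port: int = 80, initial_delay_s: int = 300) -> List[List[str]]:
    """Build gcloud commands that create a health check and attach it to a MIG."""
    create_check = ["gcloud", "compute", "health-checks", "create", "tcp",
                    health_check, f"--port={port}"]
    attach = ["gcloud", "compute", "instance-groups", "managed", "update", group,
              f"--health-check={health_check}",
              f"--initial-delay={initial_delay_s}",  # seconds to let a new VM boot before checking
              f"--zone={zone}"]
    return [create_check, attach]


# Hypothetical names and zone for this example.
for command in autohealing_commands("autoscale-hc", "instancegroup", "us-central1-a"):
    print(shlex.join(command))
```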

The instance groups page opens. Wait a short while for the instance group to be created.

Once the instance group is created, one virtual machine is created automatically under Compute Engine. To check, go to the menu, click Compute Engine, and select VM instances.

Now, to check whether autoscaling works, we have to SSH into the instance and increase its CPU utilization.
SSH into the machine, run the command cat /dev/zero > /dev/null & and then press Enter. The trailing & runs the command in the background so that the terminal stays usable.

Once the command is running, CPU utilization increases. To check, run the top command.
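If you would rather generate load that stops by itself, a small Python busy loop is one alternative. This is a sketch, and the function name is hypothetical. It keeps a single CPU core busy for a fixed number of seconds and then exits. The example call uses a short duration; a real test would need several minutes of load for the autoscaler to react.

```python
import time


def burn_cpu(seconds: float) -> int:
    """Spin in a tight loop for roughly `seconds` and return the iteration count."""
    deadline = time.monotonic() + seconds
    iterations = 0
    while time.monotonic() < deadline:
        iterations += 1  # busy work that keeps one core fully occupied
    return iterations


# Short demonstration run; pass a larger value (for example 600) on the VM.
print(burn_cpu(0.1) > 0)
```

Because the loop ends at its deadline, there is no background process to kill afterwards.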

After some time, the number of instances increases to 3 automatically, because the autoscaler adds instances to bring average CPU utilization back toward the target. Where there was only one instance running, there are now three.

Now, let’s remove the load from the instance so that autoscaling reduces the number of instances. In top, press k and then press Enter twice, which stops the process at the top of the list (the cat command). CPU utilization drops, and after some time the autoscaler removes the extra instances, leaving only one instance running.

About CloudThat

CloudThat is an award-winning company and the first in India to offer cloud training and consulting services worldwide. As a Microsoft Solutions Partner, AWS Advanced Tier Training Partner, and Google Cloud Platform Partner, CloudThat has empowered over 850,000 professionals through 600+ cloud certifications winning global recognition for its training excellence including 20 MCT Trainers in Microsoft’s Global Top 100 and an impressive 12 awards in the last 8 years. CloudThat specializes in Cloud Migration, Data Platforms, DevOps, IoT, and cutting-edge technologies like Gen AI & AI/ML. It has delivered over 500 consulting projects for 250+ organizations in 30+ countries as it continues to empower professionals and enterprises to thrive in the digital-first world.