Introduction to Load Balancing

Codecademy Team
Learn about load balancers and how resource load is distributed using various load balancing algorithms.

What is a Load Balancer?

When dealing with the scalability of our software systems, we may often come across the challenge of dealing with an influx of requests. For example, imagine running a popular web app that sells dog apparel. We just released a new product (a fashionable dog sweater) and have noticed a significant increase of requests hitting our single server. Users really want to purchase our new dog sweater for their pups! Right now, our web app architecture looks something like this:

A representation of a basic web app architecture, consisting of Users, a Webpage, a Server, and a Database. Users are able to interact with the Webpage via the Internet. Based on the User's activity, the Webpage will send requests to the Server for processing. Depending on the request, the Server will either respond directly to the Webpage or query the Database to fetch additional information or complete a transaction. Once the Database has fully processed all queries, the Server will perform any remaining calculations before finally responding to the Webpage. In this graphic, there is a vertical bar next to the Server that is nearly full and colored red, indicating that the Server is struggling to keep up with the volume of requests being submitted.

In this architecture, our single server is becoming a bottleneck and is putting our application at risk of performing suboptimally. In order to alleviate the load on our single server, we decide to scale our web app horizontally and purchase a few additional servers. Each server will host a replica of our app now we will be able to distribute request load more effectively. However, with multiple servers, we have the following issue:

A representation of a basic web app architecture, consisting of Users, a Webpage, multiple Servers, and a Database. This graphic is mostly similar to the previous graphic, except that there are four Servers instead of one. Between the Webpage and the Servers, there is a box labeled "Which Server" accompanied by several question marks. When scaling an app horizontally, it is necessary to tell the app to which server it should send each request. Continue on to learn how this is done.

We need a way to direct the traffic! Our app won’t know where to send the requests unless we provide guidance on which server to send the request to. This is where a load balancer comes into play.

A load balancer is a piece of hardware or software (and sometimes both) that helps distribute requests between different system resources. Load balancers are not just an essential aspect when scaling a system horizontally; they also help prevent specific system resources from getting overloaded and possibly going offline. In addition, load balancers are flexible enough to be placed in various places in a software systems architecture. In our web app example, since we are primarily trying to distribute the load between our servers, here is what our new web app architecture will look like with a load balancer:

A representation of a basic web app architecture, consisting of Users, a Webpage, a Load Balancer, multiple Servers, and a Database. This graphic is mostly similar to the previous graphic, except that the box labeled "Which Server" has been replaced with a Load Balancer, which is capable of communicating with both the Webpage and the Servers. When a new request is produced from the Webpage, it is now routed through the Load Balancer first. The Load Balancer uses an algorithm to determine which Server should receive the request. This prevents any single Server from having to handle all the requests produced by the application.

When we examine the above image, the way requests route to individual servers for our web app may seem a bit like magic. How exactly is the load balancer deciding which server is best fit to handle the incoming request? How does the load balancer make sure one server doesn’t end up taking all the requests by accident? These questions are all decided based on the load balancing algorithm that the load balancer uses. Let’s explore what these algorithms are and how they work!

Before we move on, try this question: