A Lightweight Load Balancer Powered By Docker And Nginx - A Christmas & New Year Build
How to implement an effective lightweight load balancer with Docker and Nginx
PS: The title would be a bit too cumbersome if it listed all the functionality the build supports. Also, the pronoun "we" means me and the CTO.
It was December last year, and the clock was ticking. Another delivery had to be fulfilled. If you are wondering what it was... it was a load balancer / API gateway / reverse proxy waiting to be cooked 🔥.
A load balancer, as the name suggests, is a combination of two keywords: load and balancer. The load means requests, while the balancer means an effective distributor. A load balancer is simply any system that distributes load - in this scenario, a request made by the client (via a browser or an application on a phone) - in an effective way to another system, typically the backend server handling or fulfilling the request.
Nah, that was too basic, right? Let's crank it up a notch technically.
A load balancer is a device that sits right between a user (client) and a server group (a set of servers) and acts as an invisible facilitator (according to AWS's definition), ensuring the servers in the group are used evenly. In essence, load balancing is the process of distributing a set of tasks or requests over a set of resources (computing units) to make their overall processing more efficient and to optimise response time, avoiding a situation where some compute nodes are overloaded while others sit idle.
Back to the Christmas build conversation: we needed to integrate a new third-party provider. Normally we allow direct integration, but in this case things were different. The third party used a different platform that required direct registration of our service's IP address, which could not be changed without some setbacks. This led to a conversation with my CTO about how to leverage our deployment platform and avoid the unnecessary cost of running AWS load-balancing instances; in summary, avoid complexity and KISS (keep it simple), lol.
To do this, we highlighted a handful of requirements:
Running a load balancer at a reduced cost
Opting in for auto-scaling
Allowing request routing to necessary services
An API gateway and proxy for all access to our platform
An extensible application that can take advantage of features Nginx offers for free
If you are wondering why we didn't use an out-of-the-box solution: we didn't because we needed to avoid bleeding cash just by running an overly complicated service for our use case.
Christmas morning, I had just finished going through the Nginx documentation and checked out Hussein's course on Nginx. But since I don't really understand a system until I can figure out why its internals work the way they do, I took my time to study the internals of Nginx briefly. Let's take a quick look together so we can get a better understanding of the conf files later.
Nginx - a Proxy / API gateway / Load-balancer
Nginx is a web server capable of serving web content, and it can also act as a reverse proxy, a load balancer for backend routing, a caching layer, and an API gateway.
An API gateway is a server that acts as an API front end: it receives API requests, enforces throttling and security policies, passes requests to the back-end service, and then passes the response back to the requester. It acts as an entry point for one or many micro-services, providing a unified interface to clients. A proxy / API gateway / load balancer should not block; it should be as efficient as possible so that the backend servers behind it can scale independently, albeit at additional cost.
NGINX has a concept of a front end (client side) and a back end (server side). It supports both layer 4 (transport) and layer 7 (application) of the OSI model. NGINX spins up a worker process per CPU core by default: when the NGINX reverse proxy starts, it creates one worker per CPU core, and these workers do the heavy lifting. The number of workers is configurable, but NGINX recommends one per CPU core to avoid context switching and cache thrashing.
Since this article is more about how to use Docker and Nginx to build a full-fledged proxy, API gateway and load balancer, you can check out this link for more info on Nginx. But before that, let's have a quick look at what Docker is.
Docker - a Containerizer
Docker is a powerful tool that allows you to create, deploy, and run applications in containers. These containers are lightweight, standalone, and executable packages that include everything needed to run a piece of software: the code, runtime, libraries, and dependencies. Essentially, Docker containers act like mini virtual machines but are much more efficient and faster.
Imagine you're setting up a big application that includes a user interface (UI), a database, and several microservices, each of which requires different programs and dependencies (like Java, Node.js, or Python). Without Docker, you'd need to manually install and configure all these components on a new server, which could take days and be prone to errors.
With Docker, you can package each component of your application into a separate container, complete with all its dependencies. You then simply deploy these containers to the server. The setup is automated and takes minutes instead of days, and you can be confident that everything will work correctly. The biggest advantage of Docker is the ability to have the same setup everywhere; you can also bootstrap the whole infrastructure very quickly, and containers build fast.
You can check this link for more info on Docker.
I figured explaining what NGINX and Docker stand for would give you an insight into what we eventually built. Anyway, let's get into it.
XCRUX
xcrux provides Nginx configurations for setting up an API gateway, load balancer and reverse proxy. It was designed to help us efficiently manage, secure and optimise API requests in our backend application while keeping all our requirements in check. We opted to support the following features:
API Gateway: Direct incoming API requests to appropriate backend services.
Load Balancer: Distribute traffic across multiple backend servers for improved performance and reliability.
Reverse Proxy: Handle requests on behalf of backend servers, providing additional features such as SSL termination.
One key thing we strived for was a project file organisation that caters for extensibility (see the sketch after this list). We divided the project into the following:
client configuration
location groups configuration - for each backend server upstream
static file definitions
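To give a rough picture, the layout looked something along these lines (the folder names here are illustrative, not the exact repository structure):

```
xcrux/
├── nginx.conf      # main configuration: global, events and http directives
├── client/         # global client configuration shared by all servers
├── locations/      # one configuration file per backend upstream / location group
└── static/         # health and error pages served directly by Nginx
```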
By default, Nginx comes with its own configuration file, which is loaded on startup. This configuration sets up how the master and worker processes of Nginx operate in terms of load balancing, request forwarding and so on.
Using simple examples, I will give an explanatory walk-through of how this works with a couple of Xcrux-style snippets (not the exact configuration, just the syntax).
One thing about Nginx is that it runs a master-worker process model. However, it allows developers to specify how the processes should function: in the configuration, one can specify the number of workers to run to handle requests. Lest I forget, the Nginx configuration is divided into a bunch of directives; the one we are currently in is called the global context, and in it an error log is configured to help when debugging request and response logs.
The next block is the events directive, which specifies things like the queue mechanism to use depending on the operating system; you can also specify the number of incoming connections each worker can handle. All connection configurations are set here (both contexts are shown in the sketch below).
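To make that concrete, here is a minimal sketch of the global and events contexts (the values are illustrative, not Xcrux's exact settings):

```nginx
# Global (main) context: governs the master/worker setup
worker_processes auto;                       # one worker per CPU core, as Nginx recommends
error_log /var/log/nginx/error.log warn;     # error log used when debugging requests/responses

# Events context: connection handling for every worker
events {
    worker_connections 1024;                 # how many connections each worker may handle
    # use epoll;                             # queue mechanism; Nginx picks the best one for the OS
}
```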
Directives act as "containers" that group together related directives, enclosing them in curly braces ({}); these are often referred to as blocks.
Read more on directives here
To specify the protocol we need, Nginx gives us the http directive, which lets us set up all HTTP-related configuration that will affect all virtual servers. For example, if you need to install SSL certificates or specify a security protocol to support HTTPS, you can do that here.
The http-level configuration (sketched after this list) specifies the following:
How logging should be handled in the server
The port configuration for the servers and when to time out each request
Rate limiting specification for each request.
Proxy configuration, which is the reason the server can now act as a reverse proxy too. Each command, e.g. proxy_connect_timeout, is a module directive for the server block that specifies when a proxied request should time out.
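Here is an illustrative http block along those lines; the zone name, paths and values are placeholders rather than the exact Xcrux configuration:

```nginx
http {
    # Logging
    log_format main '$remote_addr - $request [$status] $upstream_addr';
    access_log /var/log/nginx/access.log main;

    # Rate limiting: one shared zone, keyed by client IP
    limit_req_zone $binary_remote_addr zone=api_limit:10m rate=10r/s;

    # Reverse-proxy timeouts shared by all virtual servers
    proxy_connect_timeout 10s;
    proxy_send_timeout    30s;
    proxy_read_timeout    30s;

    # Global client configuration pulled in from a separate file
    include /etc/nginx/conf.d/client.conf;

    server {
        listen 80;                                  # port configuration
        # listen 443 ssl;                           # SSL termination would be configured here
        keepalive_timeout 65s;                      # when to time out idle connections
        limit_req zone=api_limit burst=20 nodelay;  # apply the rate limit

        # Per-location configuration files (one per backend upstream)
        include /etc/nginx/locations/*.conf;
    }
}
```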
The exact configuration depends on your use case; one great way to find out what works is to try out each directive in small steps, learn about them individually, and then see how they affect a block directive when included.
You might have noticed a couple of include directives above; that is how we ensure extensibility, as we can now separate out how we handle request loads over HTTP. One thing to note is that Nginx also supports the UDP protocol (via its stream module, not the http block), but we did not need that for our use case.
To track whether our server is up and running, and since Nginx allows us to serve static content, we included a couple of pages to keep tabs on the server's health, plus error pages in case of an invalid request.
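Something along these lines, placed inside the server block (the paths and file names are assumptions):

```nginx
# Lightweight health endpoint served straight from Nginx
location = /health {
    default_type text/plain;
    return 200 'OK';
}

# Static error pages for invalid requests and upstream failures
error_page 404 /404.html;
error_page 500 502 503 504 /50x.html;
location = /50x.html {
    root /usr/share/nginx/html;
}
```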
Now back to the include directive, which allows us to split configuration into separate files. We grouped configuration into a global client folder and per-location server folders, so each location server we run gets its own configuration.
To handle each HTTP server upstream, which is where all requests are proxied to, we separated each location for our server and handled its configuration individually. For example, the Nginx location block sketched below is designed to handle requests that start with /location-path-prefix/ and proxy them to a group of backend servers (denoted as http://example-servers).
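Here is an illustrative version of that block; the directive values are representative of Xcrux's syntax, not the exact production values:

```nginx
location /location-path-prefix/ {
    # Preserve details of the original client request
    proxy_set_header Host $host;
    proxy_set_header X-Original-URI $request_uri;
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;

    # WebSocket support
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";

    # Custom header emptied before the request reaches the backend
    proxy_set_header X-Proxy-Server '';

    # Forward to the upstream group
    proxy_pass http://example-servers;

    # Error handling and retries
    proxy_next_upstream error timeout http_504 non_idempotent;
    proxy_next_upstream_tries 3;
    proxy_next_upstream_timeout 15s;

    # Client request limits
    client_max_body_size 2m;

    # Disable caching
    proxy_no_cache 1;
    proxy_cache_bypass 1;
}
```

Here's a summary of what each directive does: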
Proxy Headers:
proxy_set_header Host $host; - Sets the Host header in the proxied request to the original host from the client request.
proxy_set_header X-Original-URI $request_uri; - Adds the original request URI to a header, which can be useful for the backend to know the initial request path.
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for; - Adds the client's IP address to the X-Forwarded-For header, which helps the backend know the client's original IP.
proxy_set_header Upgrade $http_upgrade; and proxy_set_header Connection "upgrade"; - These headers support WebSocket connections by handling the Upgrade request header and setting the Connection header to "upgrade".
proxy_set_header X-Proxy-Server ''; - Sets an empty X-Proxy-Server header, which might be used for custom logging or debugging purposes on the backend.
Proxy Pass:
proxy_pass http://example-servers; - Forwards the request to the backend servers identified by http://example-servers.
Error Handling and Retries:
proxy_next_upstream error timeout http_504 non_idempotent; - Specifies the conditions (error, timeout, HTTP 504, non-idempotent) under which the request should be retried with the next server in the upstream group.
proxy_next_upstream_tries 3; - Limits the number of retry attempts to 3.
proxy_next_upstream_timeout 15s; - Limits the total time allowed for passing a request on to the next servers to 15 seconds.
Client Request Limits:
client_max_body_size 2m; - Limits the maximum size of the client request body to 2 megabytes.
Caching:
proxy_no_cache 1; - Disables caching of the response.
proxy_cache_bypass 1; - Ensures that caching is bypassed for every request.
In essence, this configuration sets up a proxy with specific headers, error handling, retry mechanisms, request size limits, and caching behaviour tailored to ensure that requests are efficiently handled and that any potential issues are retried within set limits.
We then plug the configuration for a location server (above) into the earlier file using the include directive, bringing it into the global scope.
However, to complete this setup we need to specify how we will handle the request load, that is, which algorithm we will use. The configuration below allowed us to specify just that using the upstream directive, which lets us define a group of servers to load-balance requests across. We also didn't have to specify the algorithm directly, since Nginx uses round-robin by default.
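A minimal sketch of that upstream group (the server addresses are placeholders); since no algorithm is named, Nginx falls back to round-robin:

```nginx
# The group referenced by proxy_pass http://example-servers
upstream example-servers {
    # Round-robin by default; directives like least_conn or ip_hash would change the algorithm
    server 10.0.0.11:8080;
    server 10.0.0.12:8080;
    server 10.0.0.13:8080 backup;   # only receives traffic if the others are unavailable
}
```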
I don't think the client configuration is that important, so I won't be touching on it. To deploy the whole thing as an application, we employed Docker, and it was quite simple to set up the container using the configuration below.
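Here is a minimal sketch of what that Dockerfile could look like, assuming the file layout described earlier (the names and paths are illustrative):

```dockerfile
# Start from the official Nginx image
FROM nginx:1.25-alpine

# Drop in the Xcrux configuration and static pages
COPY nginx.conf /etc/nginx/nginx.conf
COPY client/    /etc/nginx/conf.d/
COPY locations/ /etc/nginx/locations/
COPY static/    /usr/share/nginx/html/

EXPOSE 80

# Run Nginx in the foreground so the container stays alive
CMD ["nginx", "-g", "daemon off;"]
```

Building and running it is then a one-liner: docker build -t xcrux . followed by docker run -p 80:80 xcrux.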
And boom, we are good to go!
Anyway, if you were wondering why "Christmas & New Year Build" is in the title, that's because I wrote the application from December 25th to January 2nd.
Looking back at our requirements, we achieved everything; all our servers, and even our clients, now go through this service before anything is fulfilled.
I hope you gained some insight into Nginx and its functionality, and how it can help you handle large request loads to your servers.
Resources:
I am Caleb. You can reach me on LinkedIn or follow me on Twitter @Soundboax.