Reviewing the Load-Balancing Process

Cloud-based solutions should scale on demand. Meaning, if an application’s user demand reaches a specific threshold, one or more servers should be added dynamically to support the application. Likewise, when the demand decreases, the application should scale down its resource use. When an application uses multiple servers, one server, as shown in FIGURE 19-1, must perform the task of load balancing.

An illustration shows load balancing server and server 01, server 02, and server 03 placed in a cloud. Request from a user is received by the load balancing server and distributes it to server 01.

FIGURE 19-1 The load-balancing server distributes workload across an application’s server resources.

The load-balancing server receives client requests and distributes each request to one of the available servers. To determine which server gets the request, the load balancer may use a round-robin technique, a random algorithm, or a more complex technique based upon each server’s capacity and current workload. For an application to fully exploit load balancing, the application developers must design the application for scaling.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset