Cloud computing is a modern paradigm for providing services over the internet. Its development has largely removed the need for the manpower that was mainly devoted to managing resources. Within cloud computing, load balancing is vital: it deals with the distribution of workloads and computing resources. Load balancing allows a company to meet changing demand by allocating resources across multiple servers or networks. It improves quality of service (QoS) metrics, including cost, response time, performance, throughput, and resource utilization. In this chapter, the authors survey the literature on load-balancing algorithms in heterogeneous cluster cloud environments, along with a classification of these algorithms. They review each category in turn, and they offer insight into open issues and guidance for future research.