Energy efficiency constitutes a pivotal performance indicator for 5G New Radio (NR) networks and beyond, and achieving optimal efficiency necessitates the meticulous consideration of trade-offs against other performance parameters, including latency, throughput, connection densities, and reliability. Energy efficiency assumes it is of paramount importance for both User Equipment (UE) to achieve battery prologue and base stations to achieve savings in power and operation cost. This paper presents an exhaustive review of power-saving research conducted for 5G and beyond 5G networks in recent years, elucidating the advantages, disadvantages, and key characteristics of each technique. Reinforcement learning, heuristic algorithms, genetic algorithms, Markov Decision Processes, and the hybridization of various standard algorithms inherent to 5G and 5G NR represent a subset of the available solutions that shall undergo scrutiny. In the final chapters, this work identifies key limitations, namely, computational expense, deployment complexity, and scalability constraints, and proposes a future research direction by theoretically exploring online learning, the clustering of the network base station, and hard HO to lower the consumption of networks like 2G or 4G. In lowering carbon emissions and lowering OPEX, these three additional features could help mobile network operators achieve their targets.