Cloud data centers try to achieve two primary but competing goals for latency-sensitive services: meeting latency servicelevel objective (SLO) and reducing energy utilization. In this paper, we perform a joint optimization of both goals, such that given a tail latency constraint and request rate, the number of active servers and their corresponding core frequencies which cause the least energy overhead can be derived. Both the server sleep mode and setup delay are taken into account. Based on the assumption that requests arrive following a Poisson process, we provide a numerical method to compute the optimal number of active servers and a closed-form expression for their corresponding core frequency. We propose how the optimization technique can be used for real, bursty, non-Poisson data center traffic.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.