The proliferation of deep models characterized by an abundance of parameters has catalyzed research enthusiasm in the domain of AI systems. The emergence of novel computational modalities has brought forth numerous fresh challenges within the realm of cloud computing, encompassing aspects such as cost, performance, elasticity, and the intricate tradeoffs entailed therein.