Proceedings of the ACM Web Conference 2022 2022
DOI: 10.1145/3485447.3511979
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Serverless Computing by Harvesting Idle Resources

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…Instead, serverless computing executes tasks with lightweight containers, thus allowing fine-grained resource provisioning with instant function launch/release, which charges users by the amount of resources (e.g., CPU/GPU and memory) only in actual execution (e.g., second). Due to the unique features, serverless computing is particularly appealing for tasks that require elasticity and high concurrency, such as scientific computing (Chard et al 2020;Roy et al 2022) and distributed training (Wang, Niu, and Li 2019; Guo et al 2022;Thorpe et al 2021;Yu et al 2021Yu et al , 2022.…”
Section: Serverless Drl Trainingmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead, serverless computing executes tasks with lightweight containers, thus allowing fine-grained resource provisioning with instant function launch/release, which charges users by the amount of resources (e.g., CPU/GPU and memory) only in actual execution (e.g., second). Due to the unique features, serverless computing is particularly appealing for tasks that require elasticity and high concurrency, such as scientific computing (Chard et al 2020;Roy et al 2022) and distributed training (Wang, Niu, and Li 2019; Guo et al 2022;Thorpe et al 2021;Yu et al 2021Yu et al , 2022.…”
Section: Serverless Drl Trainingmentioning
confidence: 99%
“…Unlike physical clusters and traditional cloud computing that require tedious configuration, serverless computing packages and executes tasks (e.g., DRL actors and learner) as functions with instant toggling (i.e., sub-second level) and auto-scaling. Thus, serverless computing has been widely deployed to serve computation-intensive applications, such as deep learning (Ali et al 2020;Carreira et al 2019;Wang, Niu, and Li 2019;Yu et al 2021Yu et al , 2022 and scientific computing (Chard et al 2020;Roy et al 2022). Fig.…”
Section: Introductionmentioning
confidence: 99%
“…This is seen as an alternative or used in conjunction with horizontal scaling in order to meet intended targets, in the face of changing traffic levels. An actor critic architecture with Proximal Policy Optimization (PPO) is used in [25] to harvest idle resources from functions and direct them to under-provisioned instances. A Q-Learning based solution is given in [26] to identify the level of concurrency, i.e the number of concurrent requests served per instance, to optimize function latency and system throughput.…”
Section: A Serverless Resource Scalingmentioning
confidence: 99%
“…For example, Zafeiropoulos et al [14] proposed an approach in which autoscaling is assisted with a DQL agent that trains in an environment with continuous state space and discrete action space. Meanwhile, the PPO agent in [42] learns to make resource adjustments per invocation based on the realistic serverless environment. Furthermore, Qiu et al [43] customize the implementation of PPO to fit the multi-agent training approach, while the TD A2C agent in [44] predicts the future idle container window by learning the past invocation patterns of serverless functions to mitigate the cold start problem.…”
Section: Ai-based Techniquesmentioning
confidence: 99%