Cost-optimal Operation of Latency Constrained Serverless Applications: From Theory to Practice.

NOMS(2023)

引用 0|浏览0
暂无评分
摘要
Serverless computing and the function as a service model are new paradigms enabling the fine granular, bottomup construction of cloud-native applications. It can significantly reduce operating costs while shifting the management tasks from developers and application providers towards the cloud operators. But these benefits are provided at the cost of less control over the underlying infrastructure and the application performance, including the end-to-end latency. However, grouping of functions into deployable serverless software artifacts remains still under our control, which has a considerable impact on performance and operation costs. In this paper, we propose fast and efficient algorithms that can partition an application’s functions into separate deployment artifacts in a cost-optimal way while meeting user-defined average end-to-end latency bounds. Moreover, our approach supports the dynamic redesign and reconfiguration of the current deployment setup in response to changes in monitored metrics. Our main contribution is threefold. First, we establish the relevant theoretical models capturing the behavior of the serverless ecosystem and we define the main problem. In addition, the concept of the integrated application management is introduced. Second, we propose novel algorithms providing optimal solutions for different variants of the core problem and the complexity of the methods are analyzed. Third, we demonstrate the applicability and the benefits of our solution by evaluating different deployment scenarios of a realistic use case in Amazon’s public cloud environment.
更多
查看译文
关键词
latency constrained serverless applications,cost-optimal
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要