An Automatic Scaling System for Online Application with Microservices Architecture

Youmei Song, Chaoran Li, Kuoran Zhuang, Tianjiao Ma,Tianyu Wo

2022 IEEE International Conference on Joint Cloud Computing (JCC)(2022)

Cited 1|Views14
No score
Abstract
Auto-scaling is an efficient technique to handle fluctuations of application workloads by acquiring or releasing resources. However, performing auto-scaling in a microservice system for online applications faces critical challenges, including unpredictably massive microservice requests, without fine-granularity performance metrics, and complex dependencies among services. In this paper, we design a cost-efficient autoscaling system, which pinpoints the scaling-needed services as quickly as possible and makes decisions on the right resource amount allocation toward them. Specifically, we first propose a multi-level microservice monitoring mechanism to capture historical and latest service-level performance metrics, and detect the over-provisioning services and under-provisioning services via jointly considering the changes of latency and throughput. For the overload anomalies, a random walk method is further adopted for detecting the root causes based on the dependency topology of microservices. When anomalies are detected, we design a threshold-based method by incorporating the ARIMI method for predicting resource usage status to allocate or recycle the right number of computation resources for them. Extensive and systematic evaluations of different algorithm modules with real-world and simulated workload data confirm the superiority of our mechanism over multiple algorithms.
More
Translated text
Key words
automatic scaling system,online application,microservices architecture,auto-scaling,application workloads,microservice system,unpredictably massive microservice requests,fine-granularity performance metrics,complex dependencies,cost-efficient autoscaling system,resource amount allocation,multilevel microservice monitoring mechanism,historical service-level performance metrics,latest service-level performance metrics,over-provisioning services,under-provisioning services,random walk method,threshold-based method,resource usage status,computation resources,simulated workload data
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined