Adaptive Compression Offloading and Resource Allocation for Edge Vision Computing

IEEE Transactions on Cognitive Communications and Networking (2024)

Abstract
The rapid progress in edge computing (EC) and 5G wireless communication technology has opened up novel opportunities for intelligent applications driven by Deep Neural Networks (DNNs). In particular, machine vision tasks are widely used in mobile/edge computing scenarios. However, the real-time, dense data transmission involved in vision inference services imposes a significant communication burden on wireless networks. This paper therefore investigates a general vision-service strategy for cognitive computing networks and proposes a communication-efficient edge inference deployment architecture for vision analytics tasks. In this framework, users dynamically perceive the inference data locally, then compress it and offload it to the edge server, which performs the inference. Specifically, we present a collaborative optimization model of compression ratio and network bandwidth that generates a reliable compression offloading and resource allocation scheme. The offloading scheme in this model accounts for the constraints imposed by delay and resources and maximizes the success probability of the vision inference tasks. To improve vision inference performance in the edge network, we further propose a flexible data compression algorithm for images or video frames that preserves the more important visual information under a fixed compression rate, reducing the inference accuracy loss caused by compression. This algorithm first perceives the importance of the visual information at different pixel positions and then compresses different visual regions to varying degrees according to their importance, enabling content-aware adaptive vision data coding. Experimental results show that the proposed offloading model and compression strategy outperform other algorithms, achieving significant communication improvements and performance gains.
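The importance-aware compression idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how importance is measured or how regions are coded, so the sketch assumes block variance as a stand-in importance score and uniform quantization as the "compression", and the names `split_blocks`, `allocate_bits`, `quantize`, and `compress` are hypothetical. It only shows the general pattern of giving important regions more bits while holding the average bit depth (the fixed compression rate) constant.

```python
from statistics import pvariance


def split_blocks(img, b):
    """Yield (row, col, pixels) for each b x b block of a 2D list image."""
    h, w = len(img), len(img[0])
    for r in range(0, h, b):
        for c in range(0, w, b):
            pixels = [img[y][x]
                      for y in range(r, min(r + b, h))
                      for x in range(c, min(c + b, w))]
            yield r, c, pixels


def allocate_bits(importance, avg_bits, spread):
    """Give the more important half of the blocks avg_bits + spread and the
    less important half avg_bits - spread, so the mean stays at avg_bits."""
    n = len(importance)
    order = sorted(range(n), key=lambda i: importance[i])
    bits = [avg_bits] * n  # a middle block (odd n) keeps avg_bits
    half = n // 2
    for i in order[:half]:
        bits[i] = avg_bits - spread
    for i in order[n - half:]:
        bits[i] = avg_bits + spread
    return bits


def quantize(value, bits):
    """Uniformly quantize an 8-bit value to `bits` bits (bin midpoint)."""
    step = 2 ** (8 - bits)
    return (value // step) * step + step // 2


def compress(img, b, avg_bits, spread):
    """Quantize each block at a bit depth chosen from its importance.
    Importance is ASSUMED to be the block's pixel variance."""
    blocks = list(split_blocks(img, b))
    importance = [pvariance(p) for _, _, p in blocks]
    bits = allocate_bits(importance, avg_bits, spread)
    out = [row[:] for row in img]
    for (r, c, _), nb in zip(blocks, bits):
        for y in range(r, min(r + b, len(img))):
            for x in range(c, min(c + b, len(img[0]))):
                out[y][x] = quantize(img[y][x], nb)
    return out, bits


# Toy 4x4 frame: flat blocks on the left, textured blocks on the right.
img = [
    [100, 100, 0, 255],
    [100, 100, 0, 255],
    [50, 50, 10, 200],
    [50, 50, 30, 220],
]
out, bits = compress(img, b=2, avg_bits=4, spread=2)
print(bits)  # [2, 6, 2, 6]
```

In this toy frame the flat blocks are coded at 2 bits and the textured blocks at 6 bits, while the average stays at 4 bits per pixel. A real system would presumably replace the variance proxy with an importance estimate tied to the inference task and the quantizer with an actual image or video codec.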
Keywords
Edge computing, computer vision service, compression offloading, resource allocation, data compression