RobustDiCE: Robust and Distributed CNN Inference at the Edge

2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)(2024)

引用 0|浏览1
暂无评分
摘要
Prevalent large CNN models pose a significant challenge in terms of computing resources for resource-constrained devices at the Edge. Distributing the computations and coefficients over multiple edge devices collaboratively has been well studied but these works generally do not consider the presence of device failures (e.g., due to temporary connectivity issues, overload, discharged battery, etc. of edge devices). Such unpredictable failures can compromise the reliability of edge devices, inhibiting the proper execution of distributed CNN inference. In this paper, we present a novel partitioning method, called RobustDiCE, for robust distribution and inference of CNN models over multiple edge devices. Our method can tolerate intermittent and permanent device failures in a distributed system at the Edge, offering a tunable trade-off between robustness (i.e., retaining model accuracy after failures) and resource utilization. We evaluate RobustDiCE using the ImageNet-1K dataset on several representative CNN models under various device failure scenarios and compare it with several state-of-the-art partitioning methods as well as an optimal robustness approach (i.e., full neuron replication). In addition, we demonstrate RobustDiCE’s advantages in terms of memory usage and energy consumption per device, and system throughput for various system set-ups with different device counts.
更多
查看译文
关键词
Convolutional Neural Network,Convolutional Neural Networks Inference,Energy Consumption,Resource Utilization,Computational Resources,Distribution System,Convolutional Neural Network Model,Memory Usage,Multiple Devices,Multiple Edges,Partitioning Method,System Throughput,Robust Inference,Edge Devices,Device Failure,Group Setting,Neurons In Layer,Failure Events,System Configuration,Blue Bars,Convolutional Neural Network Layers,Top-1 Accuracy,Node Failure,Optimal Scenario,Importance Scores,Inference Accuracy,Groups Of Neurons,Neuronal Clusters,Design Space Exploration,Partition Model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要