Programmable Data Parallel Accelerator For Mobile Computer Vision

2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP)(2015)

引用 2|浏览34
暂无评分
摘要
The demand for high performance yet extremely low-power multimedia accelerators for mobile communication is ever growing. To meet this challenge a novel approach with a very low-power programmable TTA processor is proposed in this paper. The processor is benchmarked with two OpenCL computer vision applications; depth estimation and face detection. The former is an excellent example of a highly parallel algorithm that suits our TTA processor extremely well whereas the latter is an example of a more serial algorithm that poses a challenge for GPU-style parallel platforms. Both algorithms are also implemented and optimized for a high throughput AMD Radeon HD 7750 GPU, Qualcomm Adreno 330 mobile GPU and Intel Core i5-480M for a fair comparison of performance and energy efficiency. These platforms are chosen because they all can be programmed with OpenCL with equivalent programming efforts. In this paper we show that our novel approach can achieve real-time requirements and easily outperform both GPUs as well as the CPU in terms of throughput per watt criterion, making it an excellent candidate for power-constrained mobile platforms.
更多
查看译文
关键词
programmable data parallel accelerator,mobile computer vision,extremely low-power multimedia accelerators,mobile communication,very low-power programmable TTA processor,OpenCL computer vision applications,depth estimation,face detection,highly parallel algorithm,GPU-style parallel platforms,AMD Radeon HD 7750 GPU,Qualcomm Adreno 330 mobile GPU,Intel Core i5-480M,power-constrained mobile platforms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要