Blk-Tune: Blocking Parameter Auto-Tuning To Minimize Input-Output Traffic For Flash-Based Out-Of-Core Stencil Computations

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2016

Abstract
This paper proposes Blk-Tune, an auto-tuning system for flash-based out-of-core stencil computations. Blk-Tune tunes blocking parameters at run time and enables the use of flash memory as an extension of main memory. It retrieves hardware information automatically using Portable Hardware Locality (hwloc) and minimizes the amount of data transferred between the flash device and DRAM, which is the dominant factor in the performance of out-of-core algorithms that use flash. Combined with explicit, highly parallel asynchronous I/O to the flash device, this auto-tuning system offers great advantages over the mmap method, in which a flash file is memory-mapped. Blk-Tune allows users to easily achieve maximum performance for large-scale stencil computations across different hardware and application settings.
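As a rough illustration of what tuning blocking parameters to minimize flash-to-DRAM traffic can look like, the sketch below searches over a spatial block width and a temporal block depth for a 1D stencil. It is not Blk-Tune's actual cost model: the traffic formula, the double-buffering assumption, and the names `traffic_per_step` and `tune` are all hypothetical and chosen only for illustration.

```python
from math import ceil


def traffic_per_step(n, width, steps, radius, elem_bytes=8):
    """Estimated bytes moved between flash and DRAM per time step.

    Assumed model (hypothetical): a 1D domain of n cells is split into
    blocks of `width` cells; each block is read once together with a halo
    of radius*steps cells on each side, advanced `steps` time steps in
    DRAM, and then written back once.
    """
    blocks = ceil(n / width)
    halo = 2 * radius * steps           # extra cells read per block
    cells_read = blocks * (width + halo)
    cells_written = n                   # each cell is written back once
    return (cells_read + cells_written) * elem_bytes / steps


def tune(n, dram_bytes, total_steps, radius=1, elem_bytes=8):
    """Exhaustive search for the (width, steps) pair with the least traffic.

    Assumes two buffers per block (double buffering), so a block plus its
    halo may occupy at most half of the available DRAM.
    """
    max_cells = dram_bytes // (2 * elem_bytes)
    best = None
    for steps in range(1, total_steps + 1):
        # Largest block whose halo still fits in the DRAM budget.
        width = min(n, max_cells - 2 * radius * steps)
        if width < 1:
            break
        cost = traffic_per_step(n, width, steps, radius, elem_bytes)
        if best is None or cost < best[0]:
            best = (cost, width, steps)
    return best


if __name__ == "__main__":
    cost, width, steps = tune(n=1_000_000, dram_bytes=1_600_000, total_steps=100)
    print(f"width={width} cells, temporal depth={steps}, ~{cost:.0f} bytes/step")
```

Under this model, deeper temporal blocking amortizes each read and write over more time steps, while the growing halo shrinks the usable block width; the search simply picks the best trade-off that fits the DRAM budget. In a real system the DRAM budget would come from a hardware query rather than a hard-coded argument.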
Keywords
auto-tuning,non-volatile memory,flash memory,memory hierarchy,tiling,temporal blocking,stencil,out-of-core,asynchronous IO,mmap,block,memory extension