Smart resource allocation of concurrent execution of parallel applications

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE(2023)

引用 2|浏览14
暂无评分
摘要
Thread-level parallelism (TLP) has been widely exploited to optimize computational resource usage in high-performance systems. However, as many applications do not scale as the number of threads increase, resources will be wasted when the application executes with the maximum possible number of threads (i.e., the default execution) rather than fewer threads (thread throttling) that may use the resources more efficiently. Hence, instead of executing only one application with as many threads as possible, one can run more applications simultaneously by applying thread throttling to each one. The primary outcome of this strategy is a significant reduction in the total execution time and energy consumption when the system needs to execute a list of applications. Given that, we propose a smart resource allocation (SRA) for concurrent parallel application execution. It automatically finds the ideal degree of TLP for each application and guides the simultaneous parallel applications execution. When running 25 well-known benchmarks on three multicore systems and comparing SRA to state-of-the-art strategies (e.g., Batch, Equal policy, and Scalability), SRA improves the EDP by 87.4% over the Batch strategy; 75.5% over the Equal policy; and 38.8% over the scalability strategy.
更多
查看译文
关键词
energy-delay product,parallel computing,run-time optimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要