Multi-Agent Reinforcement Learning for Thermally-Restricted Performance Optimization on Manycores.

Heba Khdr, Mustafa Enes Batur, Kanran Zhou,Mohammed Bakr Sikal,Jörg Henkel

Design, Automation, and Test in Europe(2024)

引用 0|浏览6
暂无评分
摘要
The problem of performance maximization under a thermal constraint has been tackled by means of dynamic voltage and frequency scaling (DVFS) in many system-level optimization techniques. State-of-the-art ones have exploited Su-pervised Learning (SL) to develop models that predict power and performance characteristics of applications and temperature of the cores. Such predictions enable proactive and efficient optimization decisions that exploit performance potentials under a temperature constraint. SL- based models are built at design time based on training data generated considering specific environment settings, i.e., processor architecture, cooling system, ambient temperature, etc. Hence, these models cannot adapt at runtime to different environment settings. In contrast, Reinforcement Learning (RL) employs an agent that explores and learns the environment at runtime, and hence can adapt to its potential changes. Nonetheless, using an RL agent to perform optimization on manycores is challenging because of the inherent large state/action spaces that might hinder the agent's ability to converge. To get the advantages of RL while tackling this challenge, we employ for the first time multi -agent RL to perform thermally-restricted performance optimization for manycores through DVFS. We investigated two RL algorithms-Table-based Q-Learning (TQL) and Deep Q-Learning (DQL)-and demonstrated that the latter outperforms the former. Compared to the state of the art, our DQL delivers a significant performance improvement of 34.96% on average, while also guaranteeing thermally -safe operation on the manycore. Our evaluation reveals the runtime adaptability of our DQL to varying workloads and ambient temperatures.
更多
查看译文
关键词
Multi-agent Reinforcement Learning,Training Data,Environmental Settings,Processing Architecture,Reinforcement Learning Agent,Deep Q-learning,Thermal Constraints,Temperature Constraints,Neural Network,System Performance,Deep Neural Network,State Space,Downscaling,Core Level,Core Temperature,Reward Function,Power Constraint,System Utility,Comparison Of Techniques,Reinforcement Learning Algorithm,Evaluation Scenarios,Performance Constraints,Deep Q-network,Reinforcement Learning Methods,Arrival Rate,Random Action,Single Batch
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要