JetEsti: A New DLT Job Scheduling Simulator Based on Fine-Grained Process Modeling.

ICDCS (2023)

Abstract
Large-scale Deep Learning Training (DLT) jobs are time-consuming and are usually run in distributed cluster environments. However, existing DLT frameworks such as TensorFlow do not include ad hoc optimizations for parallelism and scheduling, which results in seriously low efficiency. Because of this, researchers need to choose appropriate scheduling algorithms for cluster jobs. Given the high cost of hardware resources, it is necessary to use a job scheduling simulator (JSS) to verify the performance of different scheduling algorithms in advance.
Keywords
Deep Learning Training, Job Execution Time, Modeling and Simulation, Job Scheduling Simulator.