Dynamic Thread Partition Algorithm Based on Sharing Data on CMP

Parallel and Distributed Computing, Applications and Technologies(2011)

引用 1|浏览0
暂无评分
摘要
At the level of multi-core processors that share the same cache, data sharing among threads which belong to different cores may not enjoy the benifit of non-uniform cache access because it is difficult to determine which core should be set as the local position of data block while each cache block is setting as one of the core's local block. Studies have found that the cost of long latency access can be reduced by using a proper thread partition/allocation algorithm [5]. However, at present work, researchers pay little attention to thread partitioning algorithms which can reduce the cost of long latency access. In this paper, we present a dynamic thread partitioning algorithm according to data sharing among threads at the level of cache-shared-multicore processers. In our design, the algorithm makes the best effort to minimize shared block accessed by threads of different cores. Compared with the existing work, our new algorithm achieves a performance improvement. We perform experiments on 4 cores and more than 100 threads and the result show that our algorithm can reduce the interaction of threads belonging to different cores between 30% and 50% over the previously known solutions.
更多
查看译文
关键词
cmp,latency access,data sharing,thread partition,different core,proper thread partition,cache storage,dynamic thread,cache-shared-multicore processer,local block,multi-threading,on-chip latency,allocation algorithm,non-uniform cache access,long latency access,data block,multiprocessing systems,dynamic thread partition algorithm,nonuniform cache access,core local block,peer-to-peer computing,new algorithm,multicore thread,cache block,multi core processor,instruction sets,best effort,algorithm design,multi threading,clustering algorithms,chip,algorithm design and analysis,heuristic algorithm,multicore processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要