Enabling Region Merging Optimizations in OpenMP.

Lecture Notes in Computer Science (2015)

Abstract
Maximizing the scope of a parallel region, which avoids the costs of barriers and of launching additional parallel regions, is among the first recommendations in any optimization guide for OpenMP. While clearly beneficial and easily accomplished for code where regions are visibly contiguous, regions often become contiguous only after compiler optimization or resolution of abstraction layers. This paper explores changes to the OpenMP specification that would allow implementations to merge adjacent parallel regions automatically, including the removal of issues that make the transformation non-conforming and the addition of hints that facilitate the optimization. Beyond simple merging, we explore hints to fuse workshared loops that occur in syntactically distinct parallel regions or to apply nowait to such loops. Our evaluation shows these changes can provide an overall speedup of 2-8x for a microbenchmark, or 6% for a representative physics application.
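The merging described in the abstract can be illustrated by hand. The sketch below is not taken from the paper: the function names are hypothetical, and it uses only standard OpenMP directives, not the hints the paper proposes (whose syntax the abstract does not give). It assumes the two arrays do not alias, which is what makes `nowait` on the first loop safe.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: two syntactically separate parallel regions.
 * Each region pays for its own launch and ends in an implicit barrier. */
void scale_then_shift_separate(double *a, double *b, int n) {
    assert(a != NULL && b != NULL && n >= 0);

    #pragma omp parallel for
    for (int i = 0; i < n; ++i)
        a[i] *= 2.0;

    #pragma omp parallel for
    for (int i = 0; i < n; ++i)
        b[i] += 1.0;
}

/* The same work after merging by hand: one parallel region that
 * contains two worksharing loops. The loops touch different arrays
 * (assumed non-aliasing), so the barrier after the first loop can be
 * dropped with nowait. */
void scale_then_shift_merged(double *a, double *b, int n) {
    assert(a != NULL && b != NULL && n >= 0);

    #pragma omp parallel
    {
        #pragma omp for nowait
        for (int i = 0; i < n; ++i)
            a[i] *= 2.0;

        #pragma omp for
        for (int i = 0; i < n; ++i)
            b[i] += 1.0;
    }
}
```

Both functions compute the same result. The second form is what the abstract suggests an implementation could derive automatically when the two regions only become adjacent after inlining or other compiler optimization.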
Keywords
Region Merging, Parallel Region, OpenMP Specification, OpenMP Runtime, Nowait Clause