TRACO: Source-to-Source Parallelizing Compiler
keywords: Source-to-source parallelizing compiler, loop parallelization, iteration space slicing, fine- and coarse-grained parallelism, free scheduling, transitive closure
The paper presents a source-to-source compiler, TRACO, for automatic extraction of both coarse- and fine-grained parallelism available in C/C++ loops. Parallelization techniques implemented in TRACO are based on the transitive closure of a relation describing all the dependences in a loop. Coarse- and fine-grained parallelism is represented with synchronization-free slices (space partitions) and a legal loop statement instance schedule (time partitions), respectively. TRACO enables also applying scalar and array variable privatization as well as parallel reduction. On its output, TRACO produces compilable parallel OpenMP C/C++ and/or OpenACC C/C++ code. The effectiveness of TRACO, efficiency of parallel code produced by TRACO, and the time of parallel code production are evaluated by means of the NAS Parallel Benchmark and Polyhedral Benchmark suites. These features of TRACO are compared with closely related compilers such as ICC, Pluto, Par4All, and Cetus. Feature work is outlined.
mathematics subject classification 2000: 68N20, 65Y05, 52Bxx, 97E60, 05-XX
reference: Vol. 35, 2016, No. 6, pp. 1277–1306