Using AVX2 Instruction Set to Increase Performance of High Performance Computing Code

keywords: FMA operations, performance of AVX2 instruction set, benchmarking, Haswell processor, 4th generation Intel Core processor, Intel Xeon E5-2600v3 series processor
In this paper we discuss the new Intel instruction set extensions, Intel Advanced Vector Extensions 2 (AVX2), and what they bring to high performance computing (HPC). To illustrate this, new systems utilizing AVX2 are evaluated to demonstrate how to exploit AVX2 effectively for HPC-type code and to expose situations in which AVX2 might not be the most effective way to increase performance.
mathematics subject classification 2000: 68M07, 68M20
reference: Vol. 36, 2017, No. 5, pp. 1001–1018