Assessing the State of Autovectorization Support based on SVE

Brank, Bine; Pleiter, Dirk

doi:10.1109/CLUSTER51413.2022.00073

Contribution to a conference proceedings

FZJ-2022-04463

Assessing the State of Autovectorization Support based on SVE

Brank, B. (Corresponding author)FZJ* ; Pleiter, D.

2022
IEEE

2022 IEEE International Conference on Cluster Computing (CLUSTER), Heidelberg, Germany, 5 Sep 2022 - 8 Sep 2022 IEEE 556–562 (2022) [10.1109/CLUSTER51413.2022.00073]

This record in other databases:

Please use a persistent id in citations: http://hdl.handle.net/2128/32799 doi:10.1109/CLUSTER51413.2022.00073

Abstract: So-called SIMD instructions, which trigger operations that process in each clock cycle a data tuple, have become widespread in modern processor architectures. In particular, processors for high-performance computing (HPC) systems rely on this additional level of parallelism to reach a high throughput of arithmetic operations. Leveraging these SIMD instructions can still be challenging for application software developers. This challenge has become simpler due to a compiler technique called auto-vectorization. In this paper, we explore the current state of auto-vectorization capabilities using state-of-the-art compilers using a recent extension of the Arm instruction set architecture, called SVE. We measure the performance gains on a recent processor architecture supporting SVE, namely the Fujitsu A64FX processor.

Contributing Institute(s):