Contribution to a conference proceedings FZJ-2022-04463

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Assessing the State of Autovectorization Support based on SVE

 ;

2022
IEEE

2022 IEEE International Conference on Cluster Computing (CLUSTER), HeidelbergHeidelberg, Germany, 5 Sep 2022 - 8 Sep 20222022-09-052022-09-08 IEEE 556–562 () [10.1109/CLUSTER51413.2022.00073]

This record in other databases:  

Please use a persistent id in citations:   doi:

Abstract: So-called SIMD instructions, which trigger operations that process in each clock cycle a data tuple, have become widespread in modern processor architectures. In particular, processors for high-performance computing (HPC) systems rely on this additional level of parallelism to reach a high throughput of arithmetic operations. Leveraging these SIMD instructions can still be challenging for application software developers. This challenge has become simpler due to a compiler technique called auto-vectorization. In this paper, we explore the current state of auto-vectorization capabilities using state-of-the-art compilers using a recent extension of the Arm instruction set architecture, called SVE. We measure the performance gains on a recent processor architecture supporting SVE, namely the Fujitsu A64FX processor.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5122 - Future Computing & Big Data Systems (POF4-512) (POF4-512)
  2. Mont-Blanc 2020 - Mont-Blanc 2020, European scalable, modular and power efficient HPC processor (779877) (779877)

Appears in the scientific report 2022
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2022-11-08, last modified 2023-02-24


OpenAccess:
Download fulltext PDF
External link:
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)