Journal Article FZJ-2025-05349

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Processor simulation as a tool for performance engineering

 ;  ;  ;  ;

2025
Frontiers Media SA Beijing

Frontiers in high performance computing 3, 1669101 () [10.3389/fhpcp.2025.1669101]

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: The diversity of processor architectures used for High-Performance Computing (HPC) applications has increased significantly over the last few years. This trend is expected to continue for different reasons, including the emergence of various instruction set extensions. Examples are the renewed interest in vector instructions like Arm's Scalable Vector Extension (SVE) or RISC-V's RVV. For application developers, research software developers, and performance engineers, the increased diversity and complexity of architectures have led to the following challenges: Limited access to these different processor architectures and more difficult root cause analysis in case of performance issues. To address these challenges, we propose leveraging the much-improved capabilities of processor simulators such as gem5. We enhanced this simulator with a performance analysis framework. We extend available performance counters and introduce new analysis capabilities to track the temporal behaviour of running applications. An algorithm has been implemented to link these statistics to specific regions. The resulting performance profiles allow for the identification of code regions with the potential for optimization. The focus is on observables to monitor quantities that are usually not directly accessible on real hardware. Different algorithms have been implemented to identify potential performance bottlenecks. The framework is evaluated for different types of HPC applications like the molecular-dynamics application GROMACS, Ligra, which implements the breadth-first search (BFS) algorithm, and a kernel from the Lattice QCD solver DD-αAMG.

Classification:

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5122 - Future Computing & Big Data Systems (POF4-512) (POF4-512)
  2. EPI SGA2 (16ME0507K) (16ME0507K)
  3. The European PILOT - Pilot using Independent Local & Open Technologies (101034126) (101034126)
  4. AQTIVATE - Advanced computing, quantum algorithms, and data-driven approaches for science, technology and engineering (101072344) (101072344)

Appears in the scientific report 2025
Click to display QR Code for this record

The record appears in these collections:
Document types > Articles > Journal Article
Workflow collections > Public records
Institute Collections > JSC
Online First

 Record created 2025-12-14, last modified 2025-12-23


Restricted:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)