TY  - CONF
AU  - Schlütter, Marc
AU  - Philippen, Peter
AU  - Morin, Laurent
AU  - Geimer, Markus
AU  - Mohr, Bernd
TI  - Profiling Hybrid HMPP Applications with Score-P on Heterogeneous Hardware
VL  - 25
PB  - IOS Press
M1  - FZJ-2014-01861
SN  - 978-1-61499-380-3
T2  - Advances in Parallel Computing
SP  - 773 - 782
PY  - 2014
AB  - In heterogeneous environments with multi-core systems and accelerators, programming and optimizing large parallel applications turns into a time-intensive and hardware-dependent challenge. To assist application developers in this process, a number of tools and high-level compilers have been developed. Directive-based programming models such as HMPP and OpenACC provide abstractions over low-level GPU programming models,such as CUDA or OpenCL. The compilers developed by CAPS automatically transform the pragma-annotated application code into low-level code, thereby allowing the parallelization and optimization for a given accelerator hardware. To analyze the performance of parallel applications, multiple partners in Germany and the US jointly develop the community measurement infrastructure Score-P. Score-P gathers performance execution profiles, which can be presented and analyzed within the CUBE result browser, and collects detailed event traces to be processed by post-mortem analysis tools such as Scalasca and Vampir.In this paper we present the integration and combined use of Score-P and the CAPS compilers as one approach to efficiently parallelize and optimize codes. Specifically, we describe the PHMPP profiling interface, it's implementation in Score-P, and the presentation of preliminary results in CUBE.
T2  - International Conference on Parallel Computing
CY  - 10 Sep 2013 - 13 Sep 2013, Munich (Germany)
Y2  - 10 Sep 2013 - 13 Sep 2013
M2  - Munich, Germany
LB  - PUB:(DE-HGF)8 ; PUB:(DE-HGF)7
UR  - <Go to ISI:>//WOS:000452120400078
DO  - DOI:10.3233/978-1-61499-381-0-773
UR  - https://juser.fz-juelich.de/record/152041
ER  -