Contribution to a conference proceedings/Contribution to a book FZJ-2014-01861

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Profiling Hybrid HMPP Applications with Score-P on Heterogeneous Hardware

 ;  ;  ;  ;

2014
IOS Press
ISBN: 978-1-61499-380-3

Parallel Computing: Accelerating Computational Science and Engineering (CSE)
International Conference on Parallel Computing, ParCo 2013, MunichMunich, Germany, 10 Sep 2013 - 13 Sep 20132013-09-102013-09-13
IOS Press, Advances in Parallel Computing 25, 773 - 782 () [10.3233/978-1-61499-381-0-773]

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: In heterogeneous environments with multi-core systems and accelerators, programming and optimizing large parallel applications turns into a time-intensive and hardware-dependent challenge. To assist application developers in this process, a number of tools and high-level compilers have been developed. Directive-based programming models such as HMPP and OpenACC provide abstractions over low-level GPU programming models,such as CUDA or OpenCL. The compilers developed by CAPS automatically transform the pragma-annotated application code into low-level code, thereby allowing the parallelization and optimization for a given accelerator hardware. To analyze the performance of parallel applications, multiple partners in Germany and the US jointly develop the community measurement infrastructure Score-P. Score-P gathers performance execution profiles, which can be presented and analyzed within the CUBE result browser, and collects detailed event traces to be processed by post-mortem analysis tools such as Scalasca and Vampir.In this paper we present the integration and combined use of Score-P and the CAPS compilers as one approach to efficiently parallelize and optimize codes. Specifically, we describe the PHMPP profiling interface, it's implementation in Score-P, and the presentation of preliminary results in CUBE.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 411 - Computational Science and Mathematical Methods (POF2-411) (POF2-411)
  2. ATMLPP - ATML Parallel Performance (ATMLPP) (ATMLPP)

Appears in the scientific report 2014
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2014-04-01, last modified 2025-03-14


Restricted:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)