%0 Conference Paper
%A Herten, Andreas
%A Kraus, Jiri
%A Hernandez, Oscar
%A Ravindar, Archana
%A Hagleitner, Christoph
%T Tutorial: Application Porting and Optimization on GPU-Accelerated POWER Architectures
%M FZJ-2016-06735
%D 2016
%Z Full-day Tutorial with six talks and three hands-on sessions.
%X The POWER processor has re-emerged as a technology for supercomputer architectures. One major reason is the tight integration of processor and GPU accelerator through the new NVLink technology. Two major sites in the US, ORNL and LLNL, have already decided to have their pre-exascale systems being based on this new architecture. This tutorial will give an opportunity to obtain in-depth knowledge and experience with GPU-accelerated POWER nodes. It focuses on porting applications to a single node and covers the topics architecture, compilers, performance analysis, and multi-GPU programming. The tutorial will include an overview of the new NVLink based node architectures, lectures on first-hand experience in porting to this architecture, and will conclude with exercises using tools to focus on performance.
%B International Conference for High Performance Computing, Networking, Storage and Analysis (The Supercomputing Conference)
%C 13 Nov 2016 - 18 Nov 2016, Salt Lake City, UT (United States of America)
Y2 13 Nov 2016 - 18 Nov 2016
M2 Salt Lake City, UT, United States of America
%F PUB:(DE-HGF)31
%9 Talk (non-conference)
%U https://juser.fz-juelich.de/record/824111