001     1008814
005     20250314084121.0
037 _ _ |a FZJ-2023-02486
041 _ _ |a English
100 1 _ |a Corbin, Gregor
|0 P:(DE-Juel1)188670
|b 0
|e Corresponding author
|u fzj
111 2 _ |a ISC High Performance '23
|g ISC '23
|c Hamburg
|d 2023-05-21 - 2023-05-21
|w Germany
245 _ _ |a Hands-on Practical Hybrid Parallel Application Performance Engineering
260 _ _ |c 2023
336 7 _ |a lecture
|2 DRIVER
336 7 _ |a Generic
|0 31
|2 EndNote
336 7 _ |a MISC
|2 BibTeX
336 7 _ |a Lecture
|b lecture
|m lecture
|0 PUB:(DE-HGF)17
|s 1688357466_15155
|2 PUB:(DE-HGF)
|x After Call
336 7 _ |a LECTURE_SPEECH
|2 ORCID
336 7 _ |a Text
|2 DataCite
520 _ _ |a This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on the community-developed Score-P instrumentation and measurement infrastructure, demonstrating how they can be used for performance engineering of effective scientific applications based on standard MPI, OpenMP, hybrid combination of both, and increasingly common usage of accelerators. Parallel performance tools from the Virtual Institute – High Productivity Supercomputing (VI-HPS) are introduced and featured in hands-on exercises with Score-P, Scalasca, Vampir, and TAU. We present the complete workflow of performance engineering, including instrumentation, measurement (profiling and tracing, timing and PAPI hardware counters), data storage, analysis, tuning, and visualization. Emphasis is placed on how tools are used in combination for identifying performance problems and investigating optimization alternatives. Using their own notebook computers, participants will conduct exercises on a contemporary HPC system where remote access will be provided for the hands-on sessions through AWS running an E4S [http://e4s.io] image containing all of the necessary tools. This will help to prepare participants to locate and diagnose performance bottlenecks in their own parallel programs.
536 _ _ |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5112
|c POF4-511
|f POF IV
|x 0
536 _ _ |a EUPEX - EUROPEAN PILOT FOR EXASCALE (101033975)
|0 G:(EU-Grant)101033975
|c 101033975
|f H2020-JTI-EuroHPC-2020-1
|x 1
536 _ _ |0 G:(DE-Juel-1)ATMLPP
|a ATMLPP - ATML Parallel Performance (ATMLPP)
|c ATMLPP
|x 2
700 1 _ |a Shende
|0 P:(DE-HGF)0
|b 1
700 1 _ |a Williams, William
|0 P:(DE-HGF)0
|b 2
909 C O |o oai:juser.fz-juelich.de:1008814
|p openaire
|p VDB
|p ec_fundedresources
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)188670
910 1 _ |a External Institute
|0 I:(DE-HGF)0
|k Extern
|b 1
|6 P:(DE-HGF)0
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5112
|x 0
914 1 _ |y 2023
920 _ _ |l no
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a lecture
980 _ _ |a VDB
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21