000151320 001__ 151320
000151320 005__ 20250314084110.0
000151320 0247_ $$2Handle$$a2128/5921
000151320 037__ $$aFZJ-2014-01299
000151320 041__ $$aEnglish
000151320 1001_ $$0P:(DE-Juel1)132112$$aGeimer, Markus$$b0$$eCorresponding author$$ufzj
000151320 1112_ $$a25th International Conference for High Performance Computing, Networking, Storage and Analysis$$cDenver$$d2013-11-17 - 2013-11-22$$gSC13$$wUSA
000151320 245__ $$aHands-on Practical Hybrid Parallel Application Performance Engineering
000151320 260__ $$c2013
000151320 3367_ $$0PUB:(DE-HGF)6$$2PUB:(DE-HGF)$$aConference Presentation$$bconf$$mconf$$s151320$$xAfter Call
000151320 3367_ $$033$$2EndNote$$aConference Paper
000151320 3367_ $$2DataCite$$aOther
000151320 3367_ $$2ORCID$$aLECTURE_SPEECH
000151320 3367_ $$2DRIVER$$aconferenceObject
000151320 3367_ $$2BibTeX$$aINPROCEEDINGS
000151320 520__ $$aThis tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on the Score-P community instrumentation and measurement infrastructure, demonstrating how they can be used for performance engineering of effective scientific applications based on standard MPI, OpenMP, hybrid MPI+OpenMP, and increasingly common usage of accelerators. Parallel performance evaluation tools from the VI-HPS (Virtual Institute High Productivity Supercomputing) are introduced and featured in hands-on exercises with Scalasca, Vampir and TAU. We present the complete workflow of performance engineering, including instrumentation, measurement (profiling and tracing, timing and PAPI hardware counters), data storage, analysis, and visualization. Emphasis is placed on how tools are used in combination for identifying performance problems and investigating optimization alternatives. Using their own notebook computers with a provided Linux Live-ISO image containing the tools (booted from DVD/USB or within a virtual machine) will help to prepare participants to locate and diagnose performance bottlenecks in their own parallel programs.
000151320 536__ $$0G:(DE-HGF)POF2-411$$a411 - Computational Science and Mathematical Methods (POF2-411)$$cPOF2-411$$fPOF II$$x0
000151320 536__ $$0G:(DE-Juel-1)ATMLPP$$aATMLPP - ATML Parallel Performance (ATMLPP)$$cATMLPP$$x1
000151320 7001_ $$0P:(DE-HGF)0$$aShende, Sameer$$b1
000151320 7001_ $$0P:(DE-HGF)0$$aWesarg, Bert$$b2
000151320 7001_ $$0P:(DE-Juel1)132302$$aWylie, Brian J. N.$$b3$$ufzj
000151320 8564_ $$uhttps://juser.fz-juelich.de/record/151320/files/FZJ-2014-01299.pdf$$yOpenAccess
000151320 8564_ $$uhttps://juser.fz-juelich.de/record/151320/files/FZJ-2014-01299.jpg?subformat=icon-144$$xicon-144$$yOpenAccess
000151320 8564_ $$uhttps://juser.fz-juelich.de/record/151320/files/FZJ-2014-01299.jpg?subformat=icon-180$$xicon-180$$yOpenAccess
000151320 8564_ $$uhttps://juser.fz-juelich.de/record/151320/files/FZJ-2014-01299.jpg?subformat=icon-640$$xicon-640$$yOpenAccess
000151320 909CO $$ooai:juser.fz-juelich.de:151320$$pdriver$$pVDB$$popen_access$$popenaire
000151320 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000151320 9141_ $$y2013
000151320 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132112$$aForschungszentrum Jülich GmbH$$b0$$kFZJ
000151320 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132302$$aForschungszentrum Jülich GmbH$$b3$$kFZJ
000151320 9131_ $$0G:(DE-HGF)POF2-411$$1G:(DE-HGF)POF2-410$$2G:(DE-HGF)POF2-400$$3G:(DE-HGF)POF2$$4G:(DE-HGF)POF$$aDE-HGF$$bSchlüsseltechnologien$$lSupercomputing$$vComputational Science and Mathematical Methods$$x0
000151320 920__ $$lyes
000151320 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000151320 980__ $$aconf
000151320 980__ $$aUNRESTRICTED
000151320 980__ $$aFullTexts
000151320 980__ $$aI:(DE-Juel1)JSC-20090406
000151320 980__ $$aVDB
000151320 9801_ $$aFullTexts