001028928 001__ 1028928
001028928 005__ 20250314084122.0
001028928 0247_ $$2datacite_doi$$a10.34734/FZJ-2024-04872
001028928 037__ $$aFZJ-2024-04872
001028928 041__ $$aEnglish
001028928 1001_ $$0P:(DE-HGF)0$$aGarcia-Gasulla, Marta$$b0
001028928 1112_ $$aISC High Performance$$cHamburg$$d2024-05-12 - 2024-05-12$$gISC$$wGermany
001028928 245__ $$aDetermining parallel application execution efficiency & scaling using the POP methodology
001028928 260__ $$c2024
001028928 3367_ $$2DRIVER$$alecture
001028928 3367_ $$031$$2EndNote$$aGeneric
001028928 3367_ $$2BibTeX$$aMISC
001028928 3367_ $$0PUB:(DE-HGF)17$$2PUB:(DE-HGF)$$aLecture$$blecture$$mlecture$$s1734435274_31719$$xAfter Call
001028928 3367_ $$2ORCID$$aLECTURE_SPEECH
001028928 3367_ $$2DataCite$$aText
001028928 520__ $$aHPC application developers encounter significant challenges getting their codes to run correctly on leadership computer systems consisting of large numbers of interconnected multi-socket multicore processor nodes often with attached accelerator devices. They also need effective tools and methods to track and assess their codes’ execution performance as they aim to get ready for production on current or prospective exascale computer systems. This tutorial presents the methodology developed and applied over several years within the EU HPC Centre of Excellence Performance Optimisation and Productivity (POP). Its focus is the hierarchy of execution efficiency and scaling metrics that identify the most critical issues and quantify potential benefits of remedies. The metrics can be readily compared and determined by a variety of tools for applications in any language employing standard MPI, OpenMP/OpenACC and other multi-threading and offload paradigms. Using their own notebook computers, tutorial participants will follow exercises with widely-deployed open-source tools and provided performance measurements of actual HPC application executions (ranging from CFD to neuroscience), preparing them to locate and diagnose efficiency and scalability issues in their own parallel application codes.
001028928 536__ $$0G:(DE-HGF)POF4-5112$$a5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
001028928 536__ $$0G:(EU-Grant)101143931$$aPOP3 - Performance Optimisation and Productivity 3 (101143931)$$c101143931$$fHORIZON_HORIZON-EUROHPC-JU-2023-COE-01-01$$x1
001028928 536__ $$0G:(DE-Juel-1)ATMLPP$$aATMLPP - ATML Parallel Performance (ATMLPP)$$cATMLPP$$x2
001028928 7001_ $$0P:(DE-HGF)0$$aMendez, Sandra$$b1
001028928 7001_ $$0P:(DE-Juel1)132282$$aVisser, Anke$$b2$$ufzj
001028928 7001_ $$0P:(DE-Juel1)132302$$aWylie, Brian J. N.$$b3$$eCorresponding author$$ufzj
001028928 8564_ $$uhttps://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20BSC%20exercise%20slides.pdf$$yOpenAccess
001028928 8564_ $$uhttps://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20BSC%20section.pdf$$yOpenAccess
001028928 8564_ $$uhttps://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20JSC%20section.pdf$$yOpenAccess
001028928 8564_ $$uhttps://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20POP%20slides.pdf$$yOpenAccess
001028928 8564_ $$uhttps://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20preparation%20guide.pdf$$yOpenAccess
001028928 909CO $$ooai:juser.fz-juelich.de:1028928$$pec_fundedresources$$pdriver$$pVDB$$popen_access$$popenaire
001028928 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132282$$aForschungszentrum Jülich$$b2$$kFZJ
001028928 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132302$$aForschungszentrum Jülich$$b3$$kFZJ
001028928 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5112$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
001028928 9141_ $$y2024
001028928 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
001028928 920__ $$lyes
001028928 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
001028928 980__ $$alecture
001028928 980__ $$aVDB
001028928 980__ $$aUNRESTRICTED
001028928 980__ $$aI:(DE-Juel1)JSC-20090406
001028928 9801_ $$aFullTexts