Home > Publications database > Determining parallel application execution efficiency & scaling using the POP methodology > print |
001 | 1028928 | ||
005 | 20250314084122.0 | ||
024 | 7 | _ | |a 10.34734/FZJ-2024-04872 |2 datacite_doi |
037 | _ | _ | |a FZJ-2024-04872 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Garcia-Gasulla, Marta |0 P:(DE-HGF)0 |b 0 |
111 | 2 | _ | |a ISC High Performance |g ISC |c Hamburg |d 2024-05-12 - 2024-05-12 |w Germany |
245 | _ | _ | |a Determining parallel application execution efficiency & scaling using the POP methodology |
260 | _ | _ | |c 2024 |
336 | 7 | _ | |a lecture |2 DRIVER |
336 | 7 | _ | |a Generic |0 31 |2 EndNote |
336 | 7 | _ | |a MISC |2 BibTeX |
336 | 7 | _ | |a Lecture |b lecture |m lecture |0 PUB:(DE-HGF)17 |s 1734435274_31719 |2 PUB:(DE-HGF) |x After Call |
336 | 7 | _ | |a LECTURE_SPEECH |2 ORCID |
336 | 7 | _ | |a Text |2 DataCite |
520 | _ | _ | |a HPC application developers encounter significant challenges getting their codes to run correctly on leadership computer systems consisting of large numbers of interconnected multi-socket multicore processor nodes often with attached accelerator devices. They also need effective tools and methods to track and assess their codes’ execution performance as they aim to get ready for production on current or prospective exascale computer systems. This tutorial presents the methodology developed and applied over several years within the EU HPC Centre of Excellence Performance Optimisation and Productivity (POP). Its focus is the hierarchy of execution efficiency and scaling metrics that identify the most critical issues and quantify potential benefits of remedies. The metrics can be readily compared and determined by a variety of tools for applications in any language employing standard MPI, OpenMP/OpenACC and other multi-threading and offload paradigms. Using their own notebook computers, tutorial participants will follow exercises with widely-deployed open-source tools and provided performance measurements of actual HPC application executions (ranging from CFD to neuroscience), preparing them to locate and diagnose efficiency and scalability issues in their own parallel application codes. |
536 | _ | _ | |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5112 |c POF4-511 |f POF IV |x 0 |
536 | _ | _ | |a POP3 - Performance Optimisation and Productivity 3 (101143931) |0 G:(EU-Grant)101143931 |c 101143931 |f HORIZON_HORIZON-EUROHPC-JU-2023-COE-01-01 |x 1 |
536 | _ | _ | |0 G:(DE-Juel-1)ATMLPP |a ATMLPP - ATML Parallel Performance (ATMLPP) |c ATMLPP |x 2 |
700 | 1 | _ | |a Mendez, Sandra |0 P:(DE-HGF)0 |b 1 |
700 | 1 | _ | |a Visser, Anke |0 P:(DE-Juel1)132282 |b 2 |u fzj |
700 | 1 | _ | |a Wylie, Brian J. N. |0 P:(DE-Juel1)132302 |b 3 |e Corresponding author |u fzj |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20BSC%20exercise%20slides.pdf |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20BSC%20section.pdf |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20JSC%20section.pdf |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20POP%20slides.pdf |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1028928/files/ISC24%20tutorial%20preparation%20guide.pdf |
909 | C | O | |o oai:juser.fz-juelich.de:1028928 |p openaire |p open_access |p VDB |p driver |p ec_fundedresources |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 2 |6 P:(DE-Juel1)132282 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 3 |6 P:(DE-Juel1)132302 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5112 |x 0 |
914 | 1 | _ | |y 2024 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a lecture |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | 1 | _ | |a FullTexts |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|