001     1018549
005     20231213202050.0
024 7 _ |a 10.1145/3624062.3624179
|2 doi
024 7 _ |a 10.34734/FZJ-2023-04877
|2 datacite_doi
037 _ _ |a FZJ-2023-04877
100 1 _ |a Schlepphorst, Simon
|0 P:(DE-Juel1)198707
|b 0
|e Corresponding author
|u fzj
111 2 _ |a SC-W 2023: Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis
|c Denver CO USA
|d 2023-11-12 - 2023-11-17
|w USA
245 _ _ |a Benchmarking a portable lattice quantum chromodynamics kernel written in Kokkos and MPI
260 _ _ |c 2023
|b ACM New York, NY, USA
295 1 0 |a Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis - ACM New York, NY, USA, 2023. - ISBN 9798400707858 - doi:10.1145/3624062.3624179
300 _ _ |a 1027–1037
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a INPROCEEDINGS
|2 BibTeX
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1702453002_3807
|2 PUB:(DE-HGF)
336 7 _ |a Contribution to a book
|0 PUB:(DE-HGF)7
|2 PUB:(DE-HGF)
|m contb
520 _ _ |a Simulations of Lattice Quantum Chromodynamics (LQCD) are an important application (two digit percentage of cycles) on major High Performance Computing (HPC) installations, including systems high up on and leading the top500 list. In the rapidly changing hardware landscape of HPC, tying up manpower optimizing simulation software for every architecture becomes a sustainability issue. In this work we explore the feasibility of using performance portable parallel code for an important LQCD kernel. Fusing the Kokkos C++ Performance Portability EcoSystem with MPI allows applications to scale on massive parallel machines while still being able to target a plentitude of different architectures with the same simple code. We report on benchmarking results for a range of currently deployed and recently introduced systems, including AMD EPYC 7742, AMD MI250, Fujitsu A64FX, Nvidia A100 and Nvidia H100 components, with mostly encouraging results.
536 _ _ |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|f POF IV
|x 0
588 _ _ |a Dataset connected to CrossRef Conference
700 1 _ |a Krieg, Stefan
|0 P:(DE-Juel1)132171
|b 1
773 _ _ |a 10.1145/3624062.3624179
856 4 _ |y OpenAccess
|u https://juser.fz-juelich.de/record/1018549/files/Benchmarking_a_portable_lattice_quantum_chromodynamics_kernel_written_in_Kokkos_and_MPI__1_.pdf
856 4 _ |y OpenAccess
|x icon
|u https://juser.fz-juelich.de/record/1018549/files/Benchmarking_a_portable_lattice_quantum_chromodynamics_kernel_written_in_Kokkos_and_MPI__1_.gif?subformat=icon
856 4 _ |y OpenAccess
|x icon-1440
|u https://juser.fz-juelich.de/record/1018549/files/Benchmarking_a_portable_lattice_quantum_chromodynamics_kernel_written_in_Kokkos_and_MPI__1_.jpg?subformat=icon-1440
856 4 _ |y OpenAccess
|x icon-180
|u https://juser.fz-juelich.de/record/1018549/files/Benchmarking_a_portable_lattice_quantum_chromodynamics_kernel_written_in_Kokkos_and_MPI__1_.jpg?subformat=icon-180
856 4 _ |y OpenAccess
|x icon-640
|u https://juser.fz-juelich.de/record/1018549/files/Benchmarking_a_portable_lattice_quantum_chromodynamics_kernel_written_in_Kokkos_and_MPI__1_.jpg?subformat=icon-640
909 C O |o oai:juser.fz-juelich.de:1018549
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)198707
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)132171
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 0
914 1 _ |y 2023
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
920 1 _ |0 I:(DE-Juel1)CASA-20230315
|k CASA
|l Center for Advanced Simulation and Analytics
|x 1
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a contb
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a I:(DE-Juel1)CASA-20230315
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21