Home > Publications database > Many Cores, Many Models: GPU Programming Model vs. Vendor Compatibility Overview > print |
001 | 1018971 | ||
005 | 20250822121436.0 | ||
024 | 7 | _ | |a 10.1145/3624062.3624178 |2 doi |
024 | 7 | _ | |a 10.34734/FZJ-2023-05040 |2 datacite_doi |
037 | _ | _ | |a FZJ-2023-05040 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Herten, Andreas |0 P:(DE-Juel1)145478 |b 0 |e Corresponding author |
111 | 2 | _ | |a SC-W 2023: Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis |g SC-W23 |c Denver, CO |d 2023-11-12 - 2023-11-17 |w USA |
245 | _ | _ | |a Many Cores, Many Models: GPU Programming Model vs. Vendor Compatibility Overview |
260 | _ | _ | |c 2023 |b ACM New York, NY, USA |
295 | 1 | 0 | |a Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis |
300 | _ | _ | |a 1019–1026 |
336 | 7 | _ | |a CONFERENCE_PAPER |2 ORCID |
336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
336 | 7 | _ | |a conferenceObject |2 DRIVER |
336 | 7 | _ | |a Output Types/Conference Paper |2 DataCite |
336 | 7 | _ | |a Contribution to a conference proceedings |b contrib |m contrib |0 PUB:(DE-HGF)8 |s 1702637588_5921 |2 PUB:(DE-HGF) |
336 | 7 | _ | |a Contribution to a book |0 PUB:(DE-HGF)7 |2 PUB:(DE-HGF) |m contb |
500 | _ | _ | |a arXiv: https://arxiv.org/abs/2309.05445 HTML-version of table: https://x-dev.pages.jsc.fz-juelich.de/models/ Data repository: https://github.com/AndiH/gpu-lang-compat |
520 | _ | _ | |a In recent history, GPUs became a key driver of compute performance in HPC. With the installation of the Frontier supercomputer, they became the enablers of the Exascale era; further largest-scale installations are in progress (Aurora, El Capitan, JUPITER). But the early-day dominance by NVIDIA and their CUDA programming model has changed: The current HPC GPU landscape features three vendors (AMD, Intel, NVIDIA), each with native and derived programming models. The choices are ample, but not all models are supported on all platforms, especially if support for Fortran is needed; in addition, some restrictions might apply. It is hard for scientific programmers to navigate this abundance of choices and limits. This paper gives a guide by matching the GPU platforms with supported programming models, presented in a concise table and further elaborated in detailed comments. An assessment is made regarding the level of support of a model on a platform. |
536 | _ | _ | |a 5122 - Future Computing & Big Data Systems (POF4-512) |0 G:(DE-HGF)POF4-5122 |c POF4-512 |f POF IV |x 0 |
536 | _ | _ | |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5112 |c POF4-511 |f POF IV |x 1 |
536 | _ | _ | |a ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV) |0 G:(DE-Juel-1)ATML-X-DEV |c ATML-X-DEV |x 2 |
588 | _ | _ | |a Dataset connected to CrossRef Conference |
773 | _ | _ | |a 10.1145/3624062.3624178 |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1018971/files/2309.05445.pdf |
856 | 4 | _ | |y OpenAccess |x icon |u https://juser.fz-juelich.de/record/1018971/files/2309.05445.gif?subformat=icon |
856 | 4 | _ | |y OpenAccess |x icon-1440 |u https://juser.fz-juelich.de/record/1018971/files/2309.05445.jpg?subformat=icon-1440 |
856 | 4 | _ | |y OpenAccess |x icon-180 |u https://juser.fz-juelich.de/record/1018971/files/2309.05445.jpg?subformat=icon-180 |
856 | 4 | _ | |y OpenAccess |x icon-640 |u https://juser.fz-juelich.de/record/1018971/files/2309.05445.jpg?subformat=icon-640 |
909 | C | O | |o oai:juser.fz-juelich.de:1018971 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)145478 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-512 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Supercomputing & Big Data Infrastructures |9 G:(DE-HGF)POF4-5122 |x 0 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5112 |x 1 |
914 | 1 | _ | |y 2023 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a contrib |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a contb |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | 1 | _ | |a FullTexts |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|