001034809 001__ 1034809
001034809 005__ 20250822121413.0
001034809 0247_ $$2datacite_doi$$a10.34734/FZJ-2024-07562
001034809 037__ $$aFZJ-2024-07562
001034809 1001_ $$0P:(DE-Juel1)132189$$aMeinke, Jan$$b0$$eCorresponding author$$ufzj
001034809 1112_ $$aJSC - as part of the Training Programme of Forschungszentrum Jülich$$cJülich / online$$d2024-06-03 - 2024-06-07$$wGermany
001034809 245__ $$aGPU Programming Part 2: Advanced GPU Programming
001034809 260__ $$c2024
001034809 3367_ $$2DRIVER$$alecture
001034809 3367_ $$031$$2EndNote$$aGeneric
001034809 3367_ $$2BibTeX$$aMISC
001034809 3367_ $$0PUB:(DE-HGF)17$$2PUB:(DE-HGF)$$aLecture$$blecture$$mlecture$$s1736858047_9881$$xOther
001034809 3367_ $$2ORCID$$aLECTURE_SPEECH
001034809 3367_ $$2DataCite$$aText
001034809 520__ $$aGPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU. This advanced course consists of modules providing more in-depth coverage of multi-GPU programming, modern CUDA concepts, CUDA Fortran, and portable programming models such as OpenACC and C++ parallel STL algorithms. Topics covered will include: A) Advanced Multi-GPU Programming with MPI; B) Advanced Multi-GPU Programming with NCCL and NVSHMEM; C) Advanced and Modern CUDA Concepts (Cooperative Groups, CUDA Graphs, CUB Primitives, Modern C++ Programming); D) CUDA Fortran; E) GPU Programming with Abstractions (OpenACC, Standard Language Programming (pSTL))
001034809 536__ $$0G:(DE-HGF)POF4-5111$$a5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
001034809 536__ $$0G:(DE-HGF)POF4-5112$$a5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x1
001034809 536__ $$0G:(DE-HGF)POF4-5122$$a5122 - Future Computing & Big Data Systems (POF4-512)$$cPOF4-512$$fPOF IV$$x2
001034809 536__ $$0G:(DE-Juel-1)ATML-X-DEV$$aATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)$$cATML-X-DEV$$x3
001034809 7001_ $$0P:(DE-Juel1)145478$$aHerten, Andreas$$b1$$eCorresponding author$$ufzj
001034809 7001_ $$0P:(DE-Juel1)180799$$aHrywniak, Markus$$b2$$ufzj
001034809 7001_ $$0P:(DE-Juel1)137023$$aKraus, Jiri$$b3$$ufzj
001034809 7001_ $$0P:(DE-Juel1)164813$$aBadwaik, Jayesh$$b4$$ufzj
001034809 7001_ $$0P:(DE-Juel1)176293$$aHaghighi Mood, Kaveh$$b5$$ufzj
001034809 8564_ $$uhttps://indico3-jsc.fz-juelich.de/event/175/overview
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/01-MPI.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/02-NCCL_NVSHMEM.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/03a-CUDA-C%2B%2B.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/03b-CUB_intro.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/03c-cooperative-groups.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/03d-cuda-graphs.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/04-kokkos.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/05a-openacc-intro.pdf$$yOpenAccess
001034809 8564_ $$uhttps://juser.fz-juelich.de/record/1034809/files/05b-pSTL.pdf$$yOpenAccess
001034809 909CO $$ooai:juser.fz-juelich.de:1034809$$pdriver$$pVDB$$popen_access$$popenaire
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132189$$aForschungszentrum Jülich$$b0$$kFZJ
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)145478$$aForschungszentrum Jülich$$b1$$kFZJ
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)180799$$aForschungszentrum Jülich$$b2$$kFZJ
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)137023$$aForschungszentrum Jülich$$b3$$kFZJ
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)164813$$aForschungszentrum Jülich$$b4$$kFZJ
001034809 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)176293$$aForschungszentrum Jülich$$b5$$kFZJ
001034809 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5111$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
001034809 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5112$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x1
001034809 9131_ $$0G:(DE-HGF)POF4-512$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5122$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vSupercomputing & Big Data Infrastructures$$x2
001034809 9141_ $$y2024
001034809 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
001034809 920__ $$lyes
001034809 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
001034809 980__ $$alecture
001034809 980__ $$aVDB
001034809 980__ $$aUNRESTRICTED
001034809 980__ $$aI:(DE-Juel1)JSC-20090406
001034809 9801_ $$aFullTexts