000915940 001__ 915940
000915940 005__ 20250822121409.0
000915940 0247_ $$2Handle$$a2128/33226
000915940 037__ $$aFZJ-2022-05803
000915940 041__ $$aEnglish
000915940 1001_ $$0P:(DE-Juel1)145478$$aHerten, Andreas$$b0$$eCorresponding author$$ufzj
000915940 1112_ $$aPRACE Training Course at JSC$$conline$$d2022-04-25 - 2022-04-29
000915940 245__ $$aGPU Programming with CUDA
000915940 260__ $$c2022
000915940 3367_ $$2DRIVER$$alecture
000915940 3367_ $$031$$2EndNote$$aGeneric
000915940 3367_ $$2BibTeX$$aMISC
000915940 3367_ $$0PUB:(DE-HGF)17$$2PUB:(DE-HGF)$$aLecture$$blecture$$mlecture$$s1671615989_30090$$xOther
000915940 3367_ $$2ORCID$$aLECTURE_SPEECH
000915940 3367_ $$2DataCite$$aText
000915940 500__ $$aOnline course within the PRACE and FZJ training program.
000915940 520__ $$aGPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course covers basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C/C++ which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. Topics covered will include: Introduction to GPU/Parallel computing; Programming model CUDA; GPU libraries like CuBLAS and CuFFT; Tools for debugging and profiling; Performance optimizations; Advanced GPU programming model; CUDA Fortran in a nutshell.This course is a PRACE training course.
000915940 536__ $$0G:(DE-HGF)POF4-5112$$a5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
000915940 536__ $$0G:(DE-HGF)POF4-5111$$a5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x1
000915940 536__ $$0G:(EU-Grant)823767$$aPRACE-6IP - PRACE 6th Implementation Phase Project (823767)$$c823767$$fH2020-INFRAEDI-2018-1$$x2
000915940 536__ $$0G:(DE-Juel-1)ATML-X-DEV$$aATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)$$cATML-X-DEV$$x3
000915940 7001_ $$0P:(DE-Juel1)132189$$aMeinke, Jan$$b1$$eCorresponding author$$ufzj
000915940 7001_ $$0P:(DE-Juel1)176293$$aHaghighi Mood, Kaveh$$b2$$ufzj
000915940 7001_ $$0P:(DE-Juel1)137023$$aKraus, Jiri$$b3$$ufzj
000915940 7001_ $$0P:(DE-Juel1)180799$$aHrywniak, Markus$$b4$$ufzj
000915940 8564_ $$uhttps://www.fz-juelich.de/en/ias/jsc/news/events/training-courses/2022/ptc-gpu-cuda-2022
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/01-aherten-cuda-intro.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/02-mhrywniak-cuda_tools.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/03-haghighi-Matrix_Multiplication.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/04-mhrywniak-perf_opt.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/05-jkraus-Multi_GPU_Programming_with_MPI_and_CUDA.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/06-haghighi-CUDA_Streams_and_Events.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/07-haghighi-CUDA_DGEMM_Tiled.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/08-aherten-cooperative-groups.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/09-jmeinke-CUDA-C%2B%2B.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/10-jmeinke-CUB_intro.pdf$$yOpenAccess
000915940 8564_ $$uhttps://juser.fz-juelich.de/record/915940/files/11-haghighi-CudaFortran.pdf$$yOpenAccess
000915940 909CO $$ooai:juser.fz-juelich.de:915940$$pec_fundedresources$$pdriver$$pVDB$$popen_access$$popenaire
000915940 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)145478$$aForschungszentrum Jülich$$b0$$kFZJ
000915940 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132189$$aForschungszentrum Jülich$$b1$$kFZJ
000915940 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)176293$$aForschungszentrum Jülich$$b2$$kFZJ
000915940 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)137023$$aForschungszentrum Jülich$$b3$$kFZJ
000915940 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)180799$$aForschungszentrum Jülich$$b4$$kFZJ
000915940 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5112$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
000915940 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5111$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x1
000915940 9141_ $$y2022
000915940 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000915940 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000915940 980__ $$alecture
000915940 980__ $$aVDB
000915940 980__ $$aUNRESTRICTED
000915940 980__ $$aI:(DE-Juel1)JSC-20090406
000915940 9801_ $$aFullTexts