001     915940
005     20250822121409.0
024 7 _ |a 2128/33226
|2 Handle
037 _ _ |a FZJ-2022-05803
041 _ _ |a English
100 1 _ |a Herten, Andreas
|0 P:(DE-Juel1)145478
|b 0
|e Corresponding author
|u fzj
111 2 _ |a PRACE Training Course at JSC
|c online
|d 2022-04-25 - 2022-04-29
245 _ _ |a GPU Programming with CUDA
260 _ _ |c 2022
336 7 _ |a lecture
|2 DRIVER
336 7 _ |a Generic
|0 31
|2 EndNote
336 7 _ |a MISC
|2 BibTeX
336 7 _ |a Lecture
|b lecture
|m lecture
|0 PUB:(DE-HGF)17
|s 1671615989_30090
|2 PUB:(DE-HGF)
|x Other
336 7 _ |a LECTURE_SPEECH
|2 ORCID
336 7 _ |a Text
|2 DataCite
500 _ _ |a Online course within the PRACE and FZJ training program.
520 _ _ |a GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course covers basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C/C++ which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. Topics covered will include: Introduction to GPU/Parallel computing; Programming model CUDA; GPU libraries like CuBLAS and CuFFT; Tools for debugging and profiling; Performance optimizations; Advanced GPU programming model; CUDA Fortran in a nutshell.This course is a PRACE training course.
536 _ _ |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5112
|c POF4-511
|f POF IV
|x 0
536 _ _ |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|f POF IV
|x 1
536 _ _ |a PRACE-6IP - PRACE 6th Implementation Phase Project (823767)
|0 G:(EU-Grant)823767
|c 823767
|f H2020-INFRAEDI-2018-1
|x 2
536 _ _ |a ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)
|0 G:(DE-Juel-1)ATML-X-DEV
|c ATML-X-DEV
|x 3
700 1 _ |a Meinke, Jan
|0 P:(DE-Juel1)132189
|b 1
|e Corresponding author
|u fzj
700 1 _ |a Haghighi Mood, Kaveh
|0 P:(DE-Juel1)176293
|b 2
|u fzj
700 1 _ |a Kraus, Jiri
|0 P:(DE-Juel1)137023
|b 3
|u fzj
700 1 _ |a Hrywniak, Markus
|0 P:(DE-Juel1)180799
|b 4
|u fzj
856 4 _ |u https://www.fz-juelich.de/en/ias/jsc/news/events/training-courses/2022/ptc-gpu-cuda-2022
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/01-aherten-cuda-intro.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/02-mhrywniak-cuda_tools.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/03-haghighi-Matrix_Multiplication.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/04-mhrywniak-perf_opt.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/05-jkraus-Multi_GPU_Programming_with_MPI_and_CUDA.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/06-haghighi-CUDA_Streams_and_Events.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/07-haghighi-CUDA_DGEMM_Tiled.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/08-aherten-cooperative-groups.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/09-jmeinke-CUDA-C%2B%2B.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/10-jmeinke-CUB_intro.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/915940/files/11-haghighi-CudaFortran.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:915940
|p openaire
|p open_access
|p VDB
|p driver
|p ec_fundedresources
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)145478
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)132189
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)176293
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)137023
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 4
|6 P:(DE-Juel1)180799
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5112
|x 0
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 1
914 1 _ |y 2022
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a lecture
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21