001     1034807
005     20250822121412.0
024 7 _ |a 10.34734/FZJ-2024-07560
|2 datacite_doi
037 _ _ |a FZJ-2024-07560
100 1 _ |a Herten, Andreas
|0 P:(DE-Juel1)145478
|b 0
|e Corresponding author
|u fzj
111 2 _ |a JSC - as part of the Training Programme of Forschungszentrum Jülich
|c Jülich
|d 2024-04-08 - 2024-04-10
|w Germany
245 _ _ |a GPU Programming Part 1: Foundations
260 _ _ |c 2024
336 7 _ |a lecture
|2 DRIVER
336 7 _ |a Generic
|0 31
|2 EndNote
336 7 _ |a MISC
|2 BibTeX
336 7 _ |a Lecture
|b lecture
|m lecture
|0 PUB:(DE-HGF)17
|s 1737529395_26572
|2 PUB:(DE-HGF)
|x Other
336 7 _ |a LECTURE_SPEECH
|2 ORCID
336 7 _ |a Text
|2 DataCite
520 _ _ |a GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU.The course will cover aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C++, which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications.This course is a basic course covering the foundations of GPU programming including an introduction to GPU/parallel computing, programming with CUDA, GPU libraries, tools for debugging and profiling, and performance optimizations.Topics covered will include Introduction to GPUs and GPU computing, programming model CUDA, tools for debugging and profiling, GPU libraries (like cuBLAS, cuFFT), and introduction to nulti-GPU programming.
536 _ _ |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|f POF IV
|x 0
536 _ _ |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5112
|c POF4-511
|f POF IV
|x 1
536 _ _ |a 5122 - Future Computing & Big Data Systems (POF4-512)
|0 G:(DE-HGF)POF4-5122
|c POF4-512
|f POF IV
|x 2
536 _ _ |a ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)
|0 G:(DE-Juel-1)ATML-X-DEV
|c ATML-X-DEV
|x 3
700 1 _ |a Meinke, Jan
|0 P:(DE-Juel1)132189
|b 1
|e Corresponding author
|u fzj
700 1 _ |a Haghighi Mood, Kaveh
|0 P:(DE-Juel1)176293
|b 2
|u fzj
700 1 _ |a Kraus, Jiri
|0 P:(DE-Juel1)137023
|b 3
|u fzj
700 1 _ |a Hrywniak, Markus
|0 P:(DE-Juel1)180799
|b 4
|u fzj
856 4 _ |u https://indico3-jsc.fz-juelich.de/event/158/
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/02_cuda_tools_mhrywniak.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/04_cuda_transpose_mhrywniak.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/3-Matrix_Multiplication.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/CUDA_Streams_and_Events.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/Multi_GPU_Programming_with_MPI_and_CUDA.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/1034807/files/aherten-cuda-intro.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:1034807
|p openaire
|p open_access
|p VDB
|p driver
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)145478
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)132189
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)176293
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)137023
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 4
|6 P:(DE-Juel1)180799
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 0
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5112
|x 1
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-512
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Supercomputing & Big Data Infrastructures
|9 G:(DE-HGF)POF4-5122
|x 2
914 1 _ |y 2024
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a lecture
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21