Hauptseite > Publikationsdatenbank > GPU Programming with CUDA > print |
001 | 915940 | ||
005 | 20250822121409.0 | ||
024 | 7 | _ | |a 2128/33226 |2 Handle |
037 | _ | _ | |a FZJ-2022-05803 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Herten, Andreas |0 P:(DE-Juel1)145478 |b 0 |e Corresponding author |u fzj |
111 | 2 | _ | |a PRACE Training Course at JSC |c online |d 2022-04-25 - 2022-04-29 |
245 | _ | _ | |a GPU Programming with CUDA |
260 | _ | _ | |c 2022 |
336 | 7 | _ | |a lecture |2 DRIVER |
336 | 7 | _ | |a Generic |0 31 |2 EndNote |
336 | 7 | _ | |a MISC |2 BibTeX |
336 | 7 | _ | |a Lecture |b lecture |m lecture |0 PUB:(DE-HGF)17 |s 1671615989_30090 |2 PUB:(DE-HGF) |x Other |
336 | 7 | _ | |a LECTURE_SPEECH |2 ORCID |
336 | 7 | _ | |a Text |2 DataCite |
500 | _ | _ | |a Online course within the PRACE and FZJ training program. |
520 | _ | _ | |a GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course covers basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C/C++ which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. Topics covered will include: Introduction to GPU/Parallel computing; Programming model CUDA; GPU libraries like CuBLAS and CuFFT; Tools for debugging and profiling; Performance optimizations; Advanced GPU programming model; CUDA Fortran in a nutshell.This course is a PRACE training course. |
536 | _ | _ | |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5112 |c POF4-511 |f POF IV |x 0 |
536 | _ | _ | |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5111 |c POF4-511 |f POF IV |x 1 |
536 | _ | _ | |a PRACE-6IP - PRACE 6th Implementation Phase Project (823767) |0 G:(EU-Grant)823767 |c 823767 |f H2020-INFRAEDI-2018-1 |x 2 |
536 | _ | _ | |a ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV) |0 G:(DE-Juel-1)ATML-X-DEV |c ATML-X-DEV |x 3 |
700 | 1 | _ | |a Meinke, Jan |0 P:(DE-Juel1)132189 |b 1 |e Corresponding author |u fzj |
700 | 1 | _ | |a Haghighi Mood, Kaveh |0 P:(DE-Juel1)176293 |b 2 |u fzj |
700 | 1 | _ | |a Kraus, Jiri |0 P:(DE-Juel1)137023 |b 3 |u fzj |
700 | 1 | _ | |a Hrywniak, Markus |0 P:(DE-Juel1)180799 |b 4 |u fzj |
856 | 4 | _ | |u https://www.fz-juelich.de/en/ias/jsc/news/events/training-courses/2022/ptc-gpu-cuda-2022 |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/01-aherten-cuda-intro.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/02-mhrywniak-cuda_tools.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/03-haghighi-Matrix_Multiplication.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/04-mhrywniak-perf_opt.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/05-jkraus-Multi_GPU_Programming_with_MPI_and_CUDA.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/06-haghighi-CUDA_Streams_and_Events.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/07-haghighi-CUDA_DGEMM_Tiled.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/08-aherten-cooperative-groups.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/09-jmeinke-CUDA-C%2B%2B.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/10-jmeinke-CUB_intro.pdf |y OpenAccess |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/915940/files/11-haghighi-CudaFortran.pdf |y OpenAccess |
909 | C | O | |o oai:juser.fz-juelich.de:915940 |p openaire |p open_access |p VDB |p driver |p ec_fundedresources |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)145478 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 1 |6 P:(DE-Juel1)132189 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 2 |6 P:(DE-Juel1)176293 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 3 |6 P:(DE-Juel1)137023 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 4 |6 P:(DE-Juel1)180799 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5112 |x 0 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5111 |x 1 |
914 | 1 | _ | |y 2022 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a lecture |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | 1 | _ | |a FullTexts |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|