Lecture (Other) FZJ-2023-05224

GPU Programming Part 2: Advanced GPU Programming

2023

Lecture at JSC - as part of the Training Programme of Forschungszentrum Jülich (online, Germany), 19 Jun 2023 - 23 Jun 2023 [10.34734/FZJ-2023-05224]

Please use a persistent id in citations: doi:10.34734/FZJ-2023-05224

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU. The course covers aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C++, which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. The foundations of GPU programming are covered in another dedicated Basic Course. This advanced course consists of modules providing more in-depth coverage of multi-GPU programming, modern CUDA concepts, CUDA Fortran, and portable programming models such as OpenACC and C++ parallel STL algorithms.


Note: See https://juser.fz-juelich.de/record/1019123 for the JuSER entry for the Basic Course.

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
  2. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
  3. 5122 - Future Computing & Big Data Systems (POF4-512)
  4. ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)

Appears in the scientific report 2023
Database coverage:
OpenAccess

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2023-12-08, last modified 2025-08-22


OpenAccess:
01-MPI (fulltext PDF)
02-NCCL_NVSHMEM (fulltext PDF)
03a-cooperative-groups (fulltext PDF)
03b-cuda_graphs (fulltext PDF)
03c-cuda_cpp (fulltext PDF)
03d-CUB (fulltext PDF)
04-CudaFortran (fulltext PDF)
05a-openacc (fulltext PDF)
05b-pSTL (fulltext PDF)