Lecture (Other) FZJ-2023-05224

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
GPU Programming Part 2: Advanced GPU Programming

 ;  ;  ;  ;

2023

Lecture at JSC - as part of the Training Programme of Forschungszentrum Jülich (online, Germany), 19 Jun 2023 - 23 Jun 20232023-06-192023-06-23 [10.34734/FZJ-2023-05224]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU.The course covers aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C++, which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. The foundations of GPU programming are covered in another dedicated Basic Course.This advanced course consists of modules providing more in-depth coverage of multi-GPU programming, modern CUDA concepts, CUDA Fortran, and portable programming models such as OpenACC and C++ parallel STL algorithms.


Note: See https://juser.fz-juelich.de/record/1019123 for the JuSER entry for the Basic Course.

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  3. 5122 - Future Computing & Big Data Systems (POF4-512) (POF4-512)
  4. ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV) (ATML-X-DEV)

Appears in the scientific report 2023
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Dokumenttypen > Präsentationen > Vorlesungen
Workflowsammlungen > Öffentliche Einträge
Institutssammlungen > JSC
Publikationsdatenbank
Open Access

 Datensatz erzeugt am 2023-12-08, letzte Änderung am 2025-08-22


OpenAccess:
01-MPI - Volltext herunterladen PDF
03c-cuda_cpp - Volltext herunterladen PDF
03d-CUB - Volltext herunterladen PDF
03a-cooperative-groups - Volltext herunterladen PDF
04-CudaFortran - Volltext herunterladen PDF
05a-openacc - Volltext herunterladen PDF
03b-cuda_graphs - Volltext herunterladen PDF
02-NCCL_NVSHMEM - Volltext herunterladen PDF
05b-pSTL - Volltext herunterladen PDF
Externer link:
Volltext herunterladenVolltext
Dieses Dokument bewerten:

Rate this document:
1
2
3
 
(Bisher nicht rezensiert)