Lecture (Other) FZJ-2024-07562

GPU Programming Part 2: Advanced GPU Programming

2024

Lecture at JSC - as part of the Training Programme of Forschungszentrum Jülich (Jülich / online, Germany), 3 Jun 2024 - 7 Jun 2024 [10.34734/FZJ-2024-07562]

Please use a persistent id in citations: doi:10.34734/FZJ-2024-07562

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU. This advanced course consists of modules providing more in-depth coverage of multi-GPU programming, modern CUDA concepts, CUDA Fortran, and portable programming models such as OpenACC and C++ parallel STL algorithms.

Topics covered will include:
  A) Advanced Multi-GPU Programming with MPI
  B) Advanced Multi-GPU Programming with NCCL and NVSHMEM
  C) Advanced and Modern CUDA Concepts (Cooperative Groups, CUDA Graphs, CUB Primitives, Modern C++ Programming)
  D) CUDA Fortran
  E) GPU Programming with Abstractions (OpenACC, Standard Language Programming (pSTL))


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
  2. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
  3. 5122 - Future Computing & Big Data Systems (POF4-512)
  4. ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)

Appears in the scientific report 2024
Database coverage:
OpenAccess

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2024-12-20, last modified 2025-08-22


OpenAccess:
01-MPI - Download fulltext PDF
03b-CUB_intro - Download fulltext PDF
05a-openacc-intro - Download fulltext PDF
03d-cuda-graphs - Download fulltext PDF
02-NCCL_NVSHMEM - Download fulltext PDF
03a-CUDA-C++ - Download fulltext PDF
03c-cooperative-groups - Download fulltext PDF
05b-pSTL - Download fulltext PDF
04-kokkos - Download fulltext PDF