Lecture (Other) FZJ-2024-07560

GPU Programming Part 1: Foundations


2024

Lecture at JSC, as part of the Training Programme of Forschungszentrum Jülich (Jülich, Germany), 8 Apr 2024 - 10 Apr 2024 [10.34734/FZJ-2024-07560]


Please use a persistent id in citations: doi:10.34734/FZJ-2024-07560

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU. The course will cover aspects of GPU architectures and programming. The focus is on the parallel programming language CUDA C++, which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. This is a basic course covering the foundations of GPU programming, including an introduction to GPU/parallel computing, programming with CUDA, GPU libraries, tools for debugging and profiling, and performance optimizations. Topics covered will include an introduction to GPUs and GPU computing, the CUDA programming model, tools for debugging and profiling, GPU libraries (such as cuBLAS and cuFFT), and an introduction to multi-GPU programming.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
  2. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
  3. 5122 - Future Computing & Big Data Systems (POF4-512)
  4. ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)

Appears in the scientific report 2024
Database coverage:
OpenAccess

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2024-12-20, last modified 2025-08-22


OpenAccess:
Multi_GPU_Programming_with_MPI_and_CUDA - Download fulltext PDF
02_cuda_tools_mhrywniak - Download fulltext PDF
04_cuda_transpose_mhrywniak - Download fulltext PDF
3-Matrix_Multiplication - Download fulltext PDF
aherten-cuda-intro - Download fulltext PDF
CUDA_Streams_and_Events - Download fulltext PDF