Lecture (Other) FZJ-2021-05271

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
PRACE Training Course: GPU Programming with CUDA

 ;  ;  ;  ;

2021

Lecture at (), 26 Apr 2021 - 30 Apr 20212021-04-262021-04-30

Please use a persistent id in citations:

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course will cover basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA-C which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications.Topics covered include: Introduction to GPU/Parallel computing Programming model CUDA GPU libraries like CuBLAS and CuFFT Tools for debugging and profiling Performance optimizationsThis course is a PRACE training course.


Note: Online course within the PRACE and FZJ training program.

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5121 - Supercomputing & Big Data Facilities (POF4-512) (POF4-512)
  2. 5122 - Future Computing & Big Data Systems (POF4-512) (POF4-512)
  3. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  4. ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV) (ATML-X-DEV)

Appears in the scientific report 2021
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2021-12-15, last modified 2025-08-22


OpenAccess:
05-Multi-GPU - Download fulltext PDF
08-Cooperative-Groups - Download fulltext PDF
04-Performance-Opt - Download fulltext PDF
00-Overview - Download fulltext PDF
10-CUB - Download fulltext PDF
09-CUDA-C++ - Download fulltext PDF
06-CUDA-Streams-Events - Download fulltext PDF
02-CUDA-Tools - Download fulltext PDF
07-Matrix-Mul-Tiled - Download fulltext PDF
11-CUDA-Fortran - Download fulltext PDF
01-CUDA-Intro - Download fulltext PDF
03-Matrix-Mul - Download fulltext PDF
External link:
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)