Lecture (Other) FZJ-2019-06710

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
GPU Programming with CUDA

 ;  ;  ;

2019

Lecture at PRACE Training Course (Jülich, Germany), 1 Apr 2019 - 3 Apr 20192019-04-012019-04-03

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course covers basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA-C which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 511 - Computational Science and Mathematical Methods (POF3-511) (POF3-511)
  2. 513 - Supercomputer Facility (POF3-513) (POF3-513)

Appears in the scientific report 2019
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2019-12-17, last modified 2021-01-30


Restricted:
03-unified-memory - Download fulltext PDF Download fulltext PDF (PDFA)
01-2-CUDA_Intro - Download fulltext PDF Download fulltext PDF (PDFA)
07-CUDA_Streams_and_Events - Download fulltext PDF Download fulltext PDF (PDFA)
04-Matrix_Multiplication - Download fulltext PDF Download fulltext PDF (PDFA)
10-CUDA-C++ - Download fulltext PDF Download fulltext PDF (PDFA)
00-jsc-cuda19-overview - Download fulltext PDF Download fulltext PDF (PDFA)
11-CUB_intro - Download fulltext PDF Download fulltext PDF (PDFA)
02-cuda-tools - Download fulltext PDF Download fulltext PDF (PDFA)
08-CUDA_DGEMM_Tiled - Download fulltext PDF Download fulltext PDF (PDFA)
09-aherten-cooperative-groups - Download fulltext PDF Download fulltext PDF (PDFA)
01-1-cuda-intro - Download fulltext PDF Download fulltext PDF (PDFA)
06-Multi_GPU_Programming_with_MPI_and_CUDA - Download fulltext PDF Download fulltext PDF (PDFA)
05-Performance_Optimization - Download fulltext PDF Download fulltext PDF (PDFA)
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)