001049780 001__ 1049780
001049780 005__ 20251229194844.0
001049780 037__ $$aFZJ-2025-05564
001049780 1001_ $$0P:(DE-Juel1)145478$$aHerten, Andreas$$b0$$eCorresponding author
001049780 1112_ $$aJSC - as part of the Training Programme of Forschungszentrum Jülich$$cJülich$$d2025-03-31 - 2025-04-02$$wGermany
001049780 245__ $$aGPU Programming Part 1: Foundations
001049780 260__ $$c2025
001049780 3367_ $$2DRIVER$$alecture
001049780 3367_ $$031$$2EndNote$$aGeneric
001049780 3367_ $$2BibTeX$$aMISC
001049780 3367_ $$0PUB:(DE-HGF)17$$2PUB:(DE-HGF)$$aLecture$$blecture$$mlecture$$s1767034014_8816$$xOther
001049780 3367_ $$2ORCID$$aLECTURE_SPEECH
001049780 3367_ $$2DataCite$$aText
001049780 520__ $$aGPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to a GPU. The course will cover aspects of GPU architectures and programming. The focus is on the usage of the parallel programming language CUDA C++, which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. This course is a basic course covering the foundations of GPU programming, including an introduction to GPU/parallel computing, programming with CUDA, GPU libraries, tools for debugging and profiling, and performance optimizations. Topics covered will include an introduction to GPUs and GPU computing, the CUDA programming model, tools for debugging and profiling, GPU libraries (like cuBLAS, cuFFT), and an introduction to multi-GPU programming.
001049780 536__ $$0G:(DE-HGF)POF4-5111$$a5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
001049780 536__ $$0G:(DE-HGF)POF4-5112$$a5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x1
001049780 536__ $$0G:(DE-HGF)POF4-5122$$a5122 - Future Computing & Big Data Systems (POF4-512)$$cPOF4-512$$fPOF IV$$x2
001049780 536__ $$0G:(DE-Juel1)FUEK412$$aGrundlagen für zukünftige Informationstechnologien (FUEK412)$$cFUEK412$$x3
001049780 536__ $$0G:(DE-Juel-1)ATML-X-DEV$$aATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV)$$cATML-X-DEV$$x4
001049780 536__ $$0G:(DE-Juel-1)DB001492$$aBMBF 01 1H1 6013, NRW 325 – 8.03 – 133340 - SiVeGCS (DB001492)$$cDB001492$$x5
001049780 588__ $$aDataset connected to DataCite
001049780 7001_ $$0P:(DE-Juel1)132189$$aMeinke, Jan$$b1$$eCorresponding author
001049780 7001_ $$0P:(DE-Juel1)176293$$aHaghighi Mood, Kaveh$$b2
001049780 7001_ $$0P:(DE-Juel1)192254$$aPenke, Carolin$$b3$$ufzj
001049780 7001_ $$0P:(DE-Juel1)137023$$aKraus, Jiri$$b4
001049780 7001_ $$0P:(DE-Juel1)208747$$aMorgenstern, Laura$$b5
001049780 8564_ $$uhttps://indico3-jsc.fz-juelich.de/event/218/
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/01_Introduction.pdf$$yRestricted
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/02_Tools.pdf$$yRestricted
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/03_Matrix_Multiplication.pdf$$yRestricted
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/04_Performance_Optimization.pdf$$yRestricted
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/05_Multi_GPU_Programming_w_MPI.pdf$$yRestricted
001049780 8564_ $$uhttps://juser.fz-juelich.de/record/1049780/files/06_Streams_and_Events.pdf$$yRestricted
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)145478$$aForschungszentrum Jülich$$b0$$kFZJ
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132189$$aForschungszentrum Jülich$$b1$$kFZJ
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)176293$$aForschungszentrum Jülich$$b2$$kFZJ
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)192254$$aForschungszentrum Jülich$$b3$$kFZJ
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)137023$$aForschungszentrum Jülich$$b4$$kFZJ
001049780 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)208747$$aForschungszentrum Jülich$$b5$$kFZJ
001049780 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5111$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
001049780 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5112$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x1
001049780 9131_ $$0G:(DE-HGF)POF4-512$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5122$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vSupercomputing & Big Data Infrastructures$$x2
001049780 9141_ $$y2025
001049780 920__ $$lyes
001049780 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
001049780 980__ $$alecture
001049780 980__ $$aEDITORS
001049780 980__ $$aVDBINPRINT
001049780 980__ $$aI:(DE-Juel1)JSC-20090406
001049780 980__ $$aUNRESTRICTED