HPC CUDA Compiler Performance Comparison
| Coursework | FZJ-2026-02359 |
2025
Please use a persistent id in citations: doi:10.34734/FZJ-2026-02359
Abstract: This seminar thesis compares the performance of selected CUDA samples and a real-world application when built with the compilers Clang and NVCC combined with GCC and run on the HPC system JURECA-DC with five optimization flags -O0 up to -O3, -Ofast and -ffast-math. An empirical experiment was run to benchmark compilation time, execution time, and binary size. The benchmarks were automated with scripts that compile and execute each program with Clang and NVCC at all five optimization stages. The analysis of the results reveals that neither compiler can be clearly recommended: performance depends strongly on the compiler, its optimization flag, and the problem.