TY  - COMP
AU  - Wu, Xinzhe
AU  - Di Napoli, Edoardo
TI  - ChASE-library/ChASE: ChASE (v1.4.0); 1.4.0
M1  - FZJ-2023-05376
PY  - 2023
AB  - Introduced a new distributed GPU-build of ChASE entirely based on the NVIDIA NCCL library, which avoids the explicit data movement between host and device memory, and leads to much faster collective communications among the involved GPUs. This new release achieves between a 1.5x and 3x with respect to the traditional distributed multi-GPUs build. Now ChASE can be compiled and executed with the following distinct parallel configurations: Distributed CPU only Distributed multi-GPUs (traditionally based on host-device communication standards) Distributed multi-GPUs (using NVIDIA NCCL library)
LB  - PUB:(DE-HGF)33
DO  - DOI:10.5281/ZENODO.8220811
UR  - https://juser.fz-juelich.de/record/1019419
ER  -