Home > Publications database > Efficient parallel implementation of the ChASE library on distributed CPU-GPU architectures > print |
001 | 825379 | ||
005 | 20221109161714.0 | ||
024 | 7 | _ | |a 2128/13330 |2 Handle |
037 | _ | _ | |a FZJ-2016-07842 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Di Napoli, Edoardo |0 P:(DE-Juel1)144723 |b 0 |e Corresponding author |u fzj |
111 | 2 | _ | |a Joint Laboratory for Extreme Scale Computing |g JLESC |c Kobe |d 2016-11-30 - 2016-12-02 |w Japan |
245 | _ | _ | |a Efficient parallel implementation of the ChASE library on distributed CPU-GPU architectures |
260 | _ | _ | |c 2016 |
336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
336 | 7 | _ | |a Other |2 DataCite |
336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
336 | 7 | _ | |a conferenceObject |2 DRIVER |
336 | 7 | _ | |a LECTURE_SPEECH |2 ORCID |
336 | 7 | _ | |a Conference Presentation |b conf |m conf |0 PUB:(DE-HGF)6 |s 1482342796_11196 |2 PUB:(DE-HGF) |x Invited |
520 | _ | _ | |a The Chebyshev Accelerated Subspace iteration Eigensolver (ChASE) is an iterative eigensolver developed at the JSC by the SimLab ab initio. The solver target principally sequences of dense eigenvalue problems as they arise in Density functional Theory, but can also work on the single eigenproblem. ChASE leverages on the preponderant use of BLAS 3 subroutines to achieve close-to-peak performance. Currently, the library can be executed in parallel on many- and multi-core platforms. The latest development of this project dealt with the extension of the CUDA build to encompass multiple GPUs on distinct CPUs. As such this hybrid parallelization will use MPI as well as CUDA interfaces effectively exploiting heterogeneous multi-GPU platforms. The extended library was tested on large and dense eigenproblems extracted from excitonic Hamiltonian. The ultimate goal is to integrate this new parallel implementation of ChASE with the VASP-BSE code. |
536 | _ | _ | |a 511 - Computational Science and Mathematical Methods (POF3-511) |0 G:(DE-HGF)POF3-511 |c POF3-511 |f POF III |x 0 |
536 | _ | _ | |a Simulation and Data Laboratory Quantum Materials (SDLQM) (SDLQM) |0 G:(DE-Juel1)SDLQM |c SDLQM |f Simulation and Data Laboratory Quantum Materials (SDLQM) |x 2 |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.pdf |
856 | 4 | _ | |y OpenAccess |x icon |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.gif?subformat=icon |
856 | 4 | _ | |y OpenAccess |x icon-1440 |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.jpg?subformat=icon-1440 |
856 | 4 | _ | |y OpenAccess |x icon-180 |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.jpg?subformat=icon-180 |
856 | 4 | _ | |y OpenAccess |x icon-640 |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.jpg?subformat=icon-640 |
856 | 4 | _ | |y OpenAccess |x pdfa |u https://juser.fz-juelich.de/record/825379/files/JLESC_ChASE_slides.pdf?subformat=pdfa |
909 | C | O | |o oai:juser.fz-juelich.de:825379 |p openaire |p open_access |p driver |p VDB |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)144723 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |1 G:(DE-HGF)POF3-510 |0 G:(DE-HGF)POF3-511 |2 G:(DE-HGF)POF3-500 |v Computational Science and Mathematical Methods |x 0 |4 G:(DE-HGF)POF |3 G:(DE-HGF)POF3 |l Supercomputing & Big Data |
914 | 1 | _ | |y 2016 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
920 | 1 | _ | |0 I:(DE-82)080012_20140620 |k JARA-HPC |l JARA - HPC |x 1 |
980 | 1 | _ | |a FullTexts |
980 | _ | _ | |a conf |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | _ | _ | |a I:(DE-82)080012_20140620 |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|