001     911757
005     20221124131044.0
024 7 _ |a 2128/32759
|2 Handle
037 _ _ |a FZJ-2022-05009
041 _ _ |a English
100 1 _ |a Di Napoli, Edoardo
|0 P:(DE-Juel1)144723
|b 0
|e Corresponding author
|u fzj
111 2 _ |a CECAM workshop -- Challenges and Advances in Solving Eigenproblems for Electronic-Structure Theory
|c Lausanne
|d 2022-11-15 - 2022-11-18
|w Switzerland
245 _ _ |a ChAS(E)ing Hermitian dense eigenproblems with subspace iteration on large scale hybrid platforms with application to DFT
260 _ _ |c 2022
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a Other
|2 DataCite
336 7 _ |a INPROCEEDINGS
|2 BibTeX
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a LECTURE_SPEECH
|2 ORCID
336 7 _ |a Conference Presentation
|b conf
|m conf
|0 PUB:(DE-HGF)6
|s 1669205955_30325
|2 PUB:(DE-HGF)
|x Invited
520 _ _ |a As modern massively parallel clusters are getting larger with beefier compute nodes, traditional parallel eigensolvers, such as direct solvers, struggle keeping the pace with the hardware evolution and being able to scale efficiently due to additional layers of communication and synchronization. This difficulty is especially important when porting traditional libraries to heterogeneous computing architectures equipped with accelerators, such as Graphics Processing Unit (GPU). Recently, there have been significant scientific contributions to the development of filter-based subspace eigensolver to compute partial eigenspectrum. The simpler structure of these type of algorithms makes for them easier to avoid the communication and synchronization bottlenecks typical of direct solvers. The Chebyshev Accelerated Subspace Eigensolver (ChASE) is a modern subspace eigensolver to compute partial extremal eigenpairs of large-scale Hermitian eigenproblems with the acceleration of a filter based on Chebyshev polynomials.In this talk, we report on the latest versions of the ChASE library by describing (i) its support for distributed hybrid CPU-multi-GPU computing architectures, and (ii) the very recent development of partial distribution of a combination of Householder- Cholesk-QR factorization and its impact on time-to-solution and memory footprint. Benchmarks on a modern heterogeneous cluster (JURECA-DC) based on double socket AMD Epyc Rome CPU and 4 NVIDIA GPUs per node are provided. Typical application of ChASE are sequences of large Hermitian eigenproblems as they appear in LAPW methods. ChASE is also application-code ready, in the sense that comes with a simple C++/Fortran interface that allows its integration with typical electronic structure legacy codes.
536 _ _ |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|f POF IV
|x 0
536 _ _ |a Simulation and Data Laboratory Quantum Materials (SDLQM) (SDLQM)
|0 G:(DE-Juel1)SDLQM
|c SDLQM
|f Simulation and Data Laboratory Quantum Materials (SDLQM)
|x 1
700 1 _ |a Wu, Xinzhe
|0 P:(DE-Juel1)178969
|b 1
|u fzj
856 4 _ |u https://juser.fz-juelich.de/record/911757/files/CECAM_Lausanne_ChASE.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:911757
|p openaire
|p open_access
|p VDB
|p driver
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)144723
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)178969
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 0
914 1 _ |y 2022
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 _ _ |l no
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a conf
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21