000911757 001__ 911757
000911757 005__ 20221124131044.0
000911757 0247_ $$2Handle$$a2128/32759
000911757 037__ $$aFZJ-2022-05009
000911757 041__ $$aEnglish
000911757 1001_ $$0P:(DE-Juel1)144723$$aDi Napoli, Edoardo$$b0$$eCorresponding author$$ufzj
000911757 1112_ $$aCECAM workshop -- Challenges and Advances in Solving Eigenproblems for Electronic-Structure Theory$$cLausanne$$d2022-11-15 - 2022-11-18$$wSwitzerland
000911757 245__ $$aChAS(E)ing Hermitian dense eigenproblems with subspace iteration on large scale hybrid platforms with application to DFT
000911757 260__ $$c2022
000911757 3367_ $$033$$2EndNote$$aConference Paper
000911757 3367_ $$2DataCite$$aOther
000911757 3367_ $$2BibTeX$$aINPROCEEDINGS
000911757 3367_ $$2DRIVER$$aconferenceObject
000911757 3367_ $$2ORCID$$aLECTURE_SPEECH
000911757 3367_ $$0PUB:(DE-HGF)6$$2PUB:(DE-HGF)$$aConference Presentation$$bconf$$mconf$$s1669205955_30325$$xInvited
000911757 520__ $$aAs modern massively parallel clusters are getting larger with beefier compute nodes, traditional parallel eigensolvers, such as direct solvers, struggle keeping the pace with the hardware evolution and being able to scale efficiently due to additional layers of communication and synchronization. This difficulty is especially important when porting traditional libraries to heterogeneous computing architectures equipped with accelerators, such as Graphics Processing Unit (GPU).  Recently, there have been significant scientific contributions to the development of filter-based subspace eigensolver to compute partial eigenspectrum. The simpler structure of these type of algorithms makes for them easier to avoid the communication and synchronization bottlenecks typical of direct solvers. The Chebyshev Accelerated Subspace Eigensolver (ChASE) is a modern subspace eigensolver to compute partial extremal eigenpairs of large-scale Hermitian eigenproblems with the acceleration of a filter based on Chebyshev polynomials.In this talk, we report on the latest versions of the ChASE library by describing (i) its support for distributed hybrid CPU-multi-GPU computing architectures, and (ii) the very recent development of partial distribution of a combination of Householder- Cholesk-QR factorization and its impact on time-to-solution and memory footprint. Benchmarks on a modern heterogeneous cluster (JURECA-DC) based on double socket AMD Epyc Rome CPU and 4 NVIDIA GPUs per node are provided. Typical application of ChASE are sequences of large Hermitian eigenproblems as they appear in LAPW methods. ChASE is also application-code ready, in the sense that comes with a simple C++/Fortran interface that allows its integration with typical electronic structure legacy codes.
000911757 536__ $$0G:(DE-HGF)POF4-5111$$a5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
000911757 536__ $$0G:(DE-Juel1)SDLQM$$aSimulation and Data Laboratory Quantum Materials (SDLQM) (SDLQM)$$cSDLQM$$fSimulation and Data Laboratory Quantum Materials (SDLQM)$$x1
000911757 7001_ $$0P:(DE-Juel1)178969$$aWu, Xinzhe$$b1$$ufzj
000911757 8564_ $$uhttps://juser.fz-juelich.de/record/911757/files/CECAM_Lausanne_ChASE.pdf$$yOpenAccess
000911757 909CO $$ooai:juser.fz-juelich.de:911757$$popenaire$$popen_access$$pVDB$$pdriver
000911757 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)144723$$aForschungszentrum Jülich$$b0$$kFZJ
000911757 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)178969$$aForschungszentrum Jülich$$b1$$kFZJ
000911757 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5111$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
000911757 9141_ $$y2022
000911757 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000911757 920__ $$lno
000911757 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000911757 980__ $$aconf
000911757 980__ $$aVDB
000911757 980__ $$aUNRESTRICTED
000911757 980__ $$aI:(DE-Juel1)JSC-20090406
000911757 9801_ $$aFullTexts