001     150535
005     20221109161708.0
037 _ _ |a FZJ-2014-00589
100 1 _ |0 P:(DE-Juel1)144723
|a Di Napoli, Edoardo
|b 0
111 2 _ |a Numerical Analysis and Scientific Computation with Applications
|g NASCA13
|c Calais
|d 2013-06-24 - 2013-06-26
|w France
245 _ _ |a Preconditioning Chebyshev subspace iteration applied to sequences of dense eigenproblems in ab initio simulations
260 _ _ |c 2013
336 7 _ |a Conference Presentation
|b conf
|m conf
|0 PUB:(DE-HGF)6
|s 1418826866_23273
|2 PUB:(DE-HGF)
|x Invited
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a Other
|2 DataCite
336 7 _ |a LECTURE_SPEECH
|2 ORCID
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a INPROCEEDINGS
|2 BibTeX
520 _ _ |a Research in several branches of chemistry and materials science relies on large ab initio numerical simulations. The majority of these simulations are based on computational methods developed within the framework of Density Functional Theory (DFT) [1]. Among the DFT-based methods, the Full-potential Linearized Augmented Plane Wave (FLAPW) method [2, 3] constitutes the most precise computational framework for calculating the ground-state energy of periodic and crystalline materials. FLAPW provides the means to solve a high-dimensional quantum mechanical problem by representing it as a non-linear generalized eigenvalue problem, which is solved self-consistently through a series of successive outer-iteration cycles. As a consequence, each self-consistent simulation is made of dozens of sequences of dense generalized eigenproblems P : Ax = λBx. Each sequence, P_1, ..., P_i, ..., P_N, groups together eigenproblems with increasing outer-iteration index i. Successive eigenproblems in a FLAPW-generated sequence possess a high degree of correlation. In particular, it has been demonstrated that eigenvectors of adjacent eigenproblems become progressively more collinear to each other as the outer-iteration index increases [4]. This result suggests that eigenvectors computed at a certain outer iteration could be used as approximate solutions to improve the performance of the eigensolver at the next one. In order to maximally exploit the approximate solutions, we developed a subspace iteration method augmented with an optimized Chebyshev polynomial accelerator together with an efficient locking mechanism (ChFSI). The resulting eigensolver was implemented in C and parallelized for both shared- and distributed-memory architectures. Numerical tests show that, when the eigensolver is preconditioned with approximate solutions instead of random vectors, it achieves up to a 5X speedup. 
Moreover, ChFSI makes efficient use of computational resources, reaching up to 80 % of the theoretical peak performance. In particular, by making better use of massively parallel architectures, the distributed-memory version will allow users of the FLAPW method to simulate larger physical systems than are currently accessible. Additionally, despite the eigenproblems in the sequence being relatively large and dense, the parallel ChFSI preconditioned with approximate solutions performs substantially better than the corresponding direct eigensolvers, even for a significant portion of the sought-after spectrum. [1] R. M. Dreizler and E. K. U. Gross, Density Functional Theory (Springer-Verlag, 1990). [2] A. J. Freeman, H. Krakauer, M. Weinert, and E. Wimmer, Phys. Rev. B 24 (1981) 864. [3] A. J. Freeman and H. J. F. Jansen, Phys. Rev. B 30 (1984) 561. [4] E. Di Napoli, S. Blügel, and P. Bientinesi, Comp. Phys. Comm. 183 (2012) 1674-1682 [arXiv:1108.2594].
536 _ _ |0 G:(DE-HGF)POF2-411
|a 411 - Computational Science and Mathematical Methods (POF2-411)
|c POF2-411
|f POF II
|x 0
536 _ _ |a Simulation and Data Laboratory Quantum Materials (SDLQM) (SDLQM)
|0 G:(DE-Juel1)SDLQM
|c SDLQM
|f Simulation and Data Laboratory Quantum Materials (SDLQM)
|x 2
700 1 _ |0 P:(DE-HGF)0
|a Berljafa, Mario
|b 1
773 _ _ |y 2013
909 C O |o oai:juser.fz-juelich.de:150535
|p VDB
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)144723
|a Forschungszentrum Jülich GmbH
|b 0
|k FZJ
913 1 _ |0 G:(DE-HGF)POF2-411
|1 G:(DE-HGF)POF2-410
|2 G:(DE-HGF)POF2-400
|a DE-HGF
|b Schlüsseltechnologien
|l Supercomputing
|v Computational Science and Mathematical Methods
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF2
914 1 _ |y 2013
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a conf
980 _ _ |a VDB
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a UNRESTRICTED

