Contribution to a conference proceedings FZJ-2015-03205

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Polynomial Preconditioning for the Conjugate Gradient Method on Massively Parallel Systems

 ;

1995
Institut für Informatik Clausthal-Zellerfeld

Proceedings des 13. Workshops über Parallelverarbeitung
13. Workshop über Parallelverarbeitung, LessachLessach, Austria, 25 Sep 1994 - 1 Oct 19941994-09-251994-10-01
Clausthal-Zellerfeld : Institut für Informatik, Informatik-Bericht 95/1, 150-167 ()

Please use a persistent id in citations:

Abstract: A frequently used iterative algorithm for solving large, sparse, symmetric and positiv definite systems of linear equations is the method of conjugate gradients (CG).This method requires one matrix-vector product and some dot products in each iteration. Convergence is dependent on the condition number of the coefficient matrix. So preconditioning techniques are used to reduce the number of iterations.In this context, polynomial preconditioning was developed. This method decreases the total number of dot products by reducing the total number of iterations. Of course, some additional work has to be done for the preconditioning. When a polynomial of degree k is used, k matrix-vector products per iteration have to be calculated rather than one. On scalar machines, this shift between matrix-vector products and dot products influences the performance of the algorithm only slightly. On massively parallel systems, dot products require global synchronization, while the calculation of matrix-vector products merely results in communication with a small number of processors. Hence, polynomial preconditioned CG seems to scale better than CG without preconditioning. Of course, this is not the case in general. The performance of this preconditioner depends on several issues, e.g., the sparsity pattern and the eigenvalue distribution of the matrix, an efficient communication scheme for the matrix-vector products and the time needed for global synchronization of the specific parallel machine. The actual implementation used here is based on Chebyshev polynomials. Performance tests were carried out on the Intel Paragon XP/S 10 with 140 nodes at the Research Centre Jülich (KFA). The CG method with polynomial preconditioning shows better performance and scalability than the basic method on massively parallel machines. Additionally there are some numerical advantages like a higher accuracy and an increased stability.

Classification:

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
  2. Zentralinstitut für Angewandte Mathematik (ZAM)
Research Program(s):
  1. 899 - ohne Topic (POF2-899) (POF2-899)

Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2015-05-19, last modified 2021-01-29


OpenAccess:
Download fulltext PDF
External link:
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)