Contribution to a conference proceedings/Contribution to a book FZJ-2019-01120

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
BLAS-3 for the quadrics parallel computer

 ;  ;  ;  ;

1997
Springer Berlin Heidelberg Berlin, Heidelberg
ISBN: 978-3-540-62898-9 (print), 978-3-540-69041-2 (electronic)

High-Performance Computing and Networking / Hertzberger, Bob (Editor) ; Berlin, Heidelberg : Springer Berlin Heidelberg, 1997, Chapter 32 ; ISSN: 0302-9743=1611-3349 ; ISBN: 978-3-540-62898-9=978-3-540-69041-2 ; doi:10.1007/BFb0031573
International Conference on High-Performance Computing and Networking, ViennaVienna, Austria, 28 Apr 1997 - 30 Apr 19971997-04-281997-04-30
Berlin, Heidelberg : Springer Berlin Heidelberg, Lecture Notes in Computer Science 1225, 332 - 341 () [10.1007/BFb0031605]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: A scalable parallel algorithm for matrix multiplication on SISAMD computers is presented. Our method enables us to implement an efficient BLAS library on the Italian APE100/Quadrics SISAMD massively parallel computer on which hitherto scalable parallel BLAS-3 were not available. The approach proposed is based on a one-dimensional ring connectivity. The flow of data is hyper-systolic. The communication overhead is competitive with that of established algorithms for SIMD and MIMD machines. Advantages are that (i) the layout of the matrices is preserved during the computation, (ii) BLAS-2 fit well into this layout and (iii) indexed addressing is avoided, which renders the algorithm suitable for SISAMD machines and, in this way, for all other types of parallel computers. On the APE100/Quadrics, a performance of nearly 25 % of the peak performance for multiplications of complex matrices is achieved.



Database coverage:
NationallizenzNationallizenz ; SCOPUS
Click to display QR Code for this record

The record appears in these collections:
Externe Publikationen > Vita Publikationen
Institutssammlungen > JSC
NIC

 Datensatz erzeugt am 2019-01-31, letzte Änderung am 2020-09-14



Dieses Dokument bewerten:

Rate this document:
1
2
3
 
(Bisher nicht rezensiert)