Contribution to a conference proceedings FZJ-2016-00642

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Portable Node-Level Performance Optimization for the Fast Multipole Method

 ;

2015
Springer International Publishing Cham
ISBN: 978-3-319-22996-6, 978-3-319-22997-3 (electronic)

Recent Trends in Computational Engineering - CE2014
3rd International Workshop on Computational Engineering, CE 2014, StuttgartStuttgart, Germany, 6 Oct 2014 - 10 Oct 20142014-10-062014-10-10
Cham : Springer International Publishing, Lecture Notes in Computational Science and Engineering 105, 29 - 46 () [10.1007/978-3-319-22997-3_2]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: This article provides an in-depth analysis and high-level C++ optimization strategies for the most time-consuming kernels of a Fast Multipole Method (FMM). The two main kernels of a Coulomb FMM are formulated to support different hardware features, such as unrolling, vectorization or threading without the need to rewrite the kernels in intrinsics or even assembly. The abstract description of the algorithm automatically allows optimal node-level peak performance on a broad class of available hardware platforms. Most of the presented optimization schemes allow a generic, hence platform-independent description for other kernels as well.

Classification:

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 511 - Computational Science and Mathematical Methods (POF3-511) (POF3-511)
  2. GromEx - Highly Scalable Unified Long-Range Electrostatics and Flexible Ionization for Realistic Biomolecular Simulations on the Exascale (230673686) (230673686)

Appears in the scientific report 2015
Database coverage:
No Authors Fulltext
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2016-01-18, last modified 2021-01-29



Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)