000280931 001__ 280931
000280931 005__ 20210129221443.0
000280931 020__ $$a978-3-319-22996-6
000280931 020__ $$a978-3-319-22997-3 (electronic)
000280931 0247_ $$2doi$$a10.1007/978-3-319-22997-3_2
000280931 037__ $$aFZJ-2016-00642
000280931 041__ $$aEnglish
000280931 082__ $$a510
000280931 1001_ $$0P:(DE-Juel1)157750$$aBeckmann, Andreas$$b0$$eCorresponding author
000280931 1112_ $$a3rd International Workshop on Computational Engineering$$cStuttgart$$d2014-10-06 - 2014-10-10$$gCE 2014$$wGermany
000280931 245__ $$aPortable Node-Level Performance Optimization for the Fast Multipole Method
000280931 260__ $$aCham$$bSpringer International Publishing$$c2015
000280931 29510 $$aRecent Trends in Computational Engineering - CE2014
000280931 300__ $$a29 - 46
000280931 3367_ $$0PUB:(DE-HGF)8$$2PUB:(DE-HGF)$$aContribution to a conference proceedings$$bcontrib$$mcontrib$$s1453379840_2884
000280931 3367_ $$033$$2EndNote$$aConference Paper
000280931 3367_ $$2ORCID$$aCONFERENCE_PAPER
000280931 3367_ $$2DataCite$$aOutput Types/Conference Paper
000280931 3367_ $$2DRIVER$$aconferenceObject
000280931 3367_ $$2BibTeX$$aINPROCEEDINGS
000280931 4900_ $$aLecture Notes in Computational Science and Engineering$$v105
000280931 520__ $$aThis article provides an in-depth analysis and high-level C++ optimization strategies for the most time-consuming kernels of a Fast Multipole Method (FMM). The two main kernels of a Coulomb FMM are formulated to support different hardware features, such as unrolling, vectorization or threading without the need to rewrite the kernels in intrinsics or even assembly. The abstract description of the algorithm automatically allows optimal node-level peak performance on a broad class of available hardware platforms. Most of the presented optimization schemes allow a generic, hence platform-independent description for other kernels as well.
000280931 536__ $$0G:(DE-HGF)POF3-511$$a511 - Computational Science and Mathematical Methods (POF3-511)$$cPOF3-511$$fPOF III$$x0
000280931 536__ $$0G:(GEPRIS)230673686$$aGromEx - Highly Scalable Unified Long-Range Electrostatics and Flexible Ionization for Realistic Biomolecular Simulations on the Exascale (230673686)$$c230673686$$x1
000280931 588__ $$aDataset connected to CrossRef Book Series
000280931 7001_ $$0P:(DE-Juel1)132152$$aKabadshow, Ivo$$b1
000280931 773__ $$a10.1007/978-3-319-22997-3_2
000280931 909CO $$ooai:juser.fz-juelich.de:280931$$pVDB
000280931 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)157750$$aForschungszentrum Jülich GmbH$$b0$$kFZJ
000280931 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132152$$aForschungszentrum Jülich GmbH$$b1$$kFZJ
000280931 9131_ $$0G:(DE-HGF)POF3-511$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$3G:(DE-HGF)POF3$$4G:(DE-HGF)POF$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vComputational Science and Mathematical Methods$$x0
000280931 9141_ $$y2015
000280931 915__ $$0StatID:(DE-HGF)0550$$2StatID$$aNo Authors Fulltext
000280931 920__ $$lyes
000280931 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000280931 980__ $$acontrib
000280931 980__ $$aVDB
000280931 980__ $$aUNRESTRICTED
000280931 980__ $$aI:(DE-Juel1)JSC-20090406