Contribution to a conference proceedings/Contribution to a book FZJ-2019-00526

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Three Dirac operators on two architectures with one piece of code and no hassle



2018
SISSA Trieste

36th Annual International Symposium on Lattice Field Theory, Lattice 2018, Lattice 2018, East LansingEast Lansing, USA, 22 Jul 2018 - 28 Jul 20182018-07-222018-07-28 Trieste : SISSA, Proceedings of Science LATTICE2018, 7 p. ()

This record in other databases:

Please use a persistent id in citations:

Abstract: A simple minded approach to implement three discretizations of the Dirac operator (staggered, Wilson, Brillouin) on two architectures (KNL and core i7) is presented. The idea is to use a high-level compiler along with OpenMP parallelization and SIMD pragmas, but to stay away from cache-line optimization and/or assembly-tuning. The implementation is for N_v right-hand-sides, and this extra index is used to fill the SIMD pipeline. On one KNL node single precision performance figures for N_c=3, N_v=12 read 475 Gflop/s, 345 Gflop/s, and 790 Gflop/s for the three discretization schemes, respectively.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 511 - Computational Science and Mathematical Methods (POF3-511) (POF3-511)

Appears in the scientific report 2018
Database coverage:
Creative Commons Attribution-NonCommercial-NoDerivs CC BY-NC-ND 4.0 ; OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2019-01-18, last modified 2021-01-30