Hauptseite > Publikationsdatenbank > Three Dirac operators on two architectures with one piece of code and no hassle > print |
001 | 859687 | ||
005 | 20210130000351.0 | ||
024 | 7 | _ | |a arXiv:1808.05506 |2 arXiv |
024 | 7 | _ | |a 2128/21382 |2 Handle |
024 | 7 | _ | |a altmetric:46609682 |2 altmetric |
037 | _ | _ | |a FZJ-2019-00526 |
041 | _ | _ | |a English |
100 | 1 | _ | |a Durr, Stephan |0 P:(DE-Juel1)132580 |b 0 |e Corresponding author |
111 | 2 | _ | |a 36th Annual International Symposium on Lattice Field Theory, Lattice 2018 |g Lattice 2018 |c East Lansing |d 2018-07-22 - 2018-07-28 |w USA |
245 | _ | _ | |a Three Dirac operators on two architectures with one piece of code and no hassle |
260 | _ | _ | |a Trieste |c 2018 |b SISSA |
300 | _ | _ | |a 7 p. |
336 | 7 | _ | |a CONFERENCE_PAPER |2 ORCID |
336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
336 | 7 | _ | |a conferenceObject |2 DRIVER |
336 | 7 | _ | |a Output Types/Conference Paper |2 DataCite |
336 | 7 | _ | |a Contribution to a conference proceedings |b contrib |m contrib |0 PUB:(DE-HGF)8 |s 1548423558_27135 |2 PUB:(DE-HGF) |
336 | 7 | _ | |a Contribution to a book |0 PUB:(DE-HGF)7 |2 PUB:(DE-HGF) |m contb |
490 | 0 | _ | |a Proceedings of Science |v LATTICE2018 |
520 | _ | _ | |a A simple minded approach to implement three discretizations of the Dirac operator (staggered, Wilson, Brillouin) on two architectures (KNL and core i7) is presented. The idea is to use a high-level compiler along with OpenMP parallelization and SIMD pragmas, but to stay away from cache-line optimization and/or assembly-tuning. The implementation is for N_v right-hand-sides, and this extra index is used to fill the SIMD pipeline. On one KNL node single precision performance figures for N_c=3, N_v=12 read 475 Gflop/s, 345 Gflop/s, and 790 Gflop/s for the three discretization schemes, respectively. |
536 | _ | _ | |a 511 - Computational Science and Mathematical Methods (POF3-511) |0 G:(DE-HGF)POF3-511 |c POF3-511 |f POF III |x 0 |
588 | _ | _ | |a Dataset connected to arXivarXiv |
773 | _ | _ | |p 033 |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf |
856 | 4 | _ | |y OpenAccess |x pdfa |u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf?subformat=pdfa |
909 | C | O | |o oai:juser.fz-juelich.de:859687 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)132580 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |1 G:(DE-HGF)POF3-510 |0 G:(DE-HGF)POF3-511 |2 G:(DE-HGF)POF3-500 |v Computational Science and Mathematical Methods |x 0 |4 G:(DE-HGF)POF |3 G:(DE-HGF)POF3 |l Supercomputing & Big Data |
914 | 1 | _ | |y 2018 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
915 | _ | _ | |a Creative Commons Attribution-NonCommercial-NoDerivs CC BY-NC-ND 4.0 |0 LIC:(DE-HGF)CCBYNCND4 |2 HGFVOC |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a contrib |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a contb |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | 1 | _ | |a FullTexts |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|