| 001 | 859687 | ||
| 005 | 20210130000351.0 | ||
| 024 | 7 | _ | |a arXiv:1808.05506 |2 arXiv |
| 024 | 7 | _ | |a 2128/21382 |2 Handle |
| 024 | 7 | _ | |a altmetric:46609682 |2 altmetric |
| 037 | _ | _ | |a FZJ-2019-00526 |
| 041 | _ | _ | |a English |
| 100 | 1 | _ | |a Durr, Stephan |0 P:(DE-Juel1)132580 |b 0 |e Corresponding author |
| 111 | 2 | _ | |a 36th Annual International Symposium on Lattice Field Theory, Lattice 2018 |g Lattice 2018 |c East Lansing |d 2018-07-22 - 2018-07-28 |w USA |
| 245 | _ | _ | |a Three Dirac operators on two architectures with one piece of code and no hassle |
| 260 | _ | _ | |a Trieste |c 2018 |b SISSA |
| 300 | _ | _ | |a 7 p. |
| 336 | 7 | _ | |a CONFERENCE_PAPER |2 ORCID |
| 336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
| 336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
| 336 | 7 | _ | |a conferenceObject |2 DRIVER |
| 336 | 7 | _ | |a Output Types/Conference Paper |2 DataCite |
| 336 | 7 | _ | |a Contribution to a conference proceedings |b contrib |m contrib |0 PUB:(DE-HGF)8 |s 1548423558_27135 |2 PUB:(DE-HGF) |
| 336 | 7 | _ | |a Contribution to a book |0 PUB:(DE-HGF)7 |2 PUB:(DE-HGF) |m contb |
| 490 | 0 | _ | |a Proceedings of Science |v LATTICE2018 |
| 520 | _ | _ | |a A simple-minded approach to implementing three discretizations of the Dirac operator (staggered, Wilson, Brillouin) on two architectures (KNL and Core i7) is presented. The idea is to use a high-level compiler along with OpenMP parallelization and SIMD pragmas, but to stay away from cache-line optimization and/or assembly tuning. The implementation handles N_v right-hand sides, and this extra index is used to fill the SIMD pipeline. On one KNL node, single-precision performance figures for N_c=3, N_v=12 are 475 Gflop/s, 345 Gflop/s, and 790 Gflop/s for the three discretization schemes, respectively. |
| 536 | _ | _ | |a 511 - Computational Science and Mathematical Methods (POF3-511) |0 G:(DE-HGF)POF3-511 |c POF3-511 |f POF III |x 0 |
| 588 | _ | _ | |a Dataset connected to arXiv |
| 773 | _ | _ | |p 033 |
| 856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf |
| 856 | 4 | _ | |y OpenAccess |x pdfa |u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf?subformat=pdfa |
| 909 | C | O | |o oai:juser.fz-juelich.de:859687 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
| 910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)132580 |
| 913 | 1 | _ | |a DE-HGF |b Key Technologies |1 G:(DE-HGF)POF3-510 |0 G:(DE-HGF)POF3-511 |2 G:(DE-HGF)POF3-500 |v Computational Science and Mathematical Methods |x 0 |4 G:(DE-HGF)POF |3 G:(DE-HGF)POF3 |l Supercomputing & Big Data |
| 914 | 1 | _ | |y 2018 |
| 915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
| 915 | _ | _ | |a Creative Commons Attribution-NonCommercial-NoDerivs CC BY-NC-ND 4.0 |0 LIC:(DE-HGF)CCBYNCND4 |2 HGFVOC |
| 920 | _ | _ | |l yes |
| 920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
| 980 | _ | _ | |a contrib |
| 980 | _ | _ | |a VDB |
| 980 | _ | _ | |a UNRESTRICTED |
| 980 | _ | _ | |a contb |
| 980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
| 980 | 1 | _ | |a FullTexts |
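The abstract (field 520) describes combining OpenMP parallelization with SIMD pragmas and using the right-hand-side index N_v to fill the SIMD pipeline. The following is a minimal sketch of that pattern only, not the paper's code: it applies a toy one-dimensional periodic stencil instead of any of the three Dirac operators, and the function name `apply_toy_stencil`, the array layout (right-hand-side index running fastest), and the stencil itself are assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical toy operator (NOT one of the Dirac operators of the paper):
 *   out[x][v] = 2*in[x][v] - in[x+1][v] - in[x-1][v]   (periodic in x)
 * Assumed layout: the right-hand-side index v runs fastest, so the nv
 * values belonging to one site are contiguous in memory. */
void apply_toy_stencil(size_t nsites, size_t nv, const float *in, float *out)
{
    assert(nsites > 0 && nv > 0);

    /* Threads share the loop over lattice sites. */
    #pragma omp parallel for
    for (long x = 0; x < (long)nsites; ++x) {
        size_t xp = ((size_t)x + 1) % nsites;           /* forward neighbour  */
        size_t xm = ((size_t)x + nsites - 1) % nsites;  /* backward neighbour */

        const float *c = in + (size_t)x * nv;
        const float *f = in + xp * nv;
        const float *b = in + xm * nv;
        float *o = out + (size_t)x * nv;

        /* The right-hand-side index fills the SIMD lanes. */
        #pragma omp simd
        for (size_t v = 0; v < nv; ++v)
            o[v] = 2.0f * c[v] - f[v] - b[v];
    }
}
```

Compiled without OpenMP support, the pragmas are ignored and the loops run serially with the same result. Keeping the right-hand-side index innermost and contiguous gives the compiler a unit-stride loop to vectorize without any hand-written intrinsics.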