001     859687
005     20210130000351.0
024 7 _ |a arXiv:1808.05506
|2 arXiv
024 7 _ |a 2128/21382
|2 Handle
024 7 _ |a altmetric:46609682
|2 altmetric
037 _ _ |a FZJ-2019-00526
041 _ _ |a English
100 1 _ |a Durr, Stephan
|0 P:(DE-Juel1)132580
|b 0
|e Corresponding author
111 2 _ |a 36th Annual International Symposium on Lattice Field Theory, Lattice 2018
|g Lattice 2018
|c East Lansing
|d 2018-07-22 - 2018-07-28
|w USA
245 _ _ |a Three Dirac operators on two architectures with one piece of code and no hassle
260 _ _ |a Trieste
|c 2018
|b SISSA
300 _ _ |a 7 p.
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a INPROCEEDINGS
|2 BibTeX
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1548423558_27135
|2 PUB:(DE-HGF)
336 7 _ |a Contribution to a book
|0 PUB:(DE-HGF)7
|2 PUB:(DE-HGF)
|m contb
490 0 _ |a Proceedings of Science
|v LATTICE2018
520 _ _ |a A simple minded approach to implement three discretizations of the Dirac operator (staggered, Wilson, Brillouin) on two architectures (KNL and core i7) is presented. The idea is to use a high-level compiler along with OpenMP parallelization and SIMD pragmas, but to stay away from cache-line optimization and/or assembly-tuning. The implementation is for N_v right-hand-sides, and this extra index is used to fill the SIMD pipeline. On one KNL node single precision performance figures for N_c=3, N_v=12 read 475 Gflop/s, 345 Gflop/s, and 790 Gflop/s for the three discretization schemes, respectively.
536 _ _ |a 511 - Computational Science and Mathematical Methods (POF3-511)
|0 G:(DE-HGF)POF3-511
|c POF3-511
|f POF III
|x 0
588 _ _ |a Dataset connected to arXivarXiv
773 _ _ |p 033
856 4 _ |y OpenAccess
|u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf
856 4 _ |y OpenAccess
|x pdfa
|u https://juser.fz-juelich.de/record/859687/files/1808.05506.pdf?subformat=pdfa
909 C O |o oai:juser.fz-juelich.de:859687
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)132580
913 1 _ |a DE-HGF
|b Key Technologies
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-511
|2 G:(DE-HGF)POF3-500
|v Computational Science and Mathematical Methods
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF3
|l Supercomputing & Big Data
914 1 _ |y 2018
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
915 _ _ |a Creative Commons Attribution-NonCommercial-NoDerivs CC BY-NC-ND 4.0
|0 LIC:(DE-HGF)CCBYNCND4
|2 HGFVOC
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a contb
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21