001     830143
005     20210129230430.0
024 7 _ |a 10.1142/S0129183117500632
|2 doi
024 7 _ |a 0129-1831
|2 ISSN
024 7 _ |a 1793-6586
|2 ISSN
024 7 _ |a WOS:000401622900007
|2 WOS
024 7 _ |a altmetric:15200733
|2 altmetric
037 _ _ |a FZJ-2017-03720
082 _ _ |a 530
100 1 _ |a Bonati, Claudio
|0 P:(DE-HGF)0
|b 0
245 _ _ |a Design and optimization of a portable LQCD Monte Carlo code using OpenACC
260 _ _ |a Singapore [u.a.]
|c 2017
|b World Scientific
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1495198726_3832
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
520 _ _ |a The present panorama of HPC architectures is extremely heterogeneous, ranging from traditional multi-core CPU processors, supporting a wide class of applications but delivering moderate computing performance, to many-core Graphics Processor Units (GPUs), exploiting aggressive data-parallelism and delivering higher performances for streaming computing applications. In this scenario, code portability (and performance portability) become necessary for easy maintainability of applications; this is very relevant in scientific computing where code changes are very frequent, making it tedious and prone to error to keep different code versions aligned. In this work, we present the design and optimization of a state-of-the-art production-level LQCD Monte Carlo application, using the directive-based OpenACC programming model. OpenACC abstracts parallel programming to a descriptive level, relieving programmers from specifying how codes should be mapped onto the target architecture. We describe the implementation of a code fully written in OpenAcc, and show that we are able to target several different architectures, including state-of-the-art traditional CPUs and GPUs, with the same code. We also measure performance, evaluating the computing efficiency of our OpenACC code on several architectures, comparing with GPU-specific implementations and showing that a good level of performance-portability can be reached.
536 _ _ |a 511 - Computational Science and Mathematical Methods (POF3-511)
|0 G:(DE-HGF)POF3-511
|c POF3-511
|f POF III
|x 0
588 _ _ |a Dataset connected to CrossRef
700 1 _ |a Coscetti, Simone
|0 P:(DE-HGF)0
|b 1
700 1 _ |a D’Elia, Massimo
|0 P:(DE-HGF)0
|b 2
700 1 _ |a Mesiti, Michele
|0 P:(DE-HGF)0
|b 3
700 1 _ |a Negro, Francesco
|0 P:(DE-HGF)0
|b 4
700 1 _ |a Calore, Enrico
|0 P:(DE-HGF)0
|b 5
700 1 _ |a Schifano, Sebastiano Fabio
|0 P:(DE-HGF)0
|b 6
700 1 _ |a Silvi, Giorgio
|0 P:(DE-Juel1)171116
|b 7
|u fzj
700 1 _ |a Tripiccione, Raffaele
|0 P:(DE-HGF)0
|b 8
773 _ _ |a 10.1142/S0129183117500632
|g Vol. 28, no. 05, p. 1750063 -
|0 PERI:(DE-600)2006526-7
|n 05
|p 1750063 -
|t International journal of modern physics / C
|v 28
|y 2017
|x 1793-6586
909 C O |o oai:juser.fz-juelich.de:830143
|p VDB
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 7
|6 P:(DE-Juel1)171116
913 1 _ |a DE-HGF
|b Key Technologies
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-511
|2 G:(DE-HGF)POF3-500
|v Computational Science and Mathematical Methods
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF3
|l Supercomputing & Big Data
914 1 _ |y 2017
915 _ _ |a JCR
|0 StatID:(DE-HGF)0100
|2 StatID
|b INT J MOD PHYS C : 2015
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0200
|2 StatID
|b SCOPUS
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0600
|2 StatID
|b Ebsco Academic Search
915 _ _ |a Peer Review
|0 StatID:(DE-HGF)0030
|2 StatID
|b ASC
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0199
|2 StatID
|b Thomson Reuters Master Journal List
915 _ _ |a WoS
|0 StatID:(DE-HGF)0110
|2 StatID
|b Science Citation Index
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0150
|2 StatID
|b Web of Science Core Collection
915 _ _ |a WoS
|0 StatID:(DE-HGF)0111
|2 StatID
|b Science Citation Index Expanded
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1150
|2 StatID
|b Current Contents - Physical, Chemical and Earth Sciences
915 _ _ |a IF < 5
|0 StatID:(DE-HGF)9900
|2 StatID
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a journal
980 _ _ |a VDB
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21