000830143 001__ 830143
000830143 005__ 20210129230430.0
000830143 0247_ $$2doi$$a10.1142/S0129183117500632
000830143 0247_ $$2ISSN$$a0129-1831
000830143 0247_ $$2ISSN$$a1793-6586
000830143 0247_ $$2WOS$$aWOS:000401622900007
000830143 0247_ $$2altmetric$$aaltmetric:15200733
000830143 037__ $$aFZJ-2017-03720
000830143 082__ $$a530
000830143 1001_ $$0P:(DE-HGF)0$$aBonati, Claudio$$b0
000830143 245__ $$aDesign and optimization of a portable LQCD Monte Carlo code using OpenACC
000830143 260__ $$aSingapore [u.a.]$$bWorld Scientific$$c2017
000830143 3367_ $$2DRIVER$$aarticle
000830143 3367_ $$2DataCite$$aOutput Types/Journal article
000830143 3367_ $$0PUB:(DE-HGF)16$$2PUB:(DE-HGF)$$aJournal Article$$bjournal$$mjournal$$s1495198726_3832
000830143 3367_ $$2BibTeX$$aARTICLE
000830143 3367_ $$2ORCID$$aJOURNAL_ARTICLE
000830143 3367_ $$00$$2EndNote$$aJournal Article
000830143 520__ $$aThe present panorama of HPC architectures is extremely heterogeneous, ranging from traditional multi-core CPU processors, supporting a wide class of applications but delivering moderate computing performance, to many-core Graphics Processor Units (GPUs), exploiting aggressive data-parallelism and delivering higher performances for streaming computing applications. In this scenario, code portability (and performance portability) become necessary for easy maintainability of applications; this is very relevant in scientific computing where code changes are very frequent, making it tedious and prone to error to keep different code versions aligned. In this work, we present the design and optimization of a state-of-the-art production-level LQCD Monte Carlo application, using the directive-based OpenACC programming model. OpenACC abstracts parallel programming to a descriptive level, relieving programmers from specifying how codes should be mapped onto the target architecture. We describe the implementation of a code fully written in OpenAcc, and show that we are able to target several different architectures, including state-of-the-art traditional CPUs and GPUs, with the same code. We also measure performance, evaluating the computing efficiency of our OpenACC code on several architectures, comparing with GPU-specific implementations and showing that a good level of performance-portability can be reached.
000830143 536__ $$0G:(DE-HGF)POF3-511$$a511 - Computational Science and Mathematical Methods (POF3-511)$$cPOF3-511$$fPOF III$$x0
000830143 588__ $$aDataset connected to CrossRef
000830143 7001_ $$0P:(DE-HGF)0$$aCoscetti, Simone$$b1
000830143 7001_ $$0P:(DE-HGF)0$$aD’Elia, Massimo$$b2
000830143 7001_ $$0P:(DE-HGF)0$$aMesiti, Michele$$b3
000830143 7001_ $$0P:(DE-HGF)0$$aNegro, Francesco$$b4
000830143 7001_ $$0P:(DE-HGF)0$$aCalore, Enrico$$b5
000830143 7001_ $$0P:(DE-HGF)0$$aSchifano, Sebastiano Fabio$$b6
000830143 7001_ $$0P:(DE-Juel1)171116$$aSilvi, Giorgio$$b7$$ufzj
000830143 7001_ $$0P:(DE-HGF)0$$aTripiccione, Raffaele$$b8
000830143 773__ $$0PERI:(DE-600)2006526-7$$a10.1142/S0129183117500632$$gVol. 28, no. 05, p. 1750063 -$$n05$$p1750063 -$$tInternational journal of modern physics / C$$v28$$x1793-6586$$y2017
000830143 909CO $$ooai:juser.fz-juelich.de:830143$$pVDB
000830143 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)171116$$aForschungszentrum Jülich$$b7$$kFZJ
000830143 9131_ $$0G:(DE-HGF)POF3-511$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$3G:(DE-HGF)POF3$$4G:(DE-HGF)POF$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vComputational Science and Mathematical Methods$$x0
000830143 9141_ $$y2017
000830143 915__ $$0StatID:(DE-HGF)0100$$2StatID$$aJCR$$bINT J MOD PHYS C : 2015
000830143 915__ $$0StatID:(DE-HGF)0200$$2StatID$$aDBCoverage$$bSCOPUS
000830143 915__ $$0StatID:(DE-HGF)0300$$2StatID$$aDBCoverage$$bMedline
000830143 915__ $$0StatID:(DE-HGF)0600$$2StatID$$aDBCoverage$$bEbsco Academic Search
000830143 915__ $$0StatID:(DE-HGF)0030$$2StatID$$aPeer Review$$bASC
000830143 915__ $$0StatID:(DE-HGF)0199$$2StatID$$aDBCoverage$$bThomson Reuters Master Journal List
000830143 915__ $$0StatID:(DE-HGF)0110$$2StatID$$aWoS$$bScience Citation Index
000830143 915__ $$0StatID:(DE-HGF)0150$$2StatID$$aDBCoverage$$bWeb of Science Core Collection
000830143 915__ $$0StatID:(DE-HGF)0111$$2StatID$$aWoS$$bScience Citation Index Expanded
000830143 915__ $$0StatID:(DE-HGF)1150$$2StatID$$aDBCoverage$$bCurrent Contents - Physical, Chemical and Earth Sciences
000830143 915__ $$0StatID:(DE-HGF)9900$$2StatID$$aIF < 5
000830143 920__ $$lyes
000830143 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000830143 980__ $$ajournal
000830143 980__ $$aVDB
000830143 980__ $$aI:(DE-Juel1)JSC-20090406
000830143 980__ $$aUNRESTRICTED