Hauptseite > Publikationsdatenbank > Design and optimization of a portable LQCD Monte Carlo code using OpenACC > print |
001 | 830143 | ||
005 | 20210129230430.0 | ||
024 | 7 | _ | |a 10.1142/S0129183117500632 |2 doi |
024 | 7 | _ | |a 0129-1831 |2 ISSN |
024 | 7 | _ | |a 1793-6586 |2 ISSN |
024 | 7 | _ | |a WOS:000401622900007 |2 WOS |
024 | 7 | _ | |a altmetric:15200733 |2 altmetric |
037 | _ | _ | |a FZJ-2017-03720 |
082 | _ | _ | |a 530 |
100 | 1 | _ | |a Bonati, Claudio |0 P:(DE-HGF)0 |b 0 |
245 | _ | _ | |a Design and optimization of a portable LQCD Monte Carlo code using OpenACC |
260 | _ | _ | |a Singapore [u.a.] |c 2017 |b World Scientific |
336 | 7 | _ | |a article |2 DRIVER |
336 | 7 | _ | |a Output Types/Journal article |2 DataCite |
336 | 7 | _ | |a Journal Article |b journal |m journal |0 PUB:(DE-HGF)16 |s 1495198726_3832 |2 PUB:(DE-HGF) |
336 | 7 | _ | |a ARTICLE |2 BibTeX |
336 | 7 | _ | |a JOURNAL_ARTICLE |2 ORCID |
336 | 7 | _ | |a Journal Article |0 0 |2 EndNote |
520 | _ | _ | |a The present panorama of HPC architectures is extremely heterogeneous, ranging from traditional multi-core CPU processors, supporting a wide class of applications but delivering moderate computing performance, to many-core Graphics Processor Units (GPUs), exploiting aggressive data-parallelism and delivering higher performances for streaming computing applications. In this scenario, code portability (and performance portability) become necessary for easy maintainability of applications; this is very relevant in scientific computing where code changes are very frequent, making it tedious and prone to error to keep different code versions aligned. In this work, we present the design and optimization of a state-of-the-art production-level LQCD Monte Carlo application, using the directive-based OpenACC programming model. OpenACC abstracts parallel programming to a descriptive level, relieving programmers from specifying how codes should be mapped onto the target architecture. We describe the implementation of a code fully written in OpenAcc, and show that we are able to target several different architectures, including state-of-the-art traditional CPUs and GPUs, with the same code. We also measure performance, evaluating the computing efficiency of our OpenACC code on several architectures, comparing with GPU-specific implementations and showing that a good level of performance-portability can be reached. |
536 | _ | _ | |a 511 - Computational Science and Mathematical Methods (POF3-511) |0 G:(DE-HGF)POF3-511 |c POF3-511 |f POF III |x 0 |
588 | _ | _ | |a Dataset connected to CrossRef |
700 | 1 | _ | |a Coscetti, Simone |0 P:(DE-HGF)0 |b 1 |
700 | 1 | _ | |a D’Elia, Massimo |0 P:(DE-HGF)0 |b 2 |
700 | 1 | _ | |a Mesiti, Michele |0 P:(DE-HGF)0 |b 3 |
700 | 1 | _ | |a Negro, Francesco |0 P:(DE-HGF)0 |b 4 |
700 | 1 | _ | |a Calore, Enrico |0 P:(DE-HGF)0 |b 5 |
700 | 1 | _ | |a Schifano, Sebastiano Fabio |0 P:(DE-HGF)0 |b 6 |
700 | 1 | _ | |a Silvi, Giorgio |0 P:(DE-Juel1)171116 |b 7 |u fzj |
700 | 1 | _ | |a Tripiccione, Raffaele |0 P:(DE-HGF)0 |b 8 |
773 | _ | _ | |a 10.1142/S0129183117500632 |g Vol. 28, no. 05, p. 1750063 - |0 PERI:(DE-600)2006526-7 |n 05 |p 1750063 - |t International journal of modern physics / C |v 28 |y 2017 |x 1793-6586 |
909 | C | O | |o oai:juser.fz-juelich.de:830143 |p VDB |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 7 |6 P:(DE-Juel1)171116 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |1 G:(DE-HGF)POF3-510 |0 G:(DE-HGF)POF3-511 |2 G:(DE-HGF)POF3-500 |v Computational Science and Mathematical Methods |x 0 |4 G:(DE-HGF)POF |3 G:(DE-HGF)POF3 |l Supercomputing & Big Data |
914 | 1 | _ | |y 2017 |
915 | _ | _ | |a JCR |0 StatID:(DE-HGF)0100 |2 StatID |b INT J MOD PHYS C : 2015 |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0200 |2 StatID |b SCOPUS |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0300 |2 StatID |b Medline |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0600 |2 StatID |b Ebsco Academic Search |
915 | _ | _ | |a Peer Review |0 StatID:(DE-HGF)0030 |2 StatID |b ASC |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0199 |2 StatID |b Thomson Reuters Master Journal List |
915 | _ | _ | |a WoS |0 StatID:(DE-HGF)0110 |2 StatID |b Science Citation Index |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0150 |2 StatID |b Web of Science Core Collection |
915 | _ | _ | |a WoS |0 StatID:(DE-HGF)0111 |2 StatID |b Science Citation Index Expanded |
915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)1150 |2 StatID |b Current Contents - Physical, Chemical and Earth Sciences |
915 | _ | _ | |a IF < 5 |0 StatID:(DE-HGF)9900 |2 StatID |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a journal |
980 | _ | _ | |a VDB |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | _ | _ | |a UNRESTRICTED |
Library | Collection | CLSMajor | CLSMinor | Language | Author |
---|