| Home > Publications database > Towards extreme scale dissipative particle dynamics simulations using multiple GPGPUs > print |
| 001 | 889100 | ||
| 005 | 20210127115252.0 | ||
| 024 | 7 | _ | |a 10.1016/j.cpc.2020.107159 |2 doi |
| 024 | 7 | _ | |a 0010-4655 |2 ISSN |
| 024 | 7 | _ | |a 1386-9485 |2 ISSN |
| 024 | 7 | _ | |a 1879-2944 |2 ISSN |
| 024 | 7 | _ | |a 2128/26662 |2 Handle |
| 024 | 7 | _ | |a altmetric:75734051 |2 altmetric |
| 024 | 7 | _ | |a WOS:000528002400017 |2 WOS |
| 037 | _ | _ | |a FZJ-2021-00030 |
| 082 | _ | _ | |a 530 |
| 100 | 1 | _ | |a Castagna, Jony |0 P:(DE-HGF)0 |b 0 |
| 245 | _ | _ | |a Towards extreme scale dissipative particle dynamics simulations using multiple GPGPUs |
| 260 | _ | _ | |a Amsterdam |c 2020 |b North Holland Publ. Co. |
| 336 | 7 | _ | |a article |2 DRIVER |
| 336 | 7 | _ | |a Output Types/Journal article |2 DataCite |
| 336 | 7 | _ | |a Journal Article |b journal |m journal |0 PUB:(DE-HGF)16 |s 1609857861_18366 |2 PUB:(DE-HGF) |
| 336 | 7 | _ | |a ARTICLE |2 BibTeX |
| 336 | 7 | _ | |a JOURNAL_ARTICLE |2 ORCID |
| 336 | 7 | _ | |a Journal Article |0 0 |2 EndNote |
| 520 | _ | _ | |a A multi-GPGPU development for Mesoscale Simulations using the Dissipative Particle Dynamics method is presented. This distributed GPU acceleration development is an extension of the DL_MESO package to MPI+CUDA in order to exploit the computational power of the latest NVIDIA cards on hybrid CPU–GPU architectures. Details about the extensively applicable algorithm implementation and memory coalescing data structures are presented. The key algorithms’ optimizations for the nearest-neighbour list searching of particle pairs for short range forces, exchange of data and overlapping between computation and communications are also given. We have carried out strong and weak scaling performance analyses with up to 4096 GPUs. A two phase mixture separation test case with 1.8 billion particles has been run on the Piz Daint supercomputer from the Swiss National Supercomputer Center. With CUDA aware MPI, proper GPU affinity, communication and computation overlap optimizations for multi-GPU version, the final optimization results demonstrated more than 94% efficiency for weak scaling and more than 80% efficiency for strong scaling. As far as we know, this is the first report in the literature of DPD simulations being run on this large number of GPUs. The remaining challenges and future work are also discussed at the end of the paper. |
| 536 | _ | _ | |a 511 - Computational Science and Mathematical Methods (POF3-511) |0 G:(DE-HGF)POF3-511 |c POF3-511 |f POF III |x 0 |
| 536 | _ | _ | |a E-CAM - An e-infrastructure for software, training and consultancy in simulation and modelling (676531) |0 G:(EU-Grant)676531 |c 676531 |f H2020-EINFRA-2015-1 |x 1 |
| 536 | _ | _ | |a PRACE CoE Allocation E-CAM (prcoe02_20181001) |0 G:(DE-Juel1)prcoe02_20181001 |c prcoe02_20181001 |f PRACE CoE Allocation E-CAM |x 2 |
| 588 | _ | _ | |a Dataset connected to CrossRef |
| 700 | 1 | _ | |a Guo, Xiaohu |0 0000-0003-3545-3432 |b 1 |e Corresponding author |
| 700 | 1 | _ | |a Seaton, Michael |0 0000-0002-4708-573X |b 2 |
| 700 | 1 | _ | |a O’Cais, Alan |0 P:(DE-Juel1)143791 |b 3 |
| 773 | _ | _ | |a 10.1016/j.cpc.2020.107159 |g Vol. 251, p. 107159 - |0 PERI:(DE-600)1466511-6 |p 107159 - |t Computer physics communications |v 251 |y 2020 |x 0010-4655 |
| 856 | 4 | _ | |u https://juser.fz-juelich.de/record/889100/files/1-s2.0-S0010465520300199-main.pdf |y OpenAccess |
| 909 | C | O | |o oai:juser.fz-juelich.de:889100 |p openaire |p open_access |p driver |p VDB |p ec_fundedresources |p dnbdelivery |
| 910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 3 |6 P:(DE-Juel1)143791 |
| 913 | 1 | _ | |a DE-HGF |b Key Technologies |l Supercomputing & Big Data |1 G:(DE-HGF)POF3-510 |0 G:(DE-HGF)POF3-511 |3 G:(DE-HGF)POF3 |2 G:(DE-HGF)POF3-500 |4 G:(DE-HGF)POF |v Computational Science and Mathematical Methods |x 0 |
| 914 | 1 | _ | |y 2020 |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0200 |2 StatID |b SCOPUS |d 2020-09-06 |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0160 |2 StatID |b Essential Science Indicators |d 2020-09-06 |
| 915 | _ | _ | |a Creative Commons Attribution CC BY 4.0 |0 LIC:(DE-HGF)CCBY4 |2 HGFVOC |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0600 |2 StatID |b Ebsco Academic Search |d 2020-09-06 |
| 915 | _ | _ | |a JCR |0 StatID:(DE-HGF)0100 |2 StatID |b COMPUT PHYS COMMUN : 2018 |d 2020-09-06 |
| 915 | _ | _ | |a WoS |0 StatID:(DE-HGF)0113 |2 StatID |b Science Citation Index Expanded |d 2020-09-06 |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0150 |2 StatID |b Web of Science Core Collection |d 2020-09-06 |
| 915 | _ | _ | |a IF < 5 |0 StatID:(DE-HGF)9900 |2 StatID |d 2020-09-06 |
| 915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
| 915 | _ | _ | |a Peer Review |0 StatID:(DE-HGF)0030 |2 StatID |b ASC |d 2020-09-06 |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)1150 |2 StatID |b Current Contents - Physical, Chemical and Earth Sciences |d 2020-09-06 |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0300 |2 StatID |b Medline |d 2020-09-06 |
| 915 | _ | _ | |a Nationallizenz |0 StatID:(DE-HGF)0420 |2 StatID |d 2020-09-06 |w ger |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0199 |2 StatID |b Clarivate Analytics Master Journal List |d 2020-09-06 |
| 920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
| 980 | 1 | _ | |a FullTexts |
| 980 | _ | _ | |a journal |
| 980 | _ | _ | |a VDB |
| 980 | _ | _ | |a UNRESTRICTED |
| 980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
| Library | Collection | CLSMajor | CLSMinor | Language | Author |
|---|