001     889100
005     20210127115252.0
024 7 _ |a 10.1016/j.cpc.2020.107159
|2 doi
024 7 _ |a 0010-4655
|2 ISSN
024 7 _ |a 1386-9485
|2 ISSN
024 7 _ |a 1879-2944
|2 ISSN
024 7 _ |a 2128/26662
|2 Handle
024 7 _ |a altmetric:75734051
|2 altmetric
024 7 _ |a WOS:000528002400017
|2 WOS
037 _ _ |a FZJ-2021-00030
082 _ _ |a 530
100 1 _ |a Castagna, Jony
|0 P:(DE-HGF)0
|b 0
245 _ _ |a Towards extreme scale dissipative particle dynamics simulations using multiple GPGPUs
260 _ _ |a Amsterdam
|c 2020
|b North Holland Publ. Co.
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1609857861_18366
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
520 _ _ |a A multi-GPGPU development for Mesoscale Simulations using the Dissipative Particle Dynamics method is presented. This distributed GPU acceleration development is an extension of the DL_MESO package to MPI+CUDA in order to exploit the computational power of the latest NVIDIA cards on hybrid CPU–GPU architectures. Details about the extensively applicable algorithm implementation and memory coalescing data structures are presented. The key algorithms’ optimizations for the nearest-neighbour list searching of particle pairs for short range forces, exchange of data and overlapping between computation and communications are also given. We have carried out strong and weak scaling performance analyses with up to 4096 GPUs. A two phase mixture separation test case with 1.8 billion particles has been run on the Piz Daint supercomputer from the Swiss National Supercomputer Center. With CUDA aware MPI, proper GPU affinity, communication and computation overlap optimizations for multi-GPU version, the final optimization results demonstrated more than 94% efficiency for weak scaling and more than 80% efficiency for strong scaling. As far as we know, this is the first report in the literature of DPD simulations being run on this large number of GPUs. The remaining challenges and future work are also discussed at the end of the paper.
536 _ _ |a 511 - Computational Science and Mathematical Methods (POF3-511)
|0 G:(DE-HGF)POF3-511
|c POF3-511
|f POF III
|x 0
536 _ _ |a E-CAM - An e-infrastructure for software, training and consultancy in simulation and modelling (676531)
|0 G:(EU-Grant)676531
|c 676531
|f H2020-EINFRA-2015-1
|x 1
536 _ _ |a PRACE CoE Allocation E-CAM (prcoe02_20181001)
|0 G:(DE-Juel1)prcoe02_20181001
|c prcoe02_20181001
|f PRACE CoE Allocation E-CAM
|x 2
588 _ _ |a Dataset connected to CrossRef
700 1 _ |a Guo, Xiaohu
|0 0000-0003-3545-3432
|b 1
|e Corresponding author
700 1 _ |a Seaton, Michael
|0 0000-0002-4708-573X
|b 2
700 1 _ |a O’Cais, Alan
|0 P:(DE-Juel1)143791
|b 3
773 _ _ |a 10.1016/j.cpc.2020.107159
|g Vol. 251, p. 107159 -
|0 PERI:(DE-600)1466511-6
|p 107159 -
|t Computer physics communications
|v 251
|y 2020
|x 0010-4655
856 4 _ |u https://juser.fz-juelich.de/record/889100/files/1-s2.0-S0010465520300199-main.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:889100
|p openaire
|p open_access
|p driver
|p VDB
|p ec_fundedresources
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)143791
913 1 _ |a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-511
|3 G:(DE-HGF)POF3
|2 G:(DE-HGF)POF3-500
|4 G:(DE-HGF)POF
|v Computational Science and Mathematical Methods
|x 0
914 1 _ |y 2020
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0200
|2 StatID
|b SCOPUS
|d 2020-09-06
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0160
|2 StatID
|b Essential Science Indicators
|d 2020-09-06
915 _ _ |a Creative Commons Attribution CC BY 4.0
|0 LIC:(DE-HGF)CCBY4
|2 HGFVOC
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0600
|2 StatID
|b Ebsco Academic Search
|d 2020-09-06
915 _ _ |a JCR
|0 StatID:(DE-HGF)0100
|2 StatID
|b COMPUT PHYS COMMUN : 2018
|d 2020-09-06
915 _ _ |a WoS
|0 StatID:(DE-HGF)0113
|2 StatID
|b Science Citation Index Expanded
|d 2020-09-06
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0150
|2 StatID
|b Web of Science Core Collection
|d 2020-09-06
915 _ _ |a IF < 5
|0 StatID:(DE-HGF)9900
|2 StatID
|d 2020-09-06
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
915 _ _ |a Peer Review
|0 StatID:(DE-HGF)0030
|2 StatID
|b ASC
|d 2020-09-06
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1150
|2 StatID
|b Current Contents - Physical, Chemical and Earth Sciences
|d 2020-09-06
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
|d 2020-09-06
915 _ _ |a Nationallizenz
|0 StatID:(DE-HGF)0420
|2 StatID
|d 2020-09-06
|w ger
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0199
|2 StatID
|b Clarivate Analytics Master Journal List
|d 2020-09-06
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 1 _ |a FullTexts
980 _ _ |a journal
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21