000173013 001__ 173013
000173013 005__ 20210129214540.0
000173013 0247_ $$2doi$$a10.1007/s11390-013-1313-9
000173013 0247_ $$2ISSN$$a1000-9000
000173013 0247_ $$2ISSN$$a1860-4749
000173013 0247_ $$2WOS$$aWOS:000314190600007
000173013 037__ $$aFZJ-2014-06427
000173013 082__ $$a004
000173013 1001_ $$0P:(DE-HGF)0$$aTeijeiro, Carlos$$b0$$eCorresponding Author
000173013 245__ $$aDesign and Implementation of an Extended Collectives Library for Unified Parallel C
000173013 260__ $$aBoston, Mass. [u.a.]$$bSpringer$$c2013
000173013 3367_ $$0PUB:(DE-HGF)16$$2PUB:(DE-HGF)$$aJournal Article$$bjournal$$mjournal$$s1417796426_21916
000173013 3367_ $$2DataCite$$aOutput Types/Journal article
000173013 3367_ $$00$$2EndNote$$aJournal Article
000173013 3367_ $$2BibTeX$$aARTICLE
000173013 3367_ $$2ORCID$$aJOURNAL_ARTICLE
000173013 3367_ $$2DRIVER$$aarticle
000173013 520__ $$aUnified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while taking advantage of the scalability of distributed memory architectures. UPC therefore allows programmers to write parallel applications for hybrid shared/distributed memory architectures, such as multi-core clusters, more productively, accessing remote memory through high-level language constructs such as assignments to shared variables or collective primitives. However, the standard UPC collectives library includes a reduced set of eight basic primitives with quite limited functionality. This work presents the design and implementation of extended UPC collective functions that overcome the limitations of the standard collectives library, for example by allowing the use of a specific source and destination thread or the definition of the amount of data transferred by each particular thread. This library fulfills the demands made by the UPC developer community and implements portable algorithms that are independent of the specific UPC compiler/runtime being used. A representative set of these extended collectives has been evaluated using two applications and four kernels as case studies. The results confirm that the new library makes programming easier without trading off performance, thus achieving high productivity in parallel programming to harness the performance of hybrid shared/distributed memory architectures in high performance computing.
000173013 536__ $$0G:(DE-HGF)POF2-41G21$$a41G - Supercomputer Facility (POF2-41G21)$$cPOF2-41G21$$fPOF II$$x0
000173013 588__ $$aDataset connected to CrossRef, juser.fz-juelich.de
000173013 7001_ $$0P:(DE-HGF)0$$aTaboada, Guillermo L.$$b1
000173013 7001_ $$0P:(DE-HGF)0$$aTouriño, Juan$$b2
000173013 7001_ $$0P:(DE-HGF)0$$aDoallo, Ramón$$b3
000173013 7001_ $$0P:(DE-HGF)0$$aMouriño, José C.$$b4
000173013 7001_ $$0P:(DE-Juel1)144660$$aAlvarez Mallon, Damian$$b5
000173013 7001_ $$0P:(DE-HGF)0$$aWibecan, Brian$$b6
000173013 773__ $$0PERI:(DE-600)2224868-7$$a10.1007/s11390-013-1313-9$$gVol. 28, no. 1, p. 72 - 89$$n1$$p72 - 89$$tJournal of computer science and technology$$v28$$x1860-4749$$y2013
000173013 8564_ $$uhttps://juser.fz-juelich.de/record/173013/files/FZJ-2014-06427.pdf$$yRestricted
000173013 909CO $$ooai:juser.fz-juelich.de:173013$$pVDB
000173013 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)144660$$aForschungszentrum Jülich GmbH$$b5$$kFZJ
000173013 9132_ $$0G:(DE-HGF)POF3-513$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vSupercomputer Facility$$x0
000173013 9131_ $$0G:(DE-HGF)POF2-41G21$$1G:(DE-HGF)POF2-410$$2G:(DE-HGF)POF2-400$$3G:(DE-HGF)POF2$$4G:(DE-HGF)POF$$aDE-HGF$$bSchlüsseltechnologien$$lSupercomputing$$vSupercomputer Facility$$x0
000173013 9141_ $$y2014
000173013 915__ $$0StatID:(DE-HGF)0100$$2StatID$$aJCR
000173013 915__ $$0StatID:(DE-HGF)0111$$2StatID$$aWoS$$bScience Citation Index Expanded
000173013 915__ $$0StatID:(DE-HGF)0150$$2StatID$$aDBCoverage$$bWeb of Science Core Collection
000173013 915__ $$0StatID:(DE-HGF)0199$$2StatID$$aDBCoverage$$bThomson Reuters Master Journal List
000173013 915__ $$0StatID:(DE-HGF)0200$$2StatID$$aDBCoverage$$bSCOPUS
000173013 915__ $$0StatID:(DE-HGF)0300$$2StatID$$aDBCoverage$$bMedline
000173013 915__ $$0StatID:(DE-HGF)0420$$2StatID$$aNationallizenz
000173013 915__ $$0StatID:(DE-HGF)9900$$2StatID$$aIF < 5
000173013 920__ $$lyes
000173013 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000173013 980__ $$ajournal
000173013 980__ $$aVDB
000173013 980__ $$aI:(DE-Juel1)JSC-20090406
000173013 980__ $$aUNRESTRICTED