000128163 001__ 128163
000128163 005__ 20250314084107.0
000128163 0247_ $$2doi$$a10.1016/j.parco.2012.09.002
000128163 0247_ $$2ISSN$$a0167-8191
000128163 0247_ $$2ISSN$$a1872-7336
000128163 0247_ $$2WOS$$aWOS:000317371900004
000128163 037__ $$aFZJ-2012-01058
000128163 082__ $$a004
000128163 1001_ $$0P:(DE-HGF)0$$aHermanns, Marc-André$$b0$$eCorresponding author
000128163 245__ $$aA scalable infrastructure for the performance analysis of passive target synchronization
000128163 260__ $$aAmsterdam [u.a.]$$bNorth-Holland, Elsevier Science$$c2013
000128163 3367_ $$0PUB:(DE-HGF)16$$2PUB:(DE-HGF)$$aJournal Article$$bjournal$$mjournal$$s1366635474_9685
000128163 3367_ $$2DataCite$$aOutput Types/Journal article
000128163 3367_ $$00$$2EndNote$$aJournal Article
000128163 3367_ $$2BibTeX$$aARTICLE
000128163 3367_ $$2ORCID$$aJOURNAL_ARTICLE
000128163 3367_ $$2DRIVER$$aarticle
000128163 520__ $$aPartitioned global address space (PGAS) languages combine the convenient abstraction of shared memory with the notion of affinity, extending multi-threaded programming to large-scale systems with physically distributed memory. However, in spite of their obvious advantages, PGAS languages still lack appropriate tool support for performance analysis, one of the reasons why their adoption is still in its infancy. Some of the performance problems for which tool support is needed occur at the level of the underlying one-sided communication substrate, such as the Aggregate Remote Memory Copy Interface (ARMCI). One such example is the waiting time in situations where asynchronous data transfers cannot be completed without software intervention at the target side. This is not uncommon on systems with reduced operating-system kernels such as IBM Blue Gene/P where the use of progress threads would double the number of cores necessary to run an application. In this paper, we present an extension of the Scalasca trace-analysis infrastructure aimed at the identification and quantification of progress-related waiting times at larger scales. We demonstrate its utility and scalability using a benchmark running with up to 32,768 processes.
000128163 536__ $$0G:(DE-HGF)POF2-411$$a411 - Computational Science and Mathematical Methods (POF2-411)$$cPOF2-411$$fPOF II$$x0
000128163 536__ $$0G:(DE-Juel-1)ATMLPP$$aATMLPP - ATML Parallel Performance (ATMLPP)$$cATMLPP$$x1
000128163 588__ $$aDataset connected to CrossRef, juser.fz-juelich.de
000128163 7001_ $$0P:(DE-HGF)0$$aKrishnamoorthy, Sriram$$b1
000128163 7001_ $$0P:(DE-Juel1)132299$$aWolf, Felix$$b2
000128163 773__ $$0PERI:(DE-600)1466340-5$$a10.1016/j.parco.2012.09.002$$n3$$p132-145$$tParallel computing$$v39
000128163 8564_ $$uhttps://juser.fz-juelich.de/record/128163/files/FZJ-2012-01058.pdf$$yRestricted
000128163 909CO $$ooai:juser.fz-juelich.de:128163$$pVDB
000128163 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132299$$aForschungszentrum Jülich GmbH$$b2$$kFZJ
000128163 9132_ $$0G:(DE-HGF)POF3-511$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data $$vComputational Science and Mathematical Methods$$x0
000128163 9131_ $$0G:(DE-HGF)POF2-411$$1G:(DE-HGF)POF2-410$$2G:(DE-HGF)POF2-400$$3G:(DE-HGF)POF2$$4G:(DE-HGF)POF$$aDE-HGF$$bSchlüsseltechnologien$$lSupercomputing$$vComputational Science and Mathematical Methods$$x0
000128163 9141_ $$y2013
000128163 915__ $$0StatID:(DE-HGF)0010$$2StatID$$aJCR/ISI refereed
000128163 915__ $$0StatID:(DE-HGF)0100$$2StatID$$aJCR
000128163 915__ $$0StatID:(DE-HGF)0111$$2StatID$$aWoS$$bScience Citation Index Expanded
000128163 915__ $$0StatID:(DE-HGF)0150$$2StatID$$aDBCoverage$$bWeb of Science Core Collection
000128163 915__ $$0StatID:(DE-HGF)0199$$2StatID$$aDBCoverage$$bThomson Reuters Master Journal List
000128163 915__ $$0StatID:(DE-HGF)0200$$2StatID$$aDBCoverage$$bSCOPUS
000128163 915__ $$0StatID:(DE-HGF)0300$$2StatID$$aDBCoverage$$bMedline
000128163 915__ $$0StatID:(DE-HGF)1050$$2StatID$$aDBCoverage$$bBIOSIS Previews
000128163 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000128163 980__ $$ajournal
000128163 980__ $$aVDB
000128163 980__ $$aUNRESTRICTED
000128163 980__ $$aI:(DE-Juel1)JSC-20090406