000136584 001__ 136584
000136584 005__ 20250314084108.0
000136584 0247_ $$2doi$$a10.1145/2484762.2484777
000136584 037__ $$aFZJ-2013-03370
000136584 1001_ $$0P:(DE-Juel1)132302$$aWylie, Brian J. N.$$b0$$eCorresponding author$$ufzj
000136584 1112_ $$0Conf-20130722_San_Diego_California$$aConference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery$$cSan Diego, California$$d2013-07-22 - 2013-07-25$$gXSEDE'13$$pProceedings of the Conference on Extreme Science and Engineering Discovery Environment Gateway to Discovery - XSEDE '13$$wUSA
000136584 245__ $$aScalasca support for MPI+OpenMP parallel applications on large-scale HPC systems based on Intel Xeon Phi
000136584 260__ $$aNew York, New York, USA$$bACM Press$$c2013
000136584 29510 $$aXSEDE '13 Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
000136584 300__ $$a8
000136584 3367_ $$0PUB:(DE-HGF)8$$2PUB:(DE-HGF)$$aContribution to a conference proceedings$$bcontrib$$mcontrib$$s1375713225_28269
000136584 3367_ $$033$$2EndNote$$aConference Paper
000136584 3367_ $$2ORCID$$aCONFERENCE_PAPER
000136584 3367_ $$2DataCite$$aOutput Types/Conference Paper
000136584 3367_ $$2DRIVER$$aconferenceObject
000136584 3367_ $$2BibTeX$$aINPROCEEDINGS
000136584 520__ $$aIntel Xeon Phi coprocessors based on the Many Integrated Core (MIC) architecture are starting to appear in HPC systems, with Stampede being a prominent example available within the XSEDE cyber-infrastructure. Porting MPI and OpenMP applications to such systems is often no more than simple recompilation; however, execution performance needs to be carefully analyzed and tuned to effectively exploit their unique capabilities. For performance measurement and analysis tools, the variety of execution modes needs to be supported in a consistent and convenient manner, especially execution configurations involving large numbers of compute nodes, each with several multicore host processors and many-core coprocessors. Early experience using the open-source Scalasca toolset for runtime summarization and automatic trace analysis with the NPB BT-MZ MPI+OpenMP parallel application on Stampede is reported, along with discussion of ongoing and future work.
000136584 536__ $$0G:(DE-HGF)POF2-411$$a411 - Computational Science and Mathematical Methods (POF2-411)$$cPOF2-411$$fPOF II$$x0
000136584 536__ $$0G:(EU-Grant)287530$$aDEEP - Dynamical Exascale Entry Platform (287530)$$c287530$$fFP7-ICT-2011-7$$x1
000136584 536__ $$0G:(DE-Juel-1)ATMLPP$$aATMLPP - ATML Parallel Performance (ATMLPP)$$cATMLPP$$x2
000136584 588__ $$aDataset connected to CrossRef Conference
000136584 7001_ $$0P:(DE-Juel1)132108$$aFrings, Wolfgang$$b1$$ufzj
000136584 773__ $$a10.1145/2484762.2484777
000136584 8564_ $$uhttps://juser.fz-juelich.de/record/136584/files/FZJ-2013-03370.pdf$$yRestricted
000136584 909CO $$ooai:juser.fz-juelich.de:136584$$pec_fundedresources$$pVDB$$popenaire
000136584 9141_ $$y2013
000136584 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132302$$aForschungszentrum Jülich GmbH$$b0$$kFZJ
000136584 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)132108$$aForschungszentrum Jülich GmbH$$b1$$kFZJ
000136584 9132_ $$0G:(DE-HGF)POF3-511$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vComputational Science and Mathematical Methods$$x0
000136584 9131_ $$0G:(DE-HGF)POF2-411$$1G:(DE-HGF)POF2-410$$2G:(DE-HGF)POF2-400$$3G:(DE-HGF)POF2$$4G:(DE-HGF)POF$$aDE-HGF$$bSchlüsseltechnologien$$lSupercomputing$$vComputational Science and Mathematical Methods$$x0
000136584 920__ $$lyes
000136584 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000136584 980__ $$acontrib
000136584 980__ $$aVDB
000136584 980__ $$aUNRESTRICTED
000136584 980__ $$aI:(DE-Juel1)JSC-20090406