001     136584
005     20250314084108.0
024 7 _ |a 10.1145/2484762.2484777
|2 doi
037 _ _ |a FZJ-2013-03370
100 1 _ |a Wylie, Brian J. N.
|0 P:(DE-Juel1)132302
|b 0
|u fzj
|e Corresponding author
111 2 _ |a Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
|c San Diego, California
|d 2013-07-22 - 2013-07-25
|g XSEDE'13
|0 Conf-20130722_San_Diego_California
|p Proceedings of the Conference on Extreme Science and Engineering Discovery Environment Gateway to Discovery - XSEDE '13
|w USA
245 _ _ |a Scalasca support for MPI+OpenMP parallel applications on large-scale HPC systems based on Intel Xeon Phi
260 _ _ |a New York, New York, USA
|c 2013
|b ACM Press
295 1 0 |a XSEDE '13 Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
300 _ _ |a 8
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1375713225_28269
|2 PUB:(DE-HGF)
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a INPROCEEDINGS
|2 BibTeX
520 _ _ |a Intel Xeon Phi coprocessors based on the Many Integrated Core (MIC) architecture are starting to appear in HPC systems, with Stampede being a prominent example available within the XSEDE cyber-infrastructure. Porting MPI and OpenMP applications to such systems often requires no more than simple recompilation; however, execution performance needs to be carefully analyzed and tuned to effectively exploit their unique capabilities. Performance measurement and analysis tools need to support the variety of execution modes in a consistent and convenient manner, especially execution configurations involving large numbers of compute nodes, each with several multicore host processors and many-core coprocessors. Early experience using the open-source Scalasca toolset for runtime summarization and automatic trace analysis with the NPB BT-MZ MPI+OpenMP parallel application on Stampede is reported, along with discussion of ongoing and future work.
536 _ _ |a 411 - Computational Science and Mathematical Methods (POF2-411)
|0 G:(DE-HGF)POF2-411
|c POF2-411
|x 0
|f POF II
536 _ _ |a DEEP - Dynamical Exascale Entry Platform (287530)
|0 G:(EU-Grant)287530
|c 287530
|x 1
|f FP7-ICT-2011-7
536 _ _ |0 G:(DE-Juel-1)ATMLPP
|a ATMLPP - ATML Parallel Performance (ATMLPP)
|c ATMLPP
|x 2
588 _ _ |a Dataset connected to CrossRef Conference
700 1 _ |a Frings, Wolfgang
|0 P:(DE-Juel1)132108
|b 1
|u fzj
773 _ _ |a 10.1145/2484762.2484777
856 4 _ |u https://juser.fz-juelich.de/record/136584/files/FZJ-2013-03370.pdf
|y Restricted
909 C O |o oai:juser.fz-juelich.de:136584
|p openaire
|p VDB
|p ec_fundedresources
910 1 _ |a Forschungszentrum Jülich GmbH
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)132302
910 1 _ |a Forschungszentrum Jülich GmbH
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)132108
913 2 _ |a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-511
|2 G:(DE-HGF)POF3-500
|v Computational Science and Mathematical Methods
|x 0
913 1 _ |a DE-HGF
|b Schlüsseltechnologien
|l Supercomputing
|1 G:(DE-HGF)POF2-410
|0 G:(DE-HGF)POF2-411
|2 G:(DE-HGF)POF2-400
|v Computational Science and Mathematical Methods
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF2
914 1 _ |y 2013
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406

