TY  - CONF
AU  - Kitayama, Itaru
AU  - Wylie, Brian J. N.
AU  - Maeda, Toshiyuki
TI  - Execution Performance Analysis of the ABySS Genome Sequence Assembler using Scalasca on the K Computer
VL  - 27
PB  - IOS Press
M1  - FZJ-2016-02510
SN  - 0927-5452
T2  - Advances in Parallel Computing
SP  - 63-72
PY  - 2016
AB  - Performance analysis of the ABySS genome sequence assembler (ABYSS-P) executing on the K computer with up to 8192 compute nodes is described which identified issues that limited scalability to less than 1024 compute nodes and required prohibitive message buffer memory with 16384 or more compute nodes. The open-source Scalasca toolset was employed to analyse executions, revealing the impact of massive amounts of MPI point-to-point communication used particularly for master/worker process coordination, and inefficient parallel file operations that manifest as waiting time at later MPI collective synchronisations and communications. Initial remediation via use of collective communication operations and alternate strategies for parallel file handling show large performance and scalability improvements, with partial executions validated on the full 82,944 compute nodes of the K computer.
T2  - International Conference on Parallel Computing 2015
CY  - 1 Sep 2015 - 4 Sep 2015, Edinburgh (Scotland)
Y2  - 1 Sep 2015 - 4 Sep 2015
M2  - Edinburgh, Scotland
LB  - PUB:(DE-HGF)8
UR  - <Go to ISI:>//WOS:000578348400007
DO  - DOI:10.3233/978-1-61499-621-7-63
UR  - https://juser.fz-juelich.de/record/809217
ER  -