Contribution to a conference proceedings/Contribution to a book FZJ-2021-00074

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Eventify: Event-Based Task Parallelism for Strong Scaling

 ;  ;  ;  ;

2020
ACM New York, NY, USA

Proceedings of the Platform for Advanced Scientific Computing Conference - ACM New York, NY, USA, 2020. - ISBN 9781450379939 - doi:10.1145/3394277.3401858
PASC '20: Platform for Advanced Scientific Computing Conference, GenevaGeneva, Switzerland, 29 Jun 2020 - 1 Jul 20202020-06-292020-07-01
ACM New York, NY, USA 1-10 () [10.1145/3394277.3401858]

This record in other databases:

Please use a persistent id in citations:   doi:

Abstract: Today's processors become fatter, not faster. However, the exploitation of these massively parallel compute resources remains a challenge for many traditional HPC applications regarding scalability, portability and programmability. To tackle this challenge, several parallel programming approaches such as loop parallelism and task parallelism are researched in form of languages, libraries and frameworks. Task parallelism as provided by OpenMP, HPX, StarPU, Charm++ and Kokkos is the most promising approach to overcome the challenges of ever increasing parallelism. The aforementioned parallel programming technologies enable scalability for a broad range of algorithms with coarse-grained tasks, e. g. in linear algebra and classical N-body simulation. However, they do not fully address the performance bottlenecks of algorithms with fine-grained tasks and the resultant large task graphs. Additionally, we experienced the description of large task graphs to be cumbersome with the common approach of providing in-, out- and inout-dependencies. We introduce event-based task parallelism to solve the performance and programmability issues for algorithms that exhibit fine-grained task parallelism and contain repetitive task patterns. With user-defined event lists, the approach provides a more convenient and compact way to describe large task graphs. Furthermore, we show how these event lists are processed by a task engine that reuses user-defined, algorithmic data structures. As use case, we describe the implementation of a fast multipole method for molecular dynamics with event-based task parallelism. The performance analysis reveals that the event-based implementation is 52 % faster than a classical loop-parallel implementation with OpenMP.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
  2. Zivile Sicherheitsforschung (IAS-7)
Research Program(s):
  1. 511 - Computational Science and Mathematical Methods (POF3-511) (POF3-511)
  2. PhD no Grant - Doktorand ohne besondere Förderung (PHD-NO-GRANT-20170405) (PHD-NO-GRANT-20170405)

Appears in the scientific report 2020
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Institute Collections > IAS > IAS-7
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2021-01-06, last modified 2021-01-27


Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)