001     889149
005     20210127115259.0
024 7 _ |a 10.1145/3394277.3401858
|2 doi
024 7 _ |a 2128/26687
|2 Handle
037 _ _ |a FZJ-2021-00074
100 1 _ |a Haensel, David
|0 P:(DE-Juel1)161429
|b 0
|e Corresponding author
111 2 _ |a PASC '20: Platform for Advanced Scientific Computing Conference
|c Geneva
|d 2020-06-29 - 2020-07-01
|w Switzerland
245 _ _ |a Eventify: Event-Based Task Parallelism for Strong Scaling
260 _ _ |c 2020
|b ACM New York, NY, USA
295 1 0 |a Proceedings of the Platform for Advanced Scientific Computing Conference - ACM New York, NY, USA, 2020. - ISBN 9781450379939 - doi:10.1145/3394277.3401858
300 _ _ |a 1-10
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a INPROCEEDINGS
|2 BibTeX
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1609945551_6559
|2 PUB:(DE-HGF)
336 7 _ |a Contribution to a book
|0 PUB:(DE-HGF)7
|2 PUB:(DE-HGF)
|m contb
520 _ _ |a Today's processors become fatter, not faster. However, the exploitation of these massively parallel compute resources remains a challenge for many traditional HPC applications regarding scalability, portability and programmability. To tackle this challenge, several parallel programming approaches such as loop parallelism and task parallelism are researched in the form of languages, libraries and frameworks. Task parallelism as provided by OpenMP, HPX, StarPU, Charm++ and Kokkos is the most promising approach to overcome the challenges of ever-increasing parallelism. The aforementioned parallel programming technologies enable scalability for a broad range of algorithms with coarse-grained tasks, e.g. in linear algebra and classical N-body simulation. However, they do not fully address the performance bottlenecks of algorithms with fine-grained tasks and the resultant large task graphs. Additionally, we found the description of large task graphs to be cumbersome with the common approach of providing in-, out- and inout-dependencies. We introduce event-based task parallelism to solve the performance and programmability issues for algorithms that exhibit fine-grained task parallelism and contain repetitive task patterns. With user-defined event lists, the approach provides a more convenient and compact way to describe large task graphs. Furthermore, we show how these event lists are processed by a task engine that reuses user-defined, algorithmic data structures. As a use case, we describe the implementation of a fast multipole method for molecular dynamics with event-based task parallelism. The performance analysis reveals that the event-based implementation is 52 % faster than a classical loop-parallel implementation with OpenMP.
536 _ _ |a 511 - Computational Science and Mathematical Methods (POF3-511)
|0 G:(DE-HGF)POF3-511
|c POF3-511
|f POF III
|x 0
536 _ _ |a PhD no Grant - Doktorand ohne besondere Förderung (PHD-NO-GRANT-20170405)
|0 G:(DE-Juel1)PHD-NO-GRANT-20170405
|c PHD-NO-GRANT-20170405
|x 1
588 _ _ |a Dataset connected to CrossRef Conference
700 1 _ |a Morgenstern, Laura
|0 P:(DE-Juel1)169856
|b 1
|u fzj
700 1 _ |a Beckmann, Andreas
|0 P:(DE-Juel1)157750
|b 2
|u fzj
700 1 _ |a Kabadshow, Ivo
|0 P:(DE-Juel1)132152
|b 3
|u fzj
700 1 _ |a Dachsel, Holger
|0 P:(DE-Juel1)132079
|b 4
|u fzj
773 _ _ |a 10.1145/3394277.3401858
856 4 _ |u https://dl.acm.org/doi/pdf/10.1145/3394277.3401858
856 4 _ |u https://juser.fz-juelich.de/record/889149/files/3394277.3401858.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:889149
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)161429
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)169856
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)157750
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)132152
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 4
|6 P:(DE-Juel1)132079
913 1 _ |a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-511
|3 G:(DE-HGF)POF3
|2 G:(DE-HGF)POF3-500
|4 G:(DE-HGF)POF
|v Computational Science and Mathematical Methods
|x 0
914 1 _ |y 2020
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
920 1 _ |0 I:(DE-Juel1)IAS-7-20180321
|k IAS-7
|l Zivile Sicherheitsforschung
|x 1
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a contb
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a I:(DE-Juel1)IAS-7-20180321
980 1 _ |a FullTexts

