001     1049564
005     20251223155943.0
024 7 _ |a 10.1145/3748815
|2 doi
024 7 _ |a 0098-3500
|2 ISSN
024 7 _ |a 1557-7295
|2 ISSN
037 _ _ |a FZJ-2025-05368
082 _ _ |a 004
100 1 _ |a Mayani, Sonali
|0 0000-0002-3294-4650
|b 0
245 _ _ |a A Massively Parallel Performance Portable Free-Space Spectral Poisson Solver
260 _ _ |a New York, NY
|c 2025
|b ACM
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1766501243_23081
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
520 _ _ |a Vico et al. suggest a fast algorithm for computing volume potentials, beneficial to fields with problems requiring the solution of the free-space Poisson’s equation, such as beam and plasma physics. Currently, the standard is the algorithm of Hockney and Eastwood, with second order in convergence at best. The algorithm proposed by Vico et al. converges spectrally for sufficiently smooth functions, i.e., faster than any fixed order in the number of grid points. We implement a performance portable version of the traditional Hockney-Eastwood and the novel Vico-Greengard Poisson solver as part of the Independent Parallel Particle Layer (IPPL) library. For sufficiently smooth source functions, the Vico-Greengard algorithm achieves higher accuracy than the Hockney-Eastwood method with the same grid size, reducing the computational demands of high-resolution simulations since one could use coarser grids to achieve them. Additionally, we propose an improvement to the Vico-Greengard method which further reduces its memory footprint. This is important for GPUs, which have limited memory, and should be taken into account when selecting numerical algorithms for performance portable codes. Finally, we showcase performance through GPU and CPU scaling studies on the Perlmutter (NERSC) supercomputer, with efficiencies staying above 50% in the strong scaling case. To showcase portability, we also run the scaling studies on the Alps supercomputer at CSCS, Switzerland and the GPU partition of the Lumi supercomputer at CSC, Finland.
536 _ _ |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5112
|c POF4-511
|f POF IV
|x 0
588 _ _ |a Dataset connected to CrossRef, Journals: juser.fz-juelich.de
700 1 _ |a Montanaro, Veronica
|0 0009-0004-5772-6109
|b 1
700 1 _ |a Cerfon, Antoine
|0 0000-0003-3326-4785
|b 2
700 1 _ |a Frey, Matthias
|0 0000-0002-7842-0051
|b 3
700 1 _ |a Muralikrishnan, Sriramkrishnan
|0 P:(DE-Juel1)195613
|b 4
700 1 _ |a Adelmann, Andreas
|0 0000-0002-7230-7007
|b 5
|e Corresponding author
773 _ _ |a 10.1145/3748815
|g Vol. 51, no. 3, p. 1 - 23
|0 PERI:(DE-600)2006421-4
|n 3
|p 1 - 23
|t ACM transactions on mathematical software
|v 51
|y 2025
|x 0098-3500
856 4 _ |u https://juser.fz-juelich.de/record/1049564/files/3748815.pdf
|y Restricted
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 4
|6 P:(DE-Juel1)195613
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5112
|x 0
914 1 _ |y 2025
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0200
|2 StatID
|b SCOPUS
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0199
|2 StatID
|b Clarivate Analytics Master Journal List
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)1160
|2 StatID
|b Current Contents - Engineering, Computing and Technology
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0160
|2 StatID
|b Essential Science Indicators
|d 2024-12-18
915 _ _ |a WoS
|0 StatID:(DE-HGF)0113
|2 StatID
|b Science Citation Index Expanded
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0150
|2 StatID
|b Web of Science Core Collection
|d 2024-12-18
915 _ _ |a JCR
|0 StatID:(DE-HGF)0100
|2 StatID
|b ACM T MATH SOFTWARE : 2022
|d 2024-12-18
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0600
|2 StatID
|b Ebsco Academic Search
|d 2024-12-18
915 _ _ |a Peer Review
|0 StatID:(DE-HGF)0030
|2 StatID
|b ASC
|d 2024-12-18
915 _ _ |a IF < 5
|0 StatID:(DE-HGF)9900
|2 StatID
|d 2024-12-18
920 _ _ |l no
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a journal
980 _ _ |a EDITORS
980 _ _ |a VDBINPRINT
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21