001     1049804
005     20260108204824.0
024 7 _ |a 10.22323/1.466.0280
|2 doi
024 7 _ |a 10.34734/FZJ-2025-05588
|2 datacite_doi
037 _ _ |a FZJ-2025-05588
041 _ _ |a English
100 1 _ |a Altherr, Anian
|0 P:(DE-HGF)0
|b 0
111 2 _ |a The 41st International Symposium on Lattice Field Theory
|g LATTICE 2024
|c Liverpool
|d 2024-07-28 - 2024-08-03
|w UK
245 _ _ |a O(a)-improved QCD+QED Wilson Dirac operator on GPUs
260 _ _ |c 2025
295 1 0 |a Proceedings of The 41st International Symposium on Lattice Field Theory
300 _ _ |a 28
336 7 _ |a CONFERENCE_PAPER
|2 ORCID
336 7 _ |a Conference Paper
|0 33
|2 EndNote
336 7 _ |a INPROCEEDINGS
|2 BibTeX
336 7 _ |a conferenceObject
|2 DRIVER
336 7 _ |a Output Types/Conference Paper
|2 DataCite
336 7 _ |a Contribution to a conference proceedings
|b contrib
|m contrib
|0 PUB:(DE-HGF)8
|s 1767859784_29748
|2 PUB:(DE-HGF)
336 7 _ |a Contribution to a book
|0 PUB:(DE-HGF)7
|2 PUB:(DE-HGF)
|m contb
490 0 _ |a Proceedings of Science
520 _ _ |a Markov Chain Monte Carlo simulations of lattice Quantum Chromodynamics (QCD) are the onlyknown tool to investigate non-perturbatively the theory of the strong interaction and are requiredto perform precision tests of the Standard Model of Particle Physics. As the Markov Chain isa serial process, the sole option for improving the sampling rate is accelerating each individualupdate step. Heterogeneous clusters of GPU-accelerated nodes offer large total memory bandwidthwhich can be used to speed-up our application, openQxD-1.1, which is dominated by inversionsof the Dirac operator, a large sparse matrix. In this work we investigate offloading the inversionto GPU using the lattice-QCD library QUDA, and our early results demonstrate a significantpotential speed-up in the time-to-solution for state-of-the-art problem sizes. Minimal extensionsto the existing QUDA library are required for our specific physics programme while greatlyenhancing the performance portability of our code and retaining the reliability and robustness ofexisting applications in openQxD-1.1. Our new interface will enable us to utilize pre-exascaleinfrastructure and reduce the systematic uncertainty in our physics predictions by incorporatingthe effects of quantum electromagnetism (QED) in our simulations.
536 _ _ |a 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)
|0 G:(DE-HGF)POF4-5111
|c POF4-511
|f POF IV
|x 0
588 _ _ |a Dataset connected to CrossRef Conference
700 1 _ |a Campos, Isabel
|0 P:(DE-HGF)0
|b 1
700 1 _ |a Coles, Jonathan
|0 P:(DE-HGF)0
|b 2
700 1 _ |a Cotellucci, Alessandro
|0 P:(DE-Juel1)207074
|b 3
|u fzj
700 1 _ |a Fernández De la Garza, Juan Antonio
|0 P:(DE-HGF)0
|b 4
700 1 _ |a Gruber, Roman
|0 P:(DE-HGF)0
|b 5
|e Corresponding author
700 1 _ |a Harris, Tim
|0 P:(DE-HGF)0
|b 6
700 1 _ |a Komijani, Javad
|0 P:(DE-HGF)0
|b 7
700 1 _ |a Lücke, Jens
|0 P:(DE-HGF)0
|b 8
700 1 _ |a Maier, Stephanie
|0 P:(DE-HGF)0
|b 9
700 1 _ |a Marinkovic, Marina
|0 P:(DE-HGF)0
|b 10
700 1 _ |a Parato, Letizia
|0 P:(DE-HGF)0
|b 11
700 1 _ |a Patella, Agostino
|0 P:(DE-HGF)0
|b 12
700 1 _ |a Rosso, Sara
|0 P:(DE-HGF)0
|b 13
700 1 _ |a Tavella, Paola
|0 P:(DE-HGF)0
|b 14
700 1 _ |a Vogt, Hannes
|0 P:(DE-HGF)0
|b 15
773 _ _ |a 10.22323/1.466.0280
|y 2025
856 4 _ |u https://juser.fz-juelich.de/record/1049804/files/LATTICE2024_280.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:1049804
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)207074
913 1 _ |a DE-HGF
|b Key Technologies
|l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action
|1 G:(DE-HGF)POF4-510
|0 G:(DE-HGF)POF4-511
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Enabling Computational- & Data-Intensive Science and Engineering
|9 G:(DE-HGF)POF4-5111
|x 0
914 1 _ |y 2025
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
915 _ _ |a Creative Commons Attribution-NonCommercial-NoDerivs CC BY-NC-ND 4.0
|0 LIC:(DE-HGF)CCBYNCND4
|2 HGFVOC
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a contrib
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a contb
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21