Understanding Data Movement in AMD Multi-GPU Systems with Infinity Fabric

Schieffer, Gabin; Shi, Ruimin; Peng, Ivy; Faj, Jennifer; Herten, Andreas; Markidis, Stefano

doi:10.1109/SCW63240.2024.00079

Contribution to a conference proceedings

FZJ-2025-00766

Schieffer, G. (Corresponding author); Shi, R.; Markidis, S.; Herten, A. (FZJ); Faj, J.; Peng, I.

2024
IEEE

SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC24, Atlanta, GA, USA, 17 Nov 2024 - 22 Nov 2024. IEEE, pp. 567-576 (2024) [10.1109/SCW63240.2024.00079]

Please use a persistent id in citations: doi:10.1109/SCW63240.2024.00079

Abstract: Modern GPU systems are constantly evolving to meet the needs of computing-intensive applications in scientific and machine learning domains. However, there is typically a gap between the hardware capacity and the achievable application performance. This work aims to provide a better understanding of the Infinity Fabric interconnects on AMD GPUs and CPUs. We propose a test and evaluation methodology for characterizing the performance of data movements on multi-GPU systems, stressing different communication options on AMD MI250X GPUs, including point-to-point and collective communication, and memory allocation strategies between GPUs, as well as the host CPU. In a single-node setup with four GPUs, we show that direct peer-to-peer memory accesses between GPUs and utilization of the RCCL library outperform MPI-based solutions in terms of memory/communication latency and bandwidth. Our test and evaluation method serves as a base for validating memory and communication strategies on a system and improving applications on AMD multi-GPU computing systems.

Contributing Institute(s):

Jülich Supercomputing Center (JSC)

Appears in the scientific report 2024

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Workflow collections > Public records
Institute Collections > JSC
Publications database

Record created 2025-01-20, last modified 2025-08-22
