%0 Conference Paper
%A Guimarães, Filipe Souza Mendes
%A Sankaran, Aravind
%A Frings, Wolfgang
%T Supporting HPC Users with LLview; 1
%V 16091
%@ 0302-9743
%C Heidelberg
%I Springer
%M FZJ-2025-05008
%@ 978-3-032-07611-3 (print)
%B Lecture Notes in Computer Science
%P 40 - 51
%D 2025
%< High Performance Computing / ISC High Performance 2025 International Workshops, Hamburg, Germany, June 10–13, 2025, Revised Selected Papers
%X Diagnosing and reporting operational issues to optimise system usage and performance is challenging on large-scale HPC systems due to their sheer complexity. At the Jülich Supercomputing Centre (JSC), we address this challenge with LLview, an open-source system and job reporting framework. LLview provides near real-time metrics for analysis through a web portal with role-based access for users, administrators, and support staff. In this paper, we present a series of use cases demonstrating how LLview enables efficient diagnosis and resolution of system and application issues, enhancing both reactive and proactive support for HPC users.
%B 40th International Conference on High Performance Computing, ISC High Performance 2025.
%C 10 Jun 2025 - 13 Jun 2025, Hamburg (Germany)
Y2 10 Jun 2025 - 13 Jun 2025
M2 Hamburg, Germany
%F PUB:(DE-HGF)8 ; PUB:(DE-HGF)7
%9 Contribution to a conference proceedingsContribution to a book
%R 10.1007/978-3-032-07612-0
%U https://juser.fz-juelich.de/record/1048907