| Supporting HPC Users with LLview |
| Contribution to a conference proceedings/Contribution to a book | FZJ-2025-05008 |
2025
Springer
Heidelberg
ISBN: 978-3-032-07611-3 (print), 978-3-032-07612-0 (electronic)
Please use a persistent id in citations: doi:10.1007/978-3-032-07612-0
Abstract: Diagnosing and reporting operational issues to optimise system usage and performance is challenging on large-scale HPC systems due to their sheer complexity. At the Jülich Supercomputing Centre (JSC), we address this challenge with LLview, an open-source system and job reporting framework. LLview provides near real-time metrics for analysis through a web portal with role-based access for users, administrators, and support staff. In this paper, we present a series of use cases demonstrating how LLview enables efficient diagnosis and resolution of system and application issues, enhancing both reactive and proactive support for HPC users.