Contribution to a conference proceedings/Contribution to a book FZJ-2016-06504

Scientific Software Management in Real Life: Deployment of EasyBuild on a Large Scale System

2016
IEEE Press Piscataway, NJ, USA

Proceedings of the Third International Workshop on HPC User Support Tools
Third International Workshop on HPC User Support Tools, Salt Lake City, USA, 13 Nov 2016
Piscataway, NJ, USA: IEEE Press, pp. 31-40 [doi:10.1109/HUST.2016.009]

Please use a persistent id in citations: doi:10.1109/HUST.2016.009

Abstract: Managing scientific software stacks has traditionally been a manual task that required a sizeable team with knowledge about the specifics of building each application. Keeping the software stack up to date also caused a significant overhead for system administrators as well as support teams. Furthermore, a flat module view and the manual creation of modules by different members of the teams can end up providing a confusing view of the installed software to end users. In addition, on many HPC clusters the OS images have to include auxiliary packages to support components of the scientific software stack, potentially bloating the images of the cluster nodes and restricting the installation of new software to a designated maintenance window. To alleviate this situation, tools like EasyBuild help to manage a large number of scientific software packages in a structured way, decoupling the scientific stack from the OS-provided software and lowering the overall overhead of managing a complex HPC software infrastructure. However, the relative novelty of these tools and the variety of requirements from both users and HPC sites mean that such frameworks still have to evolve and adapt to different environments. In this paper, we report on how we deployed EasyBuild in a cluster with 45K+ cores (JURECA). In particular, we discuss which features were missing in order to meet our requirements, how we implemented them, how the installation, upgrade, and retirement of software are managed, and how this approach is reused for other internal systems. Finally, we outline some enhancements we would like to see implemented in our setup and in EasyBuild in the future.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 513 - Supercomputer Facility (POF3-513)

Appears in the scientific report 2016

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2016-11-21, last modified 2021-01-29

