%0 Journal Article
%A Seidler, Thomas
%A Emmerich, Fabian
%A Ehlert, Kristian
%A Berner, Rico
%A Nagel-Kanzler, Oliver
%A Schultz, Norbert
%A Quade, Markus
%A Schultz, Martin G.
%A Abel, Markus
%T Mantik: A Workflow Platform for the Development of Artificial Intelligence on High-Performance Computing Infrastructures
%J The Journal of Open Source Software
%V 9
%N 98
%@ 2475-9066
%M FZJ-2025-00807
%P 6136
%D 2024
%X The use of machine learning (ML) approaches is exponentially increasing, and for many scientific applications, high-performance computing (HPC) infrastructure is used to train large models. However, the tooling for an easy deployment of models for training or inference on HPC infrastructures is not satisfactory, e.g. reproducibility, collaboration and monitoring of ML models are not addressed in existing toolsets. With Mantik, we provide an open-source cloud platform, which simplifies the development of and collaboration on ML models on HPC facilities, and enhances reproducibility by supporting data and code versioning as well as experiment tracking. The users are able to develop their applications in the environment they are most comfortable with – their local machine. Usage of the best-choice IDE and most recent software versions allow to leverage the full potential of the software stack for their research. Using Mantik’s remote file service allows for simple management of data in remote storages and keeping track of it. As soon as an application is ready for training or inference, users can immediately submit it to an HPC cluster. During application development, users can train and/or evaluate their models on HPC clusters via CLI on their local machine or our browser-based Mantik cloud platform. The latter only requires an internet browser such that e.g., ML training from your phone becomes feasible. Once training or inference has begun, a user is able to monitor the application in real time on the Mantik cloud platform.
%F PUB:(DE-HGF)16
%9 Journal Article
%R 10.21105/joss.06136
%U https://juser.fz-juelich.de/record/1037642