TY  - JOUR
AU  - Seidler, Thomas
AU  - Emmerich, Fabian
AU  - Ehlert, Kristian
AU  - Berner, Rico
AU  - Nagel-Kanzler, Oliver
AU  - Schultz, Norbert
AU  - Quade, Markus
AU  - Schultz, Martin G.
AU  - Abel, Markus
TI  - Mantik: A Workflow Platform for the Development of Artificial Intelligence on High-Performance Computing Infrastructures
JO  - The Journal of Open Source Software
VL  - 9
IS  - 98
SN  - 2475-9066
M1  - FZJ-2025-00807
SP  - 6136
PY  - 2024
AB  - The use of machine learning (ML) approaches is exponentially increasing, and for many scientific applications, high-performance computing (HPC) infrastructure is used to train large models. However, the tooling for an easy deployment of models for training or inference on HPC infrastructures is not satisfactory, e.g. reproducibility, collaboration and monitoring of ML models are not addressed in existing toolsets. With Mantik, we provide an open-source cloud platform, which simplifies the development of and collaboration on ML models on HPC facilities, and enhances reproducibility by supporting data and code versioning as well as experiment tracking. The users are able to develop their applications in the environment they are most comfortable with – their local machine. Usage of the best-choice IDE and most recent software versions allow to leverage the full potential of the software stack for their research. Using Mantik’s remote file service allows for simple management of data in remote storages and keeping track of it. As soon as an application is ready for training or inference, users can immediately submit it to an HPC cluster. During application development, users can train and/or evaluate their models on HPC clusters via CLI on their local machine or our browser-based Mantik cloud platform. The latter only requires an internet browser such that e.g., ML training from your phone becomes feasible. Once training or inference has begun, a user is able to monitor the application in real time on the Mantik cloud platform.
LB  - PUB:(DE-HGF)16
DO  - 10.21105/joss.06136
UR  - https://juser.fz-juelich.de/record/1037642
ER  -