%0 Conference Paper
%A John, Chelsea Maria
%A Herten, Andreas
%T Novel Architecture Exploration - OpenGPT-X: Open Large Language Models
%M FZJ-2023-04874
%D 2023
%X The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open-source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Because training large language models efficiently requires substantial memory and compute resources, high-performance computing systems such as JUWELS Booster are essential. This paper presents the results of the exploration of novel hardware architectures conducted within the scope of the project.
%B WHPC@SC23: 16th International Women in HPC Workshop
%C 12 Nov 2023 - 17 Nov 2023, Denver, Colorado (USA)
%F PUB:(DE-HGF)6
%9 Conference Presentation
%R 10.34734/FZJ-2023-04874
%U https://juser.fz-juelich.de/record/1018546