TY  - CONF
AU  - John, Chelsea Maria
AU  - Herten, Andreas
TI  - Novel Architecture Exploration - OpenGPT-X: Open Large Language Models
M1  - FZJ-2023-04874
PY  - 2023
AB  - The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open-source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Because efficiently training large language models requires substantial memory and compute resources, high-performance computing systems such as JUWELS Booster are essential. This paper presents the results of the novel hardware architecture exploration conducted within the project.
T2  - WHPC@SC23: 16th International Women in HPC Workshop
CY  - 12 Nov 2023 - 17 Nov 2023, Denver, Colorado (USA)
Y2  - 12 Nov 2023 - 17 Nov 2023
M2  - Denver, Colorado, USA
LB  - PUB:(DE-HGF)6
DO  - 10.34734/FZJ-2023-04874
UR  - https://juser.fz-juelich.de/record/1018546
ER  -