TY - CONF
AU - John, Chelsea Maria
AU - Herten, Andreas
TI - Novel Architecture Exploration - OpenGPT-X: Open Large Language Models
M1 - FZJ-2023-04874
PY - 2023
AB - The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open-source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Due to the substantial memory and compute resources required for efficiently training large language models, high-performance computing systems such as JUWELS Booster are essential. This paper presents the results of the exploration of novel hardware architecture conducted within the scope of the project.
T2 - WHPC@SC23: 16th International Women in HPC Workshop
CY - 12 Nov 2023 - 17 Nov 2023, Denver, Colorado (USA)
Y2 - 12 Nov 2023 - 17 Nov 2023
M2 - Denver, Colorado, USA
LB - PUB:(DE-HGF)6
DO - DOI:10.34734/FZJ-2023-04874
UR - https://juser.fz-juelich.de/record/1018546
ER -