Home > Publications database > Novel Architecture Exploration - OpenGPT-X: Open Large Language Models |
Conference Presentation (Invited) | FZJ-2023-04874 |
;
2023
This record in other databases:
Please use a persistent id in citations: doi:10.34734/FZJ-2023-04874
Abstract: The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open-source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Due to the substantial memory and compute resources required for efficiently training large language models, high-performance computing systems such as JUWELS Booster are essential. This paper presents the results of the exploration of novel hardware architecture conducted within the scope of the project.
![]() |
The record appears in these collections: |