001 | 1018546 | ||
005 | 20250822121411.0 | ||
024 | 7 | _ | |a 10.34734/FZJ-2023-04874 |2 datacite_doi |
037 | _ | _ | |a FZJ-2023-04874 |
041 | _ | _ | |a English |
100 | 1 | _ | |a John, Chelsea Maria |0 P:(DE-Juel1)187395 |b 0 |e Corresponding author |u fzj |
111 | 2 | _ | |a WHPC@SC23: 16th International Women in HPC Workshop |g WHPC@SC23 |c Denver, Colorado |d 2023-11-12 - 2023-11-17 |w USA |
245 | _ | _ | |a Novel Architecture Exploration - OpenGPT-X: Open Large Language Models |
260 | _ | _ | |c 2023 |
336 | 7 | _ | |a Conference Paper |0 33 |2 EndNote |
336 | 7 | _ | |a Other |2 DataCite |
336 | 7 | _ | |a INPROCEEDINGS |2 BibTeX |
336 | 7 | _ | |a conferenceObject |2 DRIVER |
336 | 7 | _ | |a LECTURE_SPEECH |2 ORCID |
336 | 7 | _ | |a Conference Presentation |b conf |m conf |0 PUB:(DE-HGF)6 |s 1702537370_9248 |2 PUB:(DE-HGF) |x Invited |
520 | _ | _ | |a The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open-source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Due to the substantial memory and compute resources required for efficiently training large language models, high-performance computing systems such as JUWELS Booster are essential. This paper presents the results of the exploration of novel hardware architecture conducted within the scope of the project. |
536 | _ | _ | |a 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) |0 G:(DE-HGF)POF4-5112 |c POF4-511 |f POF IV |x 0 |
536 | _ | _ | |a 5121 - Supercomputing & Big Data Facilities (POF4-512) |0 G:(DE-HGF)POF4-5121 |c POF4-512 |f POF IV |x 1 |
536 | _ | _ | |a OpenGPT-X - Aufbau eines Gaia-X Knotens für große KI-Sprachmodelle und innovative Sprachapplikations-Services; Teilvorhaben: Optimierung und Skalierung auf großen HPC-Systemen (68GX21007F) |0 G:(DE-Juel-1)68GX21007F |c 68GX21007F |x 2 |
536 | _ | _ | |a ATML-X-DEV - ATML Accelerating Devices (ATML-X-DEV) |0 G:(DE-Juel-1)ATML-X-DEV |c ATML-X-DEV |x 3 |
588 | _ | _ | |a Dataset connected to DataCite |
700 | 1 | _ | |a Herten, Andreas |0 P:(DE-Juel1)145478 |b 1 |u fzj |
856 | 4 | _ | |y OpenAccess |u https://juser.fz-juelich.de/record/1018546/files/FZJ-2023-04874.pdf |
856 | 4 | _ | |y OpenAccess |x icon |u https://juser.fz-juelich.de/record/1018546/files/FZJ-2023-04874.gif?subformat=icon |
856 | 4 | _ | |y OpenAccess |x icon-1440 |u https://juser.fz-juelich.de/record/1018546/files/FZJ-2023-04874.jpg?subformat=icon-1440 |
856 | 4 | _ | |y OpenAccess |x icon-180 |u https://juser.fz-juelich.de/record/1018546/files/FZJ-2023-04874.jpg?subformat=icon-180 |
856 | 4 | _ | |y OpenAccess |x icon-640 |u https://juser.fz-juelich.de/record/1018546/files/FZJ-2023-04874.jpg?subformat=icon-640 |
909 | C | O | |o oai:juser.fz-juelich.de:1018546 |p openaire |p open_access |p VDB |p driver |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)187395 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 1 |6 P:(DE-Juel1)145478 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-511 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Enabling Computational- & Data-Intensive Science and Engineering |9 G:(DE-HGF)POF4-5112 |x 0 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Engineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action |1 G:(DE-HGF)POF4-510 |0 G:(DE-HGF)POF4-512 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Supercomputing & Big Data Infrastructures |9 G:(DE-HGF)POF4-5121 |x 1 |
914 | 1 | _ | |y 2023 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)JSC-20090406 |k JSC |l Jülich Supercomputing Center |x 0 |
980 | _ | _ | |a conf |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)JSC-20090406 |
980 | 1 | _ | |a FullTexts |