TY - CONF AU - Penke, Carolin TI - An Introduction to Large Language Models M1 - FZJ-2024-06880 PY - 2024 AB - Large Language Models (LLMs) have revolutionized the field of artificial intelligence, enabling advanced text generation and understanding. This talk provides a concise overview of LLMs, focusing on their development, architecture, and implementation. We explain key concepts, and give details on the backbone of modern LLMs: the transformer architecture and its innovative attention mechanism. To be able to train these models on supercomputers, advanced parallelization techniques are needed. Recent advancements and promising trends are identified. Through the lens of the OpenGPT-X project, this presentation will highlight the collaborative efforts in developing multilingual, open-source LLMs. T2 - Women in Data Science Conference Chemnitz CY - 6 Jun 2024 - 7 Jun 2024, Chemnitz (Germany) Y2 - 6 Jun 2024 - 7 Jun 2024 M2 - Chemnitz, Germany LB - PUB:(DE-HGF)6 UR - https://juser.fz-juelich.de/record/1034059 ER -