| Home > Publications database > From Parsing to Embeddings: The Hidden Challenges of RAG Development |
| Poster (Outreach) | FZJ-2026-00874 |
; ;
2025
This record in other databases:
Please use a persistent id in citations: doi:10.34734/FZJ-2026-00874
Abstract: Retrieval-Augmented Generation (RAG) is emerging as a powerful method for improving the accuracy and relevance of AI-generated responses by combining information retrieval with large language models (LLMs). In this project, we explore how RAG can be leveraged to build an intelligent chatbot that assists users in navigating high-performance computing (HPC) documentation.Our chatbot is designed to dynamically retrieve information from documentation related to the supercomputers at Forschungszentrum Jülich (JUWELS, JURECA, JUSUF). Developing an efficient RAG-based application presents several challenges, including properly parsing documentation in various formats, effectively segmenting the parsed text into meaningful chunks, and selecting optimal models for retrieval and generation.This work contributes to a broader understanding of RAG’s capabilities and limitations in specialized technical domains, offering insights into its potential for improving user support in complex computing environments.
|
The record appears in these collections: |