Conference Presentation (Invited) FZJ-2025-01589

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Helmholtz Blablador: An Inference Server for Scientific LargeLanguage Models



2024

1st Large Language Models in Physics Symposium, LIPS, DESYHamburg, DESY, Germany, 21 Feb 2024 - 23 Feb 20242024-02-212024-02-23

Abstract: Recent advances in large language models (LLMs) like chatGPT have demonstrated their potentialfor generating human-like text and reasoning about topics with natural language. However, ap-plying these advanced LLMs requires significant compute resources and expertise that are out ofreach for most academic researchers. To make scientific LLMs more accessible, we have developedHelmholtz Blablador, an open-source inference server optimized for serving predictions from cus-tomized scientific LLMs.Blablador provides the serving infrastructure to make models accessible via a simple API withoutmanaging servers, firewalls, authentication or infrastructure. Researchers can add their pretrainedLLMs to the central hub. Other scientists can then query the collective model catalog via web or usingthe popular OpenAI api to add LLM functionality in other tools, like programming IDEs.This enables a collaborative ecosystem for scientific LLMs:• Researchers train models using datasets and GPUs from their own lab. No need to set up produc-tion servers. They can even provide their models with inference happening on cpus, with the useof tools like llama.cpp.• Models are contributed to the Blablador hub through a web UI or API call. Blablador handlesloading models and publishing models for general use.• Added models become available for querying by other researchers. A model catalog displaysavailable LLMs from different labs and research areas.Besides that, one can train, quantize, fine-tune and evaluate LLMs directly with Blablador.The inference server is available at http://helmholtz-blablador.fz-juelich.de


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. Helmholtz AI Consultant Team FB Information (E54.303.11) (E54.303.11)

Appears in the scientific report 2024
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Conference Presentations
Workflow collections > Public records
Institute Collections > JSC
Publications database

 Record created 2025-01-31, last modified 2025-02-03


Restricted:
Download fulltext HTML
External link:
Download fulltextFulltext
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)