Efficient Computation of Low-Rank Representations to Reduce Memory Requirements in LLM Training
Conference Presentation (Invited) | FZJ-2024-06889
2024
Please use a persistent id in citations: doi:10.34734/FZJ-2024-06889
Abstract: The OpenGPT-X project represents one of Europe’s pioneering publicly funded efforts in the domain of large language models (LLMs), covering the entire lifecycle from pre-training foundation models to fine-tuning and practical application development. To maximize training efficiency on High Performance Computing (HPC) resources, we explore strategies for reducing computational and memory demands. A promising avenue exploits the low-rank structure of gradients, as done in the LoRA and GaLore frameworks; the latter relies on computing dominant low-rank subspaces during training. The randomized range finder algorithm offers a more efficient alternative to computing a full singular value decomposition (SVD). We introduce a novel variant of the range finder, based on a blocked Householder QR decomposition, optimized for modern GPU accelerators.
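For context, below is a minimal NumPy sketch of the classic randomized range finder (Halko, Martinsson & Tropp, 2011) that the abstract references as the efficient alternative to a full SVD. The function name `randomized_range_finder` and its parameters are illustrative; the sketch shows the standard Gaussian-sampling variant with an ordinary thin QR step, not the blocked Householder QR variant introduced in the talk.

```python
import numpy as np

def randomized_range_finder(A, rank, oversample=10, seed=None):
    """Approximate an orthonormal basis Q for the dominant range of A.

    Classic randomized range finder (Halko, Martinsson & Tropp, 2011):
    sample the range of A with a Gaussian test matrix, then
    orthonormalize the samples with a thin QR decomposition.
    Illustrative sketch; not the blocked Householder variant from the talk.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    # Gaussian test matrix; a few extra columns tighten the approximation.
    Omega = rng.standard_normal((n, rank + oversample))
    Y = A @ Omega                 # sample the range of A
    Q, _ = np.linalg.qr(Y)       # thin QR: Q is m x (rank + oversample)
    return Q

if __name__ == "__main__":
    # Synthetic gradient-like matrix with a rapidly decaying spectrum.
    rng = np.random.default_rng(0)
    U, _ = np.linalg.qr(rng.standard_normal((2000, 50)))
    V, _ = np.linalg.qr(rng.standard_normal((500, 50)))
    s = np.logspace(0, -6, 50)
    G = (U * s) @ V.T

    # Project onto the dominant low-rank subspace, avoiding the
    # O(m * n * min(m, n)) cost of a full SVD.
    Q = randomized_range_finder(G, rank=20)
    G_low = Q @ (Q.T @ G)
    print("relative error:", np.linalg.norm(G - G_low) / np.linalg.norm(G))
```

Oversampling by a few extra columns is the standard way to tighten the subspace approximation; the contribution described in the abstract replaces the QR orthonormalization step with a blocked Householder formulation better suited to modern GPU accelerators.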