Mathematical Techniques to Reduce Memory Requirements in Deep Learning
Conference Presentation (Other) | FZJ-2024-06888
2024
Please use a persistent id in citations: doi:10.34734/FZJ-2024-06888
Abstract: We present a method to substantially lower memory requirements during the training of deep neural networks, based on the GaLore (Gradient Low-Rank Projection) training framework. A rapid decay of singular values in gradient matrices permits the use of low-rank bases to capture the relevant subspaces, reducing the memory required to store optimizer states between iterations. A novel, rank-adaptive, GPU-optimized version of the randomized range finder algorithm is employed to exploit this property, and future research directions are discussed.
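For intuition, the following is a minimal sketch of a standard rank-adaptive randomized range finder (in the style of Halko et al.) applied to a synthetic gradient matrix with rapidly decaying singular values, followed by a GaLore-style low-rank projection of the gradient. It is not the GPU-optimized variant presented in this work; all function names, tolerances, and matrix sizes are illustrative assumptions.

```python
import torch

def adaptive_range_finder(A, tol=1e-2, block=8, max_rank=None):
    """Sketch of a rank-adaptive randomized range finder.

    Grows an orthonormal basis Q in blocks of random directions until the
    residual ||A - Q Q^T A||_F drops below tol * ||A||_F, so the rank adapts
    to how quickly the singular values of A decay.
    """
    m, n = A.shape
    max_rank = max_rank or min(m, n)
    norm_A = torch.linalg.norm(A)
    Q = torch.empty(m, 0, device=A.device, dtype=A.dtype)
    while Q.shape[1] < max_rank:
        # Sample random directions and remove what the current basis already captures.
        Omega = torch.randn(n, block, device=A.device, dtype=A.dtype)
        Y = A @ Omega - Q @ (Q.T @ (A @ Omega))
        Q_new, _ = torch.linalg.qr(torch.cat([Q, Y], dim=1))
        Q = Q_new[:, :min(Q.shape[1] + block, max_rank)]
        # Stop once the basis explains A to the requested relative accuracy.
        if torch.linalg.norm(A - Q @ (Q.T @ A)) <= tol * norm_A:
            break
    return Q

# Synthetic "gradient" with rapidly decaying singular values (hypothetical sizes).
U, _ = torch.linalg.qr(torch.randn(1024, 1024))
V, _ = torch.linalg.qr(torch.randn(1024, 1024))
G = U @ torch.diag(torch.logspace(0, -6, 1024)) @ V.T

Q = adaptive_range_finder(G, tol=1e-2)
R = Q.T @ G          # compact (r x n) factor; optimizer states are kept at this size
G_lowrank = Q @ R    # projected gradient reconstructed when applying the update
print(Q.shape[1], torch.linalg.norm(G - G_lowrank) / torch.linalg.norm(G))
```

Because the optimizer states (e.g., Adam moments) are maintained only for the small projected factor rather than the full gradient matrix, the memory footprint scales with the adaptively chosen rank instead of the layer dimensions.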