Talk (non-conference) (Invited) FZJ-2024-06883

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Efficient Computation of Low-Rank Representations to Reduce Memory Requirements in Deep Learning



2024

RWTH Aachen SFB 1481 Colloquium, AachenAachen, Germany, 11 Dec 2024 - 11 Dec 20242024-12-112024-12-11 [10.34734/FZJ-2024-06883]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: Computing an orthogonal basis that approximates the range or corange of a matrix is a ubiquitous problem in computational science and engineering. In numerous applications, a rapid decay of singular values permits the use of such bases to approximate a linear operator by restricting it to low-rank subspaces, thereby significantly reducing computational and storage demands. A powerful approach for constructing a basis with a specified rank or approximation tolerance is the (adaptive) randomized range finder. In this talk, we introduce a novel variant of this algorithm, based on the blocked Householder QR decomposition, optimized for modern GPU accelerators. This development is motivated by its potential to substantially lower memory requirements during the training of deep neural networks such as transformers. We discuss the GaLore (Gradient Low-Rank Projection) training framework, and demonstrate how the randomized range finder can be employed to derive low-rank representations of optimizer states. Further potential avenues for future research are discussed.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. OpenGPT-X - Aufbau eines Gaia-X Knotens für große KI-Sprachmodelle und innovative Sprachapplikations-Services; Teilvorhaben: Optimierung und Skalierung auf großen HPC-Systemen (68GX21007F) (68GX21007F)

Appears in the scientific report 2024
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Talks (non-conference)
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2024-12-11, last modified 2025-02-03


OpenAccess:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)