Contribution to a conference proceedings/Contribution to a book FZJ-2025-01156

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Optimizing Automatic Differentiation with Deep Reinforcement Learning

 ;

2024

38th Conference on Neural Information Processing Systems, NeurIPS, VancouverVancouver, Canada, 9 Dec 2024 - 16 Dec 20242024-12-092024-12-16 Advances in neural information processing systems 38, () [10.34734/FZJ-2025-01156]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: Computing Jacobians with automatic differentiation is ubiquitous in many scientific domains such as machine learning, computational fluid dynamics, robotics, and finance. Even small savings in the number of computations or memory usage in Jacobian computations can already incur massive savings in energy consumption and runtime. While there exist many methods that allow for such savings, they generally trade computational efficiency for approximations of the exact Jacobian. In this paper, we present a novel method to optimize the number of necessary multiplications for Jacobian computation by leveraging deep reinforcement learning (RL) and a concept called cross-country elimination while still computing the exact Jacobian. Cross-country elimination is a framework for automatic differentiation that phrases Jacobian accumulation as ordered elimination of all vertices on the computational graph where every elimination incurs a certain computational cost. We formulate the search for the optimal elimination order that minimizes the number of necessary multiplications as a single player game which is played by an RL agent. We demonstrate that this method achieves up to 33% improvements over state-of-the-art methods on several relevant tasks taken from diverse domains. Furthermore, we show that these theoretical gains translate into actual runtime improvements by providing a cross-country elimination interpreter in JAX that can efficiently execute the obtained elimination orders.

Classification:

Note: Accepted as a spotlight paper.

Contributing Institute(s):
  1. Neuromorphic Software Eco System (PGI-15)
Research Program(s):
  1. 5234 - Emerging NC Architectures (POF4-523) (POF4-523)
  2. BMBF 16ME0400 - Verbundprojekt: Neuro-inspirierte Technologien der künstlichen Intelligenz für die Elektronik der Zukunft - NEUROTEC II - (16ME0400) (16ME0400)
  3. GREENEDGE - Taming the environmental impact of mobile networks through GREEN EDGE computing platforms (953775) (953775)

Appears in the scientific report 2024
Database coverage:
Medline ; OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Institute Collections > PGI > PGI-15
Workflow collections > Public records
Publications database
Open Access

 Record created 2025-01-27, last modified 2025-02-03


OpenAccess:
Download fulltext PDF
External link:
Download fulltextFulltext
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)