Preprint FZJ-2023-00752

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
A Recursively Recurrent Neural Network (R2N2) Architecture for Learning Iterative Algorithms

 ;  ;  ;  ;  ;  ;

2022
arXiv

arXiv () [10.48550/ARXIV.2211.12386]

This record in other databases:

Please use a persistent id in citations:   doi:

Abstract: Meta-learning of numerical algorithms for a given task consist of the data-driven identification and adaptation of an algorithmic structure and the associated hyperparameters. To limit the complexity of the meta-learning problem, neural architectures with a certain inductive bias towards favorable algorithmic structures can, and should, be used. We generalize our previously introduced Runge-Kutta neural network to a recursively recurrent neural network (R2N2) superstructure for the design of customized iterative algorithms. In contrast to off-the-shelf deep learning approaches, it features a distinct division into modules for generation of information and for the subsequent assembly of this information towards a solution. Local information in the form of a subspace is generated by subordinate, inner, iterations of recurrent function evaluations starting at the current outer iterate. The update to the next outer iterate is computed as a linear combination of these evaluations, reducing the residual in this space, and constitutes the output of the network. We demonstrate that regular training of the weight parameters inside the proposed superstructure on input/output data of various computational problem classes yields iterations similar to Krylov solvers for linear equation systems, Newton-Krylov solvers for nonlinear equation systems, and Runge-Kutta integrators for ordinary differential equations. Due to its modularity, the superstructure can be readily extended with functionalities needed to represent more general classes of iterative algorithms traditionally based on Taylor series expansions.

Keyword(s): Machine Learning (cs.LG) ; Numerical Analysis (math.NA) ; FOS: Computer and information sciences ; FOS: Mathematics


Contributing Institute(s):
  1. Modellierung von Energiesystemen (IEK-10)
Research Program(s):
  1. 1121 - Digitalization and Systems Technology for Flexibility Solutions (POF4-112) (POF4-112)
  2. HDS LEE - Helmholtz School for Data Science in Life, Earth and Energy (HDS LEE) (HDS-LEE-20190612) (HDS-LEE-20190612)

Appears in the scientific report 2022
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Institute Collections > ICE > ICE-1
Document types > Reports > Preprints
Workflow collections > Public records
IEK > IEK-10
Publications database
Open Access

 Record created 2023-01-17, last modified 2024-07-12


OpenAccess:
Download fulltext PDF
External link:
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)