Contribution to a conference proceedings/Contribution to a book FZJ-2021-00117

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Ensemble Kalman Filter Optimizing Deep Neural Networks: An Alternative Approach to Non-performing Gradient Descent

 ;  ;  ;

2020
Springer Cham

Machine Learning, Optimization, and Data Science
The Sixth International Conference on Machine Learning, Optimization, and Data Science, LOD2020, SienaSiena, Italy, 19 Jul 2020 - 22 Jul 20202020-07-192020-07-22
Cham : Springer, Lecture Notes in Computer Science 12566, : 5th ed., 78-92 () [10.1007/978-3-030-64580-9_7]

This record in other databases:

Please use a persistent id in citations:   doi:

Abstract: The successful training of deep neural networks is dependent on initialization schemes and choice of activation functions. Non-optimally chosen parameter settings lead to the known problem of exploding or vanishing gradients. This issue occurs when gradient descent and backpropagation are applied. For this setting the Ensemble Kalman Filter (EnKF) can be used as an alternative optimizer when training neural networks. The EnKF does not require the explicit calculation of gradients or adjoints and we show this resolves the exploding and vanishing gradient problem. We analyze different parameter initializations, propose a dynamic change in ensembles and compare results to established methods.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 511 - Computational Science and Mathematical Methods (POF3-511) (POF3-511)
  2. SMHB - Supercomputing and Modelling for the Human Brain (HGF-SMHB-2013-2017) (HGF-SMHB-2013-2017)
  3. CSD-SSD - Center for Simulation and Data Science (CSD) - School for Simulation and Data Science (SSD) (CSD-SSD-20190612) (CSD-SSD-20190612)
  4. SLNS - SimLab Neuroscience (Helmholtz-SLNS) (Helmholtz-SLNS)
  5. HDS LEE - Helmholtz School for Data Science in Life, Earth and Energy (HDS LEE) (HDS-LEE-20190612) (HDS-LEE-20190612)
  6. PhD no Grant - Doktorand ohne besondere Förderung (PHD-NO-GRANT-20170405) (PHD-NO-GRANT-20170405)
  7. HAF - Helmholtz Analytics Framework (ZT-I-0003) (ZT-I-0003)

Appears in the scientific report 2020
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2021-01-09, last modified 2021-01-27


OpenAccess:
Download fulltext PDF
External link:
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)