001 | 912160 | ||
005 | 20240313103120.0 | ||
024 | 7 | _ | |a 10.48550/arXiv.2202.04925 |2 doi |
024 | 7 | _ | |a 2128/32943 |2 Handle |
037 | _ | _ | |a FZJ-2022-05378 |
100 | 1 | _ | |a Fischer, Kirsten |0 P:(DE-Juel1)180150 |b 0 |e Corresponding author |
245 | _ | _ | |a Decomposing neural networks as mappings of correlation functions |
260 | _ | _ | |c 2022 |b arXiv |
336 | 7 | _ | |a Preprint |b preprint |m preprint |0 PUB:(DE-HGF)25 |s 1670304027_9967 |2 PUB:(DE-HGF) |
336 | 7 | _ | |a WORKING_PAPER |2 ORCID |
336 | 7 | _ | |a Electronic Article |0 28 |2 EndNote |
336 | 7 | _ | |a preprint |2 DRIVER |
336 | 7 | _ | |a ARTICLE |2 BibTeX |
336 | 7 | _ | |a Output Types/Working Paper |2 DataCite |
520 | _ | _ | |a Understanding the functional principles of information processing in deep neural networks continues to be a challenge, in particular for networks with trained and thus non-random weights. To address this issue, we study the mapping between probability distributions implemented by a deep feed-forward network. We characterize this mapping as an iterated transformation of distributions, where the non-linearity in each layer transfers information between different orders of correlation functions. This allows us to identify essential statistics in the data, as well as different information representations that can be used by neural networks. Applied to an XOR task and to MNIST, we show that correlations up to second order predominantly capture the information processing in the internal layers, while the input layer also extracts higher-order correlations from the data. This analysis provides a quantitative and explainable perspective on classification. |
536 | _ | _ | |a 5232 - Computational Principles (POF4-523) |0 G:(DE-HGF)POF4-5232 |c POF4-523 |f POF IV |x 0 |
536 | _ | _ | |a 5234 - Emerging NC Architectures (POF4-523) |0 G:(DE-HGF)POF4-5234 |c POF4-523 |f POF IV |x 1 |
536 | _ | _ | |a ACA - Advanced Computing Architectures (SO-092) |0 G:(DE-HGF)SO-092 |c SO-092 |x 2 |
536 | _ | _ | |a RenormalizedFlows - Transparent Deep Learning with Renormalized Flows (BMBF-01IS19077A) |0 G:(DE-Juel-1)BMBF-01IS19077A |c BMBF-01IS19077A |x 3 |
536 | _ | _ | |a MSNN - Theory of multi-scale neuronal networks (HGF-SMHB-2014-2018) |0 G:(DE-Juel1)HGF-SMHB-2014-2018 |c HGF-SMHB-2014-2018 |f MSNN |x 4 |
536 | _ | _ | |a neuroIC002 - Recurrence and stochasticity for neuro-inspired computation (EXS-SF-neuroIC002) |0 G:(DE-82)EXS-SF-neuroIC002 |c EXS-SF-neuroIC002 |x 5 |
588 | _ | _ | |a Dataset connected to DataCite |
650 | _ | 7 | |a Disordered Systems and Neural Networks (cond-mat.dis-nn) |2 Other |
650 | _ | 7 | |a Machine Learning (stat.ML) |2 Other |
650 | _ | 7 | |a FOS: Physical sciences |2 Other |
650 | _ | 7 | |a FOS: Computer and information sciences |2 Other |
700 | 1 | _ | |a René, Alexandre |0 P:(DE-Juel1)178936 |b 1 |u fzj |
700 | 1 | _ | |a Keup, Christian |0 P:(DE-Juel1)171384 |b 2 |u fzj |
700 | 1 | _ | |a Layer, Moritz |0 P:(DE-Juel1)174497 |b 3 |u fzj |
700 | 1 | _ | |a Dahmen, David |0 P:(DE-Juel1)156459 |b 4 |u fzj |
700 | 1 | _ | |a Helias, Moritz |0 P:(DE-Juel1)144806 |b 5 |u fzj |
773 | _ | _ | |a 10.48550/arXiv.2202.04925 |
856 | 4 | _ | |u https://arxiv.org/abs/2202.04925 |
856 | 4 | _ | |u https://juser.fz-juelich.de/record/912160/files/Fischer_2022_arxiv.pdf |y OpenAccess |
909 | C | O | |o oai:juser.fz-juelich.de:912160 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 0 |6 P:(DE-Juel1)180150 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 1 |6 P:(DE-Juel1)178936 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 2 |6 P:(DE-Juel1)171384 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 3 |6 P:(DE-Juel1)174497 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 4 |6 P:(DE-Juel1)156459 |
910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 5 |6 P:(DE-Juel1)144806 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Natural, Artificial and Cognitive Information Processing |1 G:(DE-HGF)POF4-520 |0 G:(DE-HGF)POF4-523 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Neuromorphic Computing and Network Dynamics |9 G:(DE-HGF)POF4-5232 |x 0 |
913 | 1 | _ | |a DE-HGF |b Key Technologies |l Natural, Artificial and Cognitive Information Processing |1 G:(DE-HGF)POF4-520 |0 G:(DE-HGF)POF4-523 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Neuromorphic Computing and Network Dynamics |9 G:(DE-HGF)POF4-5234 |x 1 |
914 | 1 | _ | |y 2022 |
915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
920 | _ | _ | |l yes |
920 | 1 | _ | |0 I:(DE-Juel1)INM-6-20090406 |k INM-6 |l Computational and Systems Neuroscience |x 0 |
920 | 1 | _ | |0 I:(DE-Juel1)IAS-6-20130828 |k IAS-6 |l Theoretical Neuroscience |x 1 |
920 | 1 | _ | |0 I:(DE-Juel1)INM-10-20170113 |k INM-10 |l Jara-Institut Brain structure-function relationships |x 2 |
980 | 1 | _ | |a FullTexts |
980 | _ | _ | |a preprint |
980 | _ | _ | |a VDB |
980 | _ | _ | |a UNRESTRICTED |
980 | _ | _ | |a I:(DE-Juel1)INM-6-20090406 |
980 | _ | _ | |a I:(DE-Juel1)IAS-6-20130828 |
980 | _ | _ | |a I:(DE-Juel1)INM-10-20170113 |
981 | _ | _ | |a I:(DE-Juel1)IAS-6-20130828 |