Decomposing neural networks as mappings of correlation functions

Fischer, Kirsten; Dahmen, David; René, Alexandre; Layer, Moritz; Helias, Moritz; Keup, Christian

doi:10.48550/arXiv.2202.04925

Items
Marc 21

001			912160
005			20240313103120.0
024	7	_	\|a 10.48550/ARXIV.2202.04925 \|2 doi
024	7	_	\|a 10.48550/arXiv.2202.04925 \|2 doi
024	7	_	\|a 2128/32943 \|2 Handle
037	_	_	\|a FZJ-2022-05378
100	1	_	\|a Fischer, Kirsten \|0 P:(DE-Juel1)180150 \|b 0 \|e Corresponding author
245	_	_	\|a Decomposing neural networks as mappings of correlation functions
260	_	_	\|c 2022 \|b arXiv
336	7	_	\|a Preprint \|b preprint \|m preprint \|0 PUB:(DE-HGF)25 \|s 1670304027_9967 \|2 PUB:(DE-HGF)
336	7	_	\|a WORKING_PAPER \|2 ORCID
336	7	_	\|a Electronic Article \|0 28 \|2 EndNote
336	7	_	\|a preprint \|2 DRIVER
336	7	_	\|a ARTICLE \|2 BibTeX
336	7	_	\|a Output Types/Working Paper \|2 DataCite
520	_	_	\|a Understanding the functional principles of information processing in deep neural networks continues to be a challenge, in particular for networks with trained and thus non-random weights. To address this issue, we study the mapping between probability distributions implemented by a deep feed-forward network. We characterize this mapping as an iterated transformation of distributions, where the non-linearity in each layer transfers information between different orders of correlation functions. This allows us to identify essential statistics in the data, as well as different information representations that can be used by neural networks. Applied to an XOR task and to MNIST, we show that correlations up to second order predominantly capture the information processing in the internal layers, while the input layer also extracts higher-order correlations from the data. This analysis provides a quantitative and explainable perspective on classification.
536	_	_	\|a 5232 - Computational Principles (POF4-523) \|0 G:(DE-HGF)POF4-5232 \|c POF4-523 \|f POF IV \|x 0
536	_	_	\|a 5234 - Emerging NC Architectures (POF4-523) \|0 G:(DE-HGF)POF4-5234 \|c POF4-523 \|f POF IV \|x 1
536	_	_	\|a ACA - Advanced Computing Architectures (SO-092) \|0 G:(DE-HGF)SO-092 \|c SO-092 \|x 2
536	_	_	\|a RenormalizedFlows - Transparent Deep Learning with Renormalized Flows (BMBF-01IS19077A) \|0 G:(DE-Juel-1)BMBF-01IS19077A \|c BMBF-01IS19077A \|x 3
536	_	_	\|a MSNN - Theory of multi-scale neuronal networks (HGF-SMHB-2014-2018) \|0 G:(DE-Juel1)HGF-SMHB-2014-2018 \|c HGF-SMHB-2014-2018 \|f MSNN \|x 4
536	_	_	\|a neuroIC002 - Recurrence and stochasticity for neuro-inspired computation (EXS-SF-neuroIC002) \|0 G:(DE-82)EXS-SF-neuroIC002 \|c EXS-SF-neuroIC002 \|x 5
588	_	_	\|a Dataset connected to DataCite
650	_	7	\|a Disordered Systems and Neural Networks (cond-mat.dis-nn) \|2 Other
650	_	7	\|a Machine Learning (stat.ML) \|2 Other
650	_	7	\|a FOS: Physical sciences \|2 Other
650	_	7	\|a FOS: Computer and information sciences \|2 Other
700	1	_	\|a René, Alexandre \|0 P:(DE-Juel1)178936 \|b 1 \|u fzj
700	1	_	\|a Keup, Christian \|0 P:(DE-Juel1)171384 \|b 2 \|u fzj
700	1	_	\|a Layer, Moritz \|0 P:(DE-Juel1)174497 \|b 3 \|u fzj
700	1	_	\|a Dahmen, David \|0 P:(DE-Juel1)156459 \|b 4 \|u fzj
700	1	_	\|a Helias, Moritz \|0 P:(DE-Juel1)144806 \|b 5 \|u fzj
773	_	_	\|a 10.48550/arXiv.2202.04925
856	4	_	\|u https://arxiv.org/abs/2202.04925
856	4	_	\|u https://juser.fz-juelich.de/record/912160/files/Fischer_2022_arxiv.pdf \|y OpenAccess
909	C	O	\|o oai:juser.fz-juelich.de:912160 \|p openaire \|p open_access \|p VDB \|p driver \|p dnbdelivery
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 0 \|6 P:(DE-Juel1)180150
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 1 \|6 P:(DE-Juel1)178936
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 2 \|6 P:(DE-Juel1)171384
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 3 \|6 P:(DE-Juel1)174497
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 4 \|6 P:(DE-Juel1)156459
910	1	_	\|a Forschungszentrum Jülich \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 5 \|6 P:(DE-Juel1)144806
913	1	_	\|a DE-HGF \|b Key Technologies \|l Natural, Artificial and Cognitive Information Processing \|1 G:(DE-HGF)POF4-520 \|0 G:(DE-HGF)POF4-523 \|3 G:(DE-HGF)POF4 \|2 G:(DE-HGF)POF4-500 \|4 G:(DE-HGF)POF \|v Neuromorphic Computing and Network Dynamics \|9 G:(DE-HGF)POF4-5232 \|x 0
913	1	_	\|a DE-HGF \|b Key Technologies \|l Natural, Artificial and Cognitive Information Processing \|1 G:(DE-HGF)POF4-520 \|0 G:(DE-HGF)POF4-523 \|3 G:(DE-HGF)POF4 \|2 G:(DE-HGF)POF4-500 \|4 G:(DE-HGF)POF \|v Neuromorphic Computing and Network Dynamics \|9 G:(DE-HGF)POF4-5234 \|x 1
914	1	_	\|y 2022
915	_	_	\|a OpenAccess \|0 StatID:(DE-HGF)0510 \|2 StatID
920	_	_	\|l yes
920	1	_	\|0 I:(DE-Juel1)INM-6-20090406 \|k INM-6 \|l Computational and Systems Neuroscience \|x 0
920	1	_	\|0 I:(DE-Juel1)IAS-6-20130828 \|k IAS-6 \|l Theoretical Neuroscience \|x 1
920	1	_	\|0 I:(DE-Juel1)INM-10-20170113 \|k INM-10 \|l Jara-Institut Brain structure-function relationships \|x 2
980	1	_	\|a FullTexts
980	_	_	\|a preprint
980	_	_	\|a VDB
980	_	_	\|a UNRESTRICTED
980	_	_	\|a I:(DE-Juel1)INM-6-20090406
980	_	_	\|a I:(DE-Juel1)IAS-6-20130828
980	_	_	\|a I:(DE-Juel1)INM-10-20170113
981	_	_	\|a I:(DE-Juel1)IAS-6-20130828

Library	Collection	CLSMajor	CLSMinor	Language	Author

Marc 21

guest :: login JuSER
		Search		Submit		Personalize Your alerts Your baskets Your searches		Help