Deep Learning for Prediction and Control of Cellular Automata in Unreal Environments

Aach, Marcel
000892991 001__ 892991
000892991 005__ 20210623133408.0
000892991 0247_ $$2Handle$$a2128/27904
000892991 037__ $$aFZJ-2021-02488
000892991 1001_ $$0P:(DE-Juel1)180916$$aAach, Marcel$$b0$$eCorresponding author
000892991 245__ $$aDeep Learning for Prediction and Control of Cellular Automata in Unreal Environments$$f - 2021-02-12
000892991 260__ $$c2021
000892991 300__ $$a76 pages
000892991 3367_ $$2DataCite$$aOutput Types/Supervised Student Publication
000892991 3367_ $$02$$2EndNote$$aThesis
000892991 3367_ $$2BibTeX$$aMASTERSTHESIS
000892991 3367_ $$2DRIVER$$amasterThesis
000892991 3367_ $$0PUB:(DE-HGF)19$$2PUB:(DE-HGF)$$aMaster Thesis$$bmaster$$mmaster$$s1622611184_10531
000892991 3367_ $$2ORCID$$aSUPERVISED_STUDENT_PUBLICATION
000892991 502__ $$aMasterarbeit, University of Cologne, 2021$$bMasterarbeit$$cUniversity of Cologne$$d2021
000892991 520__ $$aIn this thesis, we show the ability of a deep convolutional neural network to understand the underlying transition rules of two-dimensional cellular automata by pure observation. To do so, we evaluate the network on a prediction task, where it has to predict the next state of some cellular automata, and a control task, where it has to intervene in the evolution of a cellular automaton to achieve a state of standstill. The cellular automata we use in this case are based on the classical Game of Life by John Conway and implemented in the Unreal Engine. With the usage of the Unreal Engine for data generation, a technical pipeline for processing output images with neural networks is established.Cellular automata in general are chaotic dynamical systems, making any sort of prediction or control very challenging, but using convolutional neural networks to exploit the locality of their interactions is a promising approach to solve these problems. The network we present in this thesis follows the Encoder-Decoder structure and features residual skip connections that serve as shortcuts in between the different layers. Recent advancements in the field of image recognition and segmentation have shown that both of these aspects are the key to success.The evaluation of the prediction task is split into several levels of generalization: we train the developed network on trajectories of several hundred different cellular automata, varying in their transition rules and neighborhood sizes. Results on a test set show that the network is able to learn the rules of even more complex cellular automata (with an accuracy of ≈ 93%). To some extent, it is even able to interpolate and generalize to completely unseen rules (with an accuracy of ≈ 77%). A qualitative investigation shows that static rules (not forcing many changes in between time steps) are among the easiest to predict.For the control task, we combine the encoder part of the developed neural network with a reinforcement agent and train it to stop all movements on the grid of the cellular automata as quickly as possible. To do so, the agent can change the state of a single cell per time step. A comparison between giving back rewards to agents continuously and giving them only in the case of success or failure shows that Proximal Policy Optimization agents do better with receiving sparse rewards while Deep Q-Network agents fare better with continuously receiving them. Both algorithms beat random agents on training data, but their generalization ability remains limited.
000892991 536__ $$0G:(DE-HGF)POF4-511$$a511 - Enabling Computational- & Data-Intensive Science and Engineering (POF4-511)$$cPOF4-511$$fPOF IV$$x0
000892991 8564_ $$uhttps://juser.fz-juelich.de/record/892991/files/Master_Thesis_MarcelAach.pdf$$yOpenAccess
000892991 909CO $$ooai:juser.fz-juelich.de:892991$$pdnbdelivery$$pdriver$$pVDB$$popen_access$$popenaire
000892991 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)180916$$aForschungszentrum Jülich$$b0$$kFZJ
000892991 9130_ $$0G:(DE-HGF)POF3-511$$1G:(DE-HGF)POF3-510$$2G:(DE-HGF)POF3-500$$3G:(DE-HGF)POF3$$4G:(DE-HGF)POF$$aDE-HGF$$bKey Technologies$$lSupercomputing & Big Data$$vComputational Science and Mathematical Methods$$x0
000892991 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
000892991 9141_ $$y2021
000892991 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000892991 920__ $$lyes
000892991 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
000892991 980__ $$amaster
000892991 980__ $$aVDB
000892991 980__ $$aUNRESTRICTED
000892991 980__ $$aI:(DE-Juel1)JSC-20090406
000892991 9801_ $$aFullTexts
guest :: login JuSER
		Search		Submit		Personalize Your alerts Your baskets Your searches		Help