001019494 001__ 1019494
001019494 005__ 20231220201930.0
001019494 0247_ $$2doi$$a10.48565/BONNDOC-179
001019494 0247_ $$2datacite_doi$$a10.34734/FZJ-2023-05441
001019494 037__ $$aFZJ-2023-05441
001019494 041__ $$aEnglish
001019494 1001_ $$0P:(DE-Juel1)171435$$aBetancourt, Clara$$b0$$eCorresponding author$$ufzj
001019494 245__ $$aMapping and Interpolation of Tropospheric Ozone Data with Machine Learning Methods$$f2019-06-01 - 2023-08-28
001019494 260__ $$aBonn$$bUniversitäts- und Landesbibliothek Bonn$$c2023
001019494 300__ $$a167 p.
001019494 3367_ $$2DataCite$$aOutput Types/Dissertation
001019494 3367_ $$2ORCID$$aDISSERTATION
001019494 3367_ $$2BibTeX$$aPHDTHESIS
001019494 3367_ $$02$$2EndNote$$aThesis
001019494 3367_ $$0PUB:(DE-HGF)11$$2PUB:(DE-HGF)$$aDissertation / PhD Thesis$$bphd$$mphd$$s1703066925_23279
001019494 3367_ $$2DRIVER$$adoctoralThesis
001019494 502__ $$aDissertation, Universität Bonn, 2023$$bDissertation$$cUniversität Bonn$$d2023$$o2023-08-28
001019494 520__ $$aTropospheric ozone is a toxic trace gas in the atmosphere. It threatens human health, damages crops and vegetation, and it is a short-lived climate forcer. Ozone is a secondary air pollutant that undergoes multiple physical and chemical processes on a wide range of timescales. Therefore, as with many environmental variables, it is difficult to quantify ozone concentrations where measurements are not available. To solve this problem, the goal of this work is to develop spatiotemporal mapping and interpolation methods using machine learning techniques with the example application of ozone data. We train the machine learning models on a large number of ozone measurements available in the Tropospheric Ozone Assessment Report (TOAR) database. The most important contributions of this work are: • Mapping and interpolating ozone data, providing high-resolution, high-accuracy, spatiotemporal data products. The data products cover spatial domains from the regional to the global level, and their temporal resolution ranges from hourly data to multi-year statistics. We use large quantities of ozone measurements, combined with model data and geospatial data to generate the data products. • Adapting, developing, and explaining new state-of-the-art machine learning methods that we use to create these data products. The most relevant algorithms of this work are tree-based and graph-based methods. For example, we develop a multi-scale evaluation technique for spatial machine learning models and verify their physical consistency by using Shapley additive explanations. • Utilizing spatiotemporal patterns in geospatial data and ozone measurements in machine learning models. We use aggregated local to regional geospatial site conditions as input features for machine learning models. 
Furthermore, we adopt a graph machine learning algorithm to work on ozone measurements at irregularly placed air quality monitoring stations. With this work, we publish AQ-Bench, a benchmark dataset for machine learning on global long-term ozone metrics. We link explainable machine learning on AQ-Bench with uncertainty assessments to point out limits in the dataset and the applicability of the resulting machine learning models. With the trained models, we also create the first completely data-driven, global, high-resolution map of long-term ozone metrics (resolution 0.1°×0.1°, years 2010 - 2014). Finally, we develop a high-performance graph-based missing data interpolation method for ozone measurements. It has an index of agreement of 0.96 - 0.99 for hourly missing data interpolation in Germany. The synthesis of this work is that an interplay of physically sound data selection, uncertainty quantification, and explainability in machine learning can produce trustworthy environmental data products. We also found that the accuracy of the data products in a specific region is mainly dependent on good coverage with ozone measurements in that region. Therefore, this work contributes not only to the gapless quantification of ozone concentrations but also to trustworthy machine learning in the environmental sciences.
001019494 536__ $$0G:(DE-HGF)POF4-5111$$a5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511)$$cPOF4-511$$fPOF IV$$x0
001019494 536__ $$0G:(EU-Grant)787576$$aIntelliAQ - Artificial Intelligence for Air Quality (787576)$$c787576$$fERC-2017-ADG$$x1
001019494 588__ $$aDataset connected to DataCite
001019494 650_7 $$2Other$$aair quality
001019494 650_7 $$2Other$$atropospheric ozone
001019494 650_7 $$2Other$$amachine learning
001019494 650_7 $$2Other$$addc:550
001019494 773__ $$a10.48565/BONNDOC-179
001019494 8564_ $$uhttps://juser.fz-juelich.de/record/1019494/files/FZJ-2023-05441.pdf$$yOpenAccess
001019494 8564_ $$uhttps://juser.fz-juelich.de/record/1019494/files/FZJ-2023-05441.gif?subformat=icon$$xicon$$yOpenAccess
001019494 8564_ $$uhttps://juser.fz-juelich.de/record/1019494/files/FZJ-2023-05441.jpg?subformat=icon-1440$$xicon-1440$$yOpenAccess
001019494 8564_ $$uhttps://juser.fz-juelich.de/record/1019494/files/FZJ-2023-05441.jpg?subformat=icon-180$$xicon-180$$yOpenAccess
001019494 8564_ $$uhttps://juser.fz-juelich.de/record/1019494/files/FZJ-2023-05441.jpg?subformat=icon-640$$xicon-640$$yOpenAccess
001019494 909CO $$ooai:juser.fz-juelich.de:1019494$$popenaire$$popen_access$$pdriver$$pVDB$$pec_fundedresources$$pdnbdelivery
001019494 9101_ $$0I:(DE-588b)5008462-8$$6P:(DE-Juel1)171435$$aForschungszentrum Jülich$$b0$$kFZJ
001019494 9131_ $$0G:(DE-HGF)POF4-511$$1G:(DE-HGF)POF4-510$$2G:(DE-HGF)POF4-500$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$9G:(DE-HGF)POF4-5111$$aDE-HGF$$bKey Technologies$$lEngineering Digital Futures – Supercomputing, Data Management and Information Security for Knowledge and Action$$vEnabling Computational- & Data-Intensive Science and Engineering$$x0
001019494 9141_ $$y2023
001019494 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
001019494 9201_ $$0I:(DE-Juel1)JSC-20090406$$kJSC$$lJülich Supercomputing Center$$x0
001019494 980__ $$aphd
001019494 980__ $$aVDB
001019494 980__ $$aUNRESTRICTED
001019494 980__ $$aI:(DE-Juel1)JSC-20090406
001019494 9801_ $$aFullTexts