Journal Article FZJ-2023-03392

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Graph Machine Learning for Improved Imputation of Missing Tropospheric Ozone Data

 ;  ;  ;

2023
American Chemical Society Columbus, Ohio

Environmental science & technology 57, 18246-18258 () [10.1021/acs.est.3c05104] special issue: "Data Science for Advancing Environmental Science, Engineering and Technology"

This record in other databases:    

Please use a persistent id in citations: doi:  doi:

Abstract: Gaps in the measurement series of atmospheric pollutants can impede the reliable assessment of their impacts and trends. We propose a new method for missing data imputation of the air pollutant tropospheric ozone by using the graph machine learning algorithm “correct and smooth”. This algorithm uses auxiliary data that characterize the measurement location and, in addition, ozone observations at neighboring sites to improve the imputations of simple statistical and machine learning models. We apply our method to data from 278 stations of the year 2011 of the German Environment Agency (Umweltbundesamt – UBA) monitoring network. The preliminary version of these data exhibits three gap patterns: shorter gaps in the range of hours, longer gaps of up to several months in length, and gaps occurring at multiple stations at once. For short gaps of up to 5 h, linear interpolation is most accurate. Longer gaps at single stations are most effectively imputed by a random forest in connection with the correct and smooth. For longer gaps at multiple stations, the correct and smooth algorithm improved the random forest despite a lack of data in the neighborhood of the missing values. We therefore suggest a hybrid of linear interpolation and graph machine learning for the imputation of tropospheric ozone time series.

Classification:

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  2. IntelliAQ - Artificial Intelligence for Air Quality (787576) (787576)

Appears in the scientific report 2023
Database coverage:
Medline ; Creative Commons Attribution CC BY 4.0 ; OpenAccess ; BIOSIS Previews ; Biological Abstracts ; Clarivate Analytics Master Journal List ; Current Contents - Agriculture, Biology and Environmental Sciences ; Ebsco Academic Search ; Essential Science Indicators ; IF >= 10 ; JCR ; NationallizenzNationallizenz ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection ; Zoological Record
Click to display QR Code for this record

The record appears in these collections:
Dokumenttypen > Aufsätze > Zeitschriftenaufsätze
Workflowsammlungen > Öffentliche Einträge
Workflowsammlungen > Publikationsgebühren
Institutssammlungen > JSC
Publikationsdatenbank
Open Access

 Datensatz erzeugt am 2023-09-06, letzte Änderung am 2024-01-16


OpenAccess:
Volltext herunterladen PDF
Dieses Dokument bewerten:

Rate this document:
1
2
3
 
(Bisher nicht rezensiert)