001     875342
005     20230711152627.0
024 7 _ |2 doi
|a 10.5194/egusphere-egu2020-13357
024 7 _ |2 Handle
|a 2128/24954
037 _ _ |a FZJ-2020-01965
041 _ _ |a English
100 1 _ |0 P:(DE-Juel1)165903
|a Kaffashzadeh, Najmeh
|b 0
|e Corresponding author
111 2 _ |a EGU2020: Sharing Geoscience Online
|c Vienna
|d 2020-05-04 - 2020-05-08
|g #shareEGU20
|w Austria
245 _ _ |a A Statistical Model for Automated Quality Assessment of the TOAR-II
260 _ _ |c 2020
336 7 _ |0 33
|2 EndNote
|a Conference Paper
336 7 _ |2 DataCite
|a Other
336 7 _ |2 BibTeX
|a INPROCEEDINGS
336 7 _ |2 DRIVER
|a conferenceObject
336 7 _ |2 ORCID
|a LECTURE_SPEECH
336 7 _ |0 PUB:(DE-HGF)6
|2 PUB:(DE-HGF)
|a Conference Presentation
|b conf
|m conf
|s 1591168427_5616
|x Other
520 _ _ |a The Tropospheric Ozone Assessment Report, phase 2, (TOAR-II) database is a collection of global ground-level ozone in-situ measurements from various locations. It also holds data of selected ozone precursors and meteorological variables. TOAR-II assembles air quality data from many different sources and thus requires a common data quality assessment (QA) to ensure the data meet the quality required for globally consistent analyses. The large volume of this database (more than 100,000 data series) enforces the use of automated, data-driven QA procedures. Accordingly, we have developed a statistical model for automated QA. This model consists of several statistical tests that are classified into several sub-groups. In this model, a QA-score (an indicator ranging from 0 to 1) was assigned to each individual data point to estimates the value‘s plausibility. The foundation of this concept is statistical hypothesis testing and the probability theory. This model was implemented in a Python package and is called AutoQA4Env. One application of AutoQA4Env is the data ingestion workflow of TOAR-II. The tool generates a data quality report which is then sent back to the data provider for inspection. Since AutoQA4Env is easily configurable, it allows the users to set quality thresholds and thus filter data according to their use case. While we primarily develop AutoQA4Env for air quality data, the same concept and model might be applicable to other databases and the software framework is flexible enough to allow for other use cases.
536 _ _ |0 G:(DE-HGF)POF3-512
|a 512 - Data-Intensive Science and Federated Computing (POF3-512)
|c POF3-512
|f POF III
|x 0
536 _ _ |0 G:(EU-Grant)787576
|a IntelliAQ - Artificial Intelligence for Air Quality (787576)
|c 787576
|f ERC-2017-ADG
|x 1
536 _ _ |0 G:(DE-Juel-1)ESDE
|a Earth System Data Exploration (ESDE)
|c ESDE
|x 2
588 _ _ |a Dataset connected to CrossRef
700 1 _ |0 0000-0001-5812-3183
|a Chang, Kai-Lan
|b 1
700 1 _ |0 P:(DE-Juel1)16212
|a Schröder, Sabine
|b 2
700 1 _ |0 P:(DE-Juel1)6952
|a Schultz, Martin G.
|b 3
773 _ _ |a 10.5194/egusphere-egu2020-13357
856 4 _ |u https://juser.fz-juelich.de/record/875342/files/Abstract.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/875342/files/Presentation_Slides.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/875342/files/Abstract.pdf?subformat=pdfa
|x pdfa
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/875342/files/Presentation_Slides.pdf?subformat=pdfa
|x pdfa
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:875342
|p openaire
|p open_access
|p VDB
|p driver
|p ec_fundedresources
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)165903
|a Forschungszentrum Jülich
|b 0
|k FZJ
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)16212
|a Forschungszentrum Jülich
|b 2
|k FZJ
910 1 _ |0 I:(DE-588b)5008462-8
|6 P:(DE-Juel1)6952
|a Forschungszentrum Jülich
|b 3
|k FZJ
913 1 _ |0 G:(DE-HGF)POF3-512
|1 G:(DE-HGF)POF3-510
|2 G:(DE-HGF)POF3-500
|3 G:(DE-HGF)POF3
|4 G:(DE-HGF)POF
|a DE-HGF
|b Key Technologies
|l Supercomputing & Big Data
|v Data-Intensive Science and Federated Computing
|x 0
914 1 _ |y 2020
915 _ _ |0 StatID:(DE-HGF)0510
|2 StatID
|a OpenAccess
915 _ _ |0 LIC:(DE-HGF)CCBY4
|2 HGFVOC
|a Creative Commons Attribution CC BY 4.0
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a conf
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 _ _ |a OPENSCIENCE
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21