001     864070
005     20230127125336.0
024 7 _ |a 2128/24952
|2 Handle
037 _ _ |a FZJ-2019-03979
041 _ _ |a English
100 1 _ |a Kaffashzadeh, Najmeh
|0 P:(DE-Juel1)165903
|b 0
|e Corresponding author
|u fzj
245 _ _ |a A New Tool for Automated Quality Control of Environmental Data in Open Web Services
260 _ _ |c 2019
336 7 _ |a Preprint
|b preprint
|m preprint
|0 PUB:(DE-HGF)25
|s 1564042965_25756
|2 PUB:(DE-HGF)
336 7 _ |a WORKING_PAPER
|2 ORCID
336 7 _ |a Electronic Article
|0 28
|2 EndNote
336 7 _ |a preprint
|2 DRIVER
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a Output Types/Working Paper
|2 DataCite
520 _ _ |a We report on the development of a new software tool (auto-qc) for automated quality control (QC) of environmental timeseries data. Novel features of this tool include a flexible Python software architecture, which makes it easy for users to configure the sequence of tests as well as their statistical parameters, and a statistical concept to assign each value a probability of being a correct value. There are many occasions when it is necessary to inspect the quality of environmental datasets, from first quality checks during real-time sampling and data transmission to assessing the quality of long-term monitoring data from measurement stations. Erroneous data can have a substantial impact on the statistical data analysis and, for example, lead to wrong estimates of trends. Existing QC workflows largely rely on individual investigator knowledge and have often been constructed from practical considerations alone. Our tool aims to complement traditional data quality analyses and adds some insights into the nature of the individual tests that are being applied.
536 _ _ |a 512 - Data-Intensive Science and Federated Computing (POF3-512)
|0 G:(DE-HGF)POF3-512
|c POF3-512
|f POF III
|x 0
536 _ _ |a IntelliAQ - Artificial Intelligence for Air Quality (787576)
|0 G:(EU-Grant)787576
|c 787576
|f ERC-2017-ADG
|x 1
536 _ _ |0 G:(DE-Juel-1)ESDE
|a Earth System Data Exploration (ESDE)
|c ESDE
|x 2
700 1 _ |a Kleinert, Felix
|0 P:(DE-Juel1)176602
|b 1
|u fzj
700 1 _ |a Schultz, Martin
|0 P:(DE-Juel1)6952
|b 2
|u fzj
856 4 _ |u https://easychair.org/publications/preprint/cqRB
856 4 _ |u https://juser.fz-juelich.de/record/864070/files/EasyChair-Preprint-1325.pdf
|y OpenAccess
856 4 _ |u https://juser.fz-juelich.de/record/864070/files/EasyChair-Preprint-1325.pdf?subformat=pdfa
|x pdfa
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:864070
|p openaire
|p open_access
|p driver
|p VDB
|p ec_fundedresources
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)165903
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 1
|6 P:(DE-Juel1)176602
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)6952
913 1 _ |a DE-HGF
|b Key Technologies
|1 G:(DE-HGF)POF3-510
|0 G:(DE-HGF)POF3-512
|2 G:(DE-HGF)POF3-500
|v Data-Intensive Science and Federated Computing
|x 0
|4 G:(DE-HGF)POF
|3 G:(DE-HGF)POF3
|l Supercomputing & Big Data
914 1 _ |y 2019
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)JSC-20090406
|k JSC
|l Jülich Supercomputing Center
|x 0
980 _ _ |a preprint
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)JSC-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21