Conference Presentation (Other) FZJ-2020-01952

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Performance analysis and optimization of a TByte-scale atmospheric observation database

 ;  ;  ;

2020

EGU2020: Sharing Geoscience Online, EGU2020, onlineonline, online conference, 4 May 2020 - 8 May 20202020-05-042020-05-08

Please use a persistent id in citations:

Abstract: We present performance engineering of a TByte-scale air quality database (DB) system which was created by the Tropospheric Ozone Assessment Report (TOAR) and contains one of the world’s largest collections of near-surface air quality measurements. A special feature of our data service https://join.fz-juelich.deis on-demand processing of several air quality metrics directly from the TOAR database. As a service that is used by more than 150 users of the international air quality research community, our web service must be easily accessible and functionally flexible, while delivering good performance. The current on-demand calculations of air quality metrics outside the database are identified as the major performance bottleneck. In this study, we therefore explore and benchmark in-database approaches for the statistical processing, which result in performance enhancements of up to 32%. We will furthermore show how the web service infrastructure can be extended in functionality, allowing the calculation of flux-based ozone metrics.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 512 - Data-Intensive Science and Federated Computing (POF3-512) (POF3-512)
  2. IntelliAQ - Artificial Intelligence for Air Quality (787576) (787576)
  3. PhD no Grant - Doktorand ohne besondere Förderung (PHD-NO-GRANT-20170405) (PHD-NO-GRANT-20170405)
  4. Earth System Data Exploration (ESDE) (ESDE)

Appears in the scientific report 2020
Database coverage:
Creative Commons Attribution CC BY 4.0 ; OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Conference Presentations
Workflow collections > Public records
Institute Collections > JSC
JuOSC (Juelich Open Science Collection)
Publications database
Open Access

 Record created 2020-05-11, last modified 2023-07-11