Contribution to a conference proceedings/Contribution to a book FZJ-2026-01056

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Performance Analysis of an Efficient Algorithm for Feature Extraction from Large Scale Meteorological Data Stores

 ;  ;  ;  ;

2025
ACM New York, NY, USA

Proceedings of the Platform for Advanced Scientific Computing Conference
PASC '25: Platform for Advanced Scientific Computing Conference, PASC 2025, FHNW University of Applied Sciences and Arts Northwestern Switzerland Brugg-WindischFHNW University of Applied Sciences and Arts Northwestern Switzerland Brugg-Windisch, Switzerland, 16 Jun 2025 - 18 Jun 20252025-06-162025-06-18
New York, NY, USA : ACM 9 p. () [10.1145/3732775.3733573]

This record in other databases:  

Please use a persistent id in citations: doi:  doi:

Abstract: In recent years, Numerical Weather Prediction (NWP) has undergone a major shift with the rapid move towards kilometer-scale global weather forecasts and the emergence of AI-based forecasting models. Together, these trends will contribute to a significant increase in the daily data volume generated by NWP models. Ensuring efficient and timely access to this growing data requires innovative data extraction techniques. As an alternative to traditional data extraction algorithms, the European Centre for Medium-Range Weather Forecasts (ECMWF) has introduced the Polytope feature extraction algorithm. This algorithm is designed to reduce data transfer between systems to a bare minimum by allowing the extraction of non-orthogonal shapes of data. In this paper, we evaluate Polytope's suitability as a replacement for current extraction mechanisms in operational weather forecasting. We first adapt the Polytope algorithm to operate on ECMWF's FDB (Fields DataBase) meteorological data stores, before evaluating this integrated system's performance and scalability on real-time operational data. Our analysis shows that the low overhead of running the Polytope algorithm, which is in the order of a few seconds at most, is far outweighed by the benefits of significantly reducing the size of the extracted data by up to several orders of magnitude compared to traditional bounding box methods. Our ensuing discussion focuses on quantifying the strengths and limitations of each individual part of the system to identify potential bottlenecks and areas for future improvement.


Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  2. Earth System Data Exploration (ESDE) (ESDE)

Database coverage:
Creative Commons Attribution CC BY 4.0 ; OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Dokumenttypen > Ereignisse > Beiträge zu Proceedings
Dokumenttypen > Bücher > Buchbeitrag
Workflowsammlungen > Öffentliche Einträge
Institutssammlungen > JSC
Publikationsdatenbank
Open Access

 Datensatz erzeugt am 2026-01-26, letzte Änderung am 2026-01-27


OpenAccess:
Volltext herunterladen PDF
Dieses Dokument bewerten:

Rate this document:
1
2
3
 
(Bisher nicht rezensiert)