Preprint FZJ-2020-05014

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
MLAir (v1.0) – a tool to enable fast and flexible machine learning on air data time series

 ;  ;

2020
Copernicus Katlenburg-Lindau

Geoscientific model development discussions 2020, 1-41 () [10.5194/gmd-2020-332]

This record in other databases:  

Please use a persistent id in citations:   doi:

Abstract: With MLAir (Machine Learning on Air data) we created a software environment that simplifies and accelerates the exploration of new machine learning (ML) models for the analysis and forecasting of meteorological and air quality time series. Thereby MLAir is not developed as an abstract workflow, but hand in hand with actual scientific questions. It thus addresses scientists with either a meteorological or a ML background. Due to their relative ease of use and spectacular results in other application areas, neural networks and other ML methods are gaining enormous momentum also in the weather and air quality research communities. Even though there are already many books and tutorials describing how to conduct a ML experiment, there are many stumbling blocks for a newcomer. In contrast, people familiar with ML concepts and technology often have difficulties understanding the nature of atmospheric data. With MLAir we have addressed a number of these pitfalls so that it becomes easier for scientists of both domains to rapidly start off their ML application. MLAir has been developed in such a way that it is easy to use and is designed from the very beginning as a standalone, fully functional experiment. Due to its flexible, modular code base, code modifications are easy and personal experiment schedules can be quickly derived. The package also includes a set of simple validation tools to facilitate the evaluation of ML results using standard meteorological statistics. MLAir can easily be ported onto different computing environments from desktop workstations to high-end supercomputers with or without graphics processing units (GPU).

Classification:

Note: The corresponding software is avialable at https://doi.org/10.34730/fcc6b509d5394dad8cfdfc6e9fff2bec

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 512 - Data-Intensive Science and Federated Computing (POF3-512) (POF3-512)
  2. IntelliAQ - Artificial Intelligence for Air Quality (787576) (787576)
  3. PhD no Grant - Doktorand ohne besondere Förderung (PHD-NO-GRANT-20170405) (PHD-NO-GRANT-20170405)
  4. Earth System Data Exploration (ESDE) (ESDE)

Appears in the scientific report 2020
Database coverage:
Creative Commons Attribution CC BY 4.0 ; OpenAccess ; Ebsco Academic Search
Click to display QR Code for this record

The record appears in these collections:
Document types > Reports > Preprints
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2020-12-07, last modified 2023-01-27


Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)