Bachelor Thesis FZJ-2025-02391

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Deep Leaning based data compression for climate data



2024

44p () [10.34734/FZJ-2025-02391] = Bachelorarbeit, FH Aachen, 2024

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: With the amount of data generated in climate science from scientific models reachingmultiple terabytes per simulation, it becomes difficult to provide sufficient storagespace and achieve satisfactory data transfer rates. In order to be able to store moreinformation, data compression becomes increasingly relevant, as it allows the size ofthe data to be decreased significantly.In this work a novel approach to lossy data compression for climate data by usingthe deep learning model AtmoRep is being analysed. AtmoRep uses large-scalerepresentation learning to determine a general description of the highly complex,stochastic dynamics of the atmosphere which can be used to achieve a varietyof different tasks. Specifically, I want to test the accuracy of data reconstructionthat can be achieved with this model. This reconstructive ability can be used toefficiently compress data by purposefully removing parts of the data, which can bereconstructed using AtmoRep.To determine an optimal configuration of the AtmoRep model, this thesis proposesa series of different experiments where portions of temperature data are completelyremoved from the dataset at varying temporal positions and then reconstructedusing AtmoRep. The reconstructed data is subsequently analyzed and compared tothe original data to determine the quality of the data in relation to the compressionratio.For comparison, the results of AtmoRep are being compared to those of commonlyused lossless and lossy data compression algorithms for climate data.


Note: Bachelorarbeit, FH Aachen, 2024

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)

Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Theses > Bachelor Theses
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2025-04-30, last modified 2025-04-30


OpenAccess:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)