Master Thesis FZJ-2026-00524

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
An Investigation of a Multimodal Variational Autoencoder Framework for Physics Data



2025

75p () [10.34734/FZJ-2026-00524] = Masterarbeit, RWTH Aachen, 2025

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: Many scientific domains, such as physics, provide multimodal data when observing complex phenomena or when doing experiments. Understanding individual contributions of each modality can help to optimise experimental setups and sensors, thereby potentially increasing accuracy on domain-specific tasks that rely on such data. This thesis examines the role of multimodal data in (downstream) prediction tasks, with a focus on the unique and shared contributions of the respective modalities. Disentangled representation learning is a paradigm that aims to extract the independent, underlying factors from data. We employ this approach for multimodal data, proposing an extension to the disentangled multimodal variational autoencoder (DMVAE) by incorporating an additional optimisation objective to enforce minimal redundancy between shared and unique latent representations extracted by the DMVAE. Based on these representations, we train and evaluate several downstream tasks to study their contributions to the task. We compare this method to the traditional DMVAE and VAE across multimodal and single-modal configurations and also compare it directly to regression models. In our experiments, this approach is applied to the Multimodal Universe (MMU) astronomical dataset, which includes both imagery and spectral data. We also evaluate the impact of a physical-model-based differentiable image decoder model for extracting meaningful parameters into the latent space. Addi-tionally, the setup is applied to HyPlant hyperspectral remote sensing data, which consists of airborne measurements of Earth’s surface, to study it as a source of multimodal data to test how much information images and spectra contain about hyperspectral data.


Note: Masterarbeit, RWTH Aachen, 2025

Contributing Institute(s):
  1. Datenanalyse und Maschinenlernen (IAS-8)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)

Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Theses > Master Theses
Institute Collections > IAS > IAS-8
Workflow collections > Public records
Publications database
Open Access

 Record created 2026-01-16, last modified 2026-01-20


OpenAccess:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)