TY - THES
AU - Effen, Moritz
TI - An Investigation of a Multimodal Variational Autoencoder Framework for Physics Data
PB - RWTH Aachen
VL - Masterarbeit
M1 - FZJ-2026-00524
SP - 75p
PY - 2025
N1 - Masterarbeit, RWTH Aachen, 2025
AB - Many scientific domains, such as physics, provide multimodal data when observing complex phenomena or when doing experiments. Understanding individual contributions of each modality can help to optimise experimental setups and sensors, thereby potentially increasing accuracy on domain-specific tasks that rely on such data. This thesis examines the role of multimodal data in (downstream) prediction tasks, with a focus on the unique and shared contributions of the respective modalities. Disentangled representation learning is a paradigm that aims to extract the independent, underlying factors from data. We employ this approach for multimodal data, proposing an extension to the disentangled multimodal variational autoencoder (DMVAE) by incorporating an additional optimisation objective to enforce minimal redundancy between shared and unique latent representations extracted by the DMVAE. Based on these representations, we train and evaluate several downstream tasks to study their contributions to the task. We compare this method to the traditional DMVAE and VAE across multimodal and single-modal configurations and also compare it directly to regression models. In our experiments, this approach is applied to the Multimodal Universe (MMU) astronomical dataset, which includes both imagery and spectral data. We also evaluate the impact of a physical-model-based differentiable image decoder model for extracting meaningful parameters into the latent space. Addi-tionally, the setup is applied to HyPlant hyperspectral remote sensing data, which consists of airborne measurements of Earth’s surface, to study it as a source of multimodal data to test how much information images and spectra contain about hyperspectral data.
LB - PUB:(DE-HGF)19
DO - DOI:10.34734/FZJ-2026-00524
UR - https://juser.fz-juelich.de/record/1051601
ER -