Master Thesis FZJ-2025-03665

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Data Fusion for Scene Graph Generation: Bridging Simulated and Real-World Datasets

 ;  ;

2025
Jülich

Jülich 102 pages: Figures, Tables () [10.34734/FZJ-2025-03665] = Masterarbeit, Hochschule Coburg, 2025

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: Scene graph generation has emerged as a powerful tool for AI-driven visual understandingof images by not only detecting objects in an image but also predicting the relationshipsbetween them, such as car–stops at–traffic light or pedestrian–crosses–street. This capabilityis particularly important for autonomous driving, where relational context between roadusers and infrastructure plays a critical role. However, the application of scene graphgeneration in this domain is hindered by the scarcity of annotated datasets. Drivingsimulators such as CARLA provide a scalable alternative, enabling efficient data generationcompared to manual annotation. Yet models trained exclusively on simulated data oftenfail to generalize to real-world data due to the substantial domain gap between the two.This thesis addresses this challenge by proposing a novel data fusion framework thatcombines simulated and real datasets to construct autonomous driving–specific relationshipannotations and subsequently bridge the domain gap for real-world prediction. The workpresents the complete pipeline, including dataset generation in simulation, adaptationof publicly available resources, and augmentation strategies. The Relation Transformermodel is analyzed in depth, and particular attention is given to interpreting its internalmechanisms by visualizing the learned attention maps as heatmaps. This analysis providesinsights into whether the model focuses on semantically meaningful regions when predictingrelationships. Building on this understanding, two new approaches are introduced to enableinference on real data while transferring relational knowledge acquired in simulation. Anablation study further quantifies the impact of the domain gap on model performance andhighlights the strengths and limitations of the proposed methods. Results demonstratethat one of the developed approaches effectively mitigates the simulation-to-reality gapand concrete suggestions for advancing this technique toward further uses for AI-drivenvisual understanding of images in the automotive context are provided.


Note: Masterarbeit, Hochschule Coburg, 2025

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. nxtAIM - nxtAIM – NXT GEN AI Methods (19A23014l) (19A23014l)

Appears in the scientific report 2025
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Theses > Master Theses
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2025-09-08, last modified 2025-11-24


OpenAccess:
Download fulltext PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)