Home > Publications database > Audio Deepfake Detection using the Stationary Wavelet Transform |
Master Thesis | FZJ-2024-04917 |
;
2024
This record in other databases:
Please use a persistent id in citations: doi:10.34734/FZJ-2024-04917
Abstract: Recent advances in generative modelling have uncovered new avenues for creative expression, while also raising the potential of malicious misuse. Deepfakes pose significant risks to personal privacy, information integrity and cybersecurity. One possible countermeasure against deepfakes are robust detection models based on deep learning. In the context of audio, convolutional architectures have demonstrated good recognition rates. The raw audio data is usually transformed into a suitable format for convolutional neural networks. Current state-of-the art models employ Fourier or wavelet transforms. This thesis investigates the detection of audio fakes using deep convolutional networks that process stationary wavelet transform inputs. The results show functional detector models, but also reveal limitations of the stationary wavelet transform.
![]() |
The record appears in these collections: |