TY  - JOUR
AU  - Goldermann, Lavinia
AU  - Fonck, Simon
AU  - Olivier, Lena
AU  - Fritsch, Sebastian
AU  - Stollenwerk, André
TI  - The Influence of Human Annotation on CNN Performance for Anomaly Detection in ICU Data
JO  - Current directions in biomedical engineering
VL  - 11
IS  - 1
SN  - 2364-5504
CY  - Berlin
PB  - De Gruyter
M1  - FZJ-2025-04636
SP  - 362 - 365
PY  - 2025
AB  - Deep learning methods are increasingly used in clinical artificial intelligence (AI) research, including for detecting anomalies in intensive care data. However, their evaluation often depends on human annotations, which can vary in quality and consistency. In this study, we analyse the effect of annotation variability on the performance of DeepAnT, an unsupervised convolutional neural network for anomaly detection (AD). Using intensive care time-series data from 38 patients for training and six patients separately annotated for evaluation, we compare F1 scores based on two independent physician annotations. Our results show differences in model performance across different vital parameters, between patients, and especially between annotators evaluating the same data. These findings indicate that human labelling has a measurable impact on the perceived performance of the AD algorithm. Structured labelling protocols may be beneficial for achieving more consistent and reliable evaluations.
LB  - PUB:(DE-HGF)16
DO  - DOI:10.1515/cdbme-2025-0192
UR  - https://juser.fz-juelich.de/record/1048425
ER  -