Preprint FZJ-2024-07530

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Confounder control in biomedicine necessitates conceptual considerations beyond statistical evaluations

 ;  ;  ;  ;

2024

medrxiv () [10.1101/2024.02.02.24302198]

This record in other databases:  

Please use a persistent id in citations: doi:

Abstract: Machine learning (ML) models hold promise in precision medicine by enabling personalized predictions basedon high-dimensional biomedical data. Yet, transitioning models from prototyping to clinical applications poseschallenges, with confounders being a significant hurdle by undermining the reliability, generalizability, andinterpretability of ML models. Using hand grip strength (HGS) prediction from neuroimaging data from theUK Biobank as a case study, we demonstrate that confounder adjustment can have a greater impact on modelperformance than changes in features or algorithms. An ubiquitous and necessary approach to confounding isby statistical means. However, a pure statistical viewpoint overlooks the biomedical relevance of candidateconfounders, i.e. their biological link and conceptual similarity to actual variables of interest. Problematically,this can lead to biomedically not-meaningful confounder-adjustment, which limits the usefulness of resultingmodels, both in terms of biological insights and clinical applicability. To address this, we propose a two-dimensional framework, the Confound Continuum, that combines both statistical association and biomedicalrelevance, i.e. conceptual similarity, of a candidate confounder. The evaluation of conceptual similarityassesses on a continuum how much two variables overlap in their biological meaning, ranging from negligiblelinks to expressing the same underlying biology. It thereby acknowledges the gradual nature of the biologicallink between candidate confounders and a predictive task. Our framework aims to create awareness for theimperative need to complement statistical confounder considerations with biomedical, conceptual domainknowledge (without going into causal considerations) and thereby offers a means to arrive at meaningful andinformed confounder decisions. The position of a candidate confoudner in the two-dimensional grid of theConfound Continuum can support informed and context-specific confounder decisions and thereby not onlyenhance biomedical validity of predictions but also support translation of predictive models into clinicalpractice.


Contributing Institute(s):
  1. Gehirn & Verhalten (INM-7)
  2. Kognitive Neurowissenschaften (INM-3)
Research Program(s):
  1. 5254 - Neuroscientific Data Analytics and AI (POF4-525) (POF4-525)
  2. DFG project G:(GEPRIS)431549029 - SFB 1451: Schlüsselmechanismen normaler und krankheitsbedingt gestörter motorischer Kontrolle (431549029) (431549029)

Appears in the scientific report 2024
Click to display QR Code for this record

The record appears in these collections:
Institute Collections > INM > INM-3
Institute Collections > INM > INM-7
Document types > Reports > Preprints
Workflow collections > Public records
Publications database

 Record created 2024-12-20, last modified 2025-02-03


External link:
Download fulltext
Fulltext
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)