TY - CONF
AU - Zhao, Xuan
AU - Krieger, Lena
AU - Cao, Zhuo
AU - Bangun, Arya
AU - Scharr, Hanno
AU - Assent, Ira
TI - Probabilistic Framework for Robustness of Counterfactual Explanations Under Data Shifts
M1 - FZJ-2025-05281
PY - 2025
AB - Counterfactual explanations (CEs) are a powerful method for interpreting machine learning models, but a CE may become invalid when the model is updated due to distribution shifts in the underlying data. Existing approaches to robust CEs often impose explicit bounds on model parameters to ensure stability, but such bounds can be difficult to estimate and overly restrictive in practice. In this work, we propose a data-shift-driven probabilistic framework for robust counterfactual explanations that models plausible data shifts via a Wasserstein ball. We formalize a linearized Wasserstein perturbation scheme that captures realistic distributional changes, which enables Monte Carlo estimation of CE robustness probabilities with domain-specific data shift tolerances. Theoretical analysis reveals that our framework is equivalent in spirit to model parameter bounding approaches but offers greater flexibility and avoids the need to estimate maximal model parameter shifts. Experiments on real-world datasets demonstrate that the proposed method maintains high robustness of CEs under plausible distribution shifts, outperforming conventional parameter-bounding techniques in both validity and proximity costs.
T2 - NeurIPS 2025 - Reliable ML Workshop
CY - 2 Dec 2025 - 8 Dec 2025, San Diego (USA)
Y2 - 2 Dec 2025 - 8 Dec 2025
M2 - San Diego, USA
LB - PUB:(DE-HGF)6
DO - 10.34734/FZJ-2025-05281
UR - https://juser.fz-juelich.de/record/1049199
ER -