Higher-Order Feature Attribution: Bridging Statistics, Explainable AI, and Topological Signal Processing
arXiv:2510.06165v2 Announce Type: replace-cross Abstract: Feature attributions are post-training analysis methods that assess how various input features of a machine learning model contribute to an output prediction. Their interpretation is straightforward when features act independently, but it becomes less clear when the predictive model involves interactions, such as multiplicative relationships or joint feature contributions. In this work, we propose a general theory of higher-order feature attribution, which we develop on the foundation of Integrated Gradients (IG). This work […]