The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

Swamy, Vinitra; Frej, Jibril Albachir; Käser, Tanja

doi:10.48550/arxiv.2307.00364

Swamy, Vinitra; Frej, Jibril Albachir; Käser, Tanja

2023

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Explainable Artificial Intelligence (XAI) plays a crucial role in enabling human understanding and trust in deep learning systems, often defined as determining which features are most important to a model's prediction. As models get larger, more ubiquitous, and pervasive in aspects of daily life, explainability is necessary to avoid or minimize adverse effects of model mistakes. Unfortunately, current approaches in human-centric XAI (e.g. predictive tasks in healthcare, education, or personalized ads) tend to rely on a single explainer. This is a particularly concerning trend when considering that recent work has identified systematic disagreement in explainability methods when applied to the same points and underlying black-box models. In this paper, we therefore present a call for action to address the limitations of current state-of-the-art explainers. We propose to shift from post-hoc explainability to designing interpretable neural network architectures; moving away from approximation techniques in human-centric and high impact applications. We identify five needs of human-centric XAI (real-time, accurate, actionable, human-interpretable, and consistent) and propose two schemes for interpretable-by-design neural network workflows (adaptive routing for interpretable conditional computation and diagnostic benchmarks for iterative model learning). We postulate that the future of human-centric XAI is neither in explaining black-boxes nor in reverting to traditional, interpretable models, but in neural networks that are intrinsically interpretable.

Details

Title The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

Author(s) Swamy, Vinitra ; Frej, Jibril Albachir ; Käser, Tanja

Date 2023

Note Viewpoint paper, under review at JAIR

DOI https://doi.org/10.48550/arxiv.2307.00364

Laboratories ML4ED
AVP-E-LEARN

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > ML4ED - Machine Learning for Education Laboratory
Scientific production and competences > Transdisciplinary Entities > LEARN - Center for Learning Science
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Submitted

Record creation date 2023-07-04

Files

Abstract

Details

PDF