2021
DOI: 10.1609/aaai.v35i11.17148

Right for Better Reasons: Training Differentiable Models by Constraining their Influence Functions

Abstract: Explaining black-box models such as deep neural networks is becoming increasingly important, as it helps to build trust and aids debugging. Popular forms of explanations map the features to a vector indicating their individual importance to a decision at the instance level. These explanations can then be used to prevent the model from learning a wrong bias present in the data, for example due to ambiguity. For instance, Ross et al.'s ``right for the right reasons'' propagates user explanations backwards to the network by formulating differenti…
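
The abstract refers to Ross et al.'s ``right for the right reasons'' objective, which penalizes explanation mass (typically input gradients) in regions a user has annotated as irrelevant. As a rough illustration only, a minimal PyTorch sketch of that general idea follows, assuming a classifier, a user-provided binary mask of irrelevant input regions, and a weighting term; the names rrr_loss, irrelevant_mask, and lam are illustrative assumptions, and RBR itself constrains influence functions rather than plain input gradients, so this is not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def rrr_loss(model, x, y, irrelevant_mask, lam=10.0):
    """Sketch of a 'right for the right reasons'-style objective:
    the usual answer loss plus a penalty on input gradients that fall
    inside regions a user has marked as irrelevant.
    `irrelevant_mask` is a hypothetical {0,1} tensor shaped like `x`."""
    x = x.clone().requires_grad_(True)
    logits = model(x)
    answer_loss = F.cross_entropy(logits, y)

    # Gradient of the summed log-probabilities w.r.t. the input,
    # kept in the graph so the penalty itself stays differentiable.
    log_probs = F.log_softmax(logits, dim=1)
    input_grads, = torch.autograd.grad(log_probs.sum(), x, create_graph=True)

    # Penalize explanation mass inside the user-marked irrelevant regions.
    reason_loss = (irrelevant_mask * input_grads).pow(2).sum()

    return answer_loss + lam * reason_loss
```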

Cited by 17 publications (11 citation statements)
References 14 publications
“…Regularization techniques such as EXPO (Explanation-based Optimization) and RRR (Right for the Right Reasons) are designed to enhance black-box model interpretability. Although one can argue that “simplicity” of models is positively correlated with interpretability, this is based on how the interpretability is evaluated.…”
Section: Theory
Mentioning confidence: 99%
“…Intrinsic interpretability can also be improved by regularizing the input gradients, as they can identify which feature descriptors contributed towards a prediction [48]. Regularization techniques such as EXPO [49] and RRR [50] are designed to enhance black-box model interpretability. Although one can argue that “simplicity” of models is positively correlated with interpretability, this is based on how the interpretability is evaluated.…”
Section: Self-explaining Models
Mentioning confidence: 99%
“…This approach is summarised in Equations 2 and 3 using GradCAM explanations, where Mₙ ∈ {0, 1} is the ground-truth annotation and norm normalizes the Grad-CAM output; θ holds a model's parameters, with input X, labels y, predictions ŷ, and a parameter regularization term λ. Techniques such as Right for the Right Reasons using Integrated Gradients (RRR-IG) [10], Right for the Right Reasons using GradCAM (RRR-GC) [11], and Right for Better Reasons (RBR) [15] modify a model through explanation and training losses. Explanation losses can be computed between a ground-truth dataset of feature annotations and model-generated explanations, as shown in Equation 2 [11].…”
Section: Model Training
Mentioning confidence: 99%
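
The excerpt above describes an explanation loss computed between a ground-truth annotation Mₙ and a normalized Grad-CAM map, combined with a standard training loss weighted by λ. The PyTorch sketch below shows the general shape of such an objective, assuming the explanation map has already been computed and kept differentiable; the function names explanation_loss and xil_objective, and the choice to penalize explanation mass outside the annotated region, are illustrative assumptions rather than the cited paper's exact Equation 2.

```python
import torch
import torch.nn.functional as F

def explanation_loss(expl_map, annotation_mask):
    """Sketch of an explanation loss in the spirit of the excerpt above:
    compare a normalized model explanation (e.g. a Grad-CAM heat map)
    with a binary ground-truth annotation M_n in {0, 1}.
    Both tensors are assumed to share the same spatial shape (H, W)."""
    # 'norm' in the excerpt: scale the explanation into [0, 1].
    norm_expl = (expl_map - expl_map.min()) / (expl_map.max() - expl_map.min() + 1e-8)
    # One common variant: penalize explanation mass that falls outside
    # the annotated region (i.e. on areas marked as irrelevant).
    return ((1.0 - annotation_mask) * norm_expl).pow(2).mean()

def xil_objective(logits, y, expl_map, annotation_mask, lam=1.0):
    """Total loss = standard training loss + weighted explanation loss,
    mirroring the 'explanation and training losses' wording above;
    `lam` plays the role of the regularization term lambda."""
    return F.cross_entropy(logits, y) + lam * explanation_loss(expl_map, annotation_mask)
```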
“…Missing-region and spurious-region feedback are the two most commonly used types of user feedback in image-based XIL, under the assumption that instances are correctly classified. While techniques such as RRR-IG [10], RRR-GC [11], and RBR [15] use spurious-region feedback to fine-tune a model to ignore spurious features, Human Importance-aware Network Tuning (HINT) trains a model to focus on valid image objects [13].…”
Section: Feedback Collection
Mentioning confidence: 99%