2020
DOI: 10.1609/aaai.v34i03.5643
CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines

Abstract: We present CoCoX (short for Conceptual and Counterfactual Explanations), a model for explaining decisions made by a deep convolutional neural network (CNN). In Cognitive Psychology, the factors (or semantic-level features) that humans zoom in on when they imagine an alternative to a model prediction are often referred to as fault-lines. Motivated by this, our CoCoX model explains decisions made by a CNN using fault-lines. Specifically, given an input image I for which a CNN classification model M predicts clas…
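The paper specifies the full fault-line objective; the toy sketch below only illustrates the core idea stated in the abstract: search for a minimal set of semantic concepts to add to or delete from the input so that the model's predicted class flips. Everything here is a hypothetical stand-in, not the authors' CoCoX code: the linear model W over binary concept activations, the names (concept_names, fault_line, predict), and the greedy margin-based search are all illustrative assumptions.

    import numpy as np

    # Hypothetical semantic concepts; in CoCoX these are "explainable
    # concepts" derived from the CNN, here they are hard-coded.
    concept_names = ["stripes", "pointed_ears", "beard", "horn"]

    # Toy stand-in for the classifier M: a linear map from binary concept
    # activations to class scores (rows: classes, columns: concepts).
    W = np.array([
        [2.0, -1.0, 0.0, 0.5],   # class 0, e.g. "zebra"
        [-1.0, 2.0, 1.0, -0.5],  # class 1, e.g. "dog"
    ])

    def predict(z):
        """Predicted class for concept-activation vector z."""
        return int(np.argmax(W @ z))

    def toggled(z, i):
        """Copy of z with concept i added (0 -> 1) or deleted (1 -> 0)."""
        out = z.copy()
        out[i] = 1.0 - out[i]
        return out

    def fault_line(z, target, max_edits=3):
        """Greedy search for a small list of concept additions/deletions
        that flips the prediction to `target`; an illustrative stand-in
        for the paper's fault-line optimization, not its algorithm."""
        z = z.copy()
        edits = []
        for _ in range(max_edits):
            if predict(z) == target:
                return edits
            touched = {name for _, name in edits}
            candidates = [i for i in range(len(z))
                          if concept_names[i] not in touched]
            # Pick the toggle that most improves the target-vs-current margin.
            best = max(candidates,
                       key=lambda i: (W[target] - W[predict(z)]) @ toggled(z, i))
            edits.append(("delete" if z[best] else "add", concept_names[best]))
            z = toggled(z, best)
        return edits

    z0 = np.array([1.0, 0.0, 0.0, 1.0])  # "stripes" and "horn" present -> class 0
    print(fault_line(z0, target=1))
    # [('delete', 'stripes'), ('add', 'pointed_ears')]

In the paper the fault-line is found by optimizing over explainable concepts extracted from the CNN's feature maps; the greedy toggle loop above is only meant to make the "minimal additions/deletions that flip the class" idea concrete.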

Cited by 50 publications (29 citation statements)
References 17 publications
“…(Buchanan & Shortliffe, 1984), which is providing a foil for the fact. More recently, there has been a keen interest in answering why-not questions for many different sub-fields of artificial intelligence, including machine learning classification (Dhurandhar et al., 2018; Mothilal et al., 2020), belief-desire-intention agents (Winikoff, 2017), reinforcement learning (Madumal et al., 2020; Waa et al., 2018), classical planning (Krarup et al., 2019; Sreedharan et al., 2018), and image classification (Akula et al., 2020), to cite just a few papers.…”
Section: Computational Approaches
Mentioning confidence: 99%
“…This leads to algorithms that can be useful, but terminology and solutions that are not aligned. For example, Dhurandhar et al. (2018) use the term pertinent negatives/positives to refer to foils, while Akula et al. (2020) use the term fault lines, and Krarup et al. (2019) use foil.…”
Section: Computational Approaches
Mentioning confidence: 99%
“…In that way, a taxonomy the ML system can use to create causal attributions can be created and expanded towards a semantic level, using concepts identified and named by human users during usage. This borders recent work by Akula et al. (2020) that identifies concepts like 'stripedness', 'beard', 'horn', et cetera that can be used to create counterfactual explanations building on minimal concept changes that change the classification, from for example goat to sheep. Our problem differs in that the concepts identified can be very subtle, relative, and can even be, in a system that enhances human capabilities, not visible to the naked eye.…”
Section: Explanations From a Human Perspective
Mentioning confidence: 72%
“…In ML there exist several methods both for lifting out causal attributes and work that aims to name and identify these attributes as concepts using previously defined concepts (Ghorbani et al., 2019; Amershi et al., 2009; Koh et al., 2020; Gonzalez-Garcia et al., 2018; Kornblith et al., 2019; Samek et al., 2020; Bengio et al., 2013; Akula et al., 2020). There exists, to our knowledge, less work that focuses on finding and naming concepts during usage (Ghorbani et al., 2019).…”
Section: Related Work
Mentioning confidence: 99%