Deep learning models for image classification suffer from dangerous issues that are often discovered only after deployment. The process of identifying the bugs that cause these issues remains limited and understudied. In particular, explainability methods are often presented as obvious tools for bug identification. Yet, current practice lacks an understanding of which kinds of explanations can best support the different steps of the bug identification process, and of how practitioners could interact with those explanations. Through a formative study and an iterative co-creation process, we build an interactive design probe providing various potentially relevant explainability functionalities, integrated into interfaces that allow for flexible workflows. Using the probe, we perform 18 user studies with a diverse set of machine learning practitioners. Two-thirds of the practitioners engage in successful bug identification. They use multiple types of explanations, e.g., visual and textual ones, through non-standardized sequences of interactions including queries and exploration. Our results highlight the need for interactive, guiding interfaces with diverse explanations, shedding light on future research directions.
CCS CONCEPTS: • Human-centered computing → User interface programming; Empirical studies in HCI; • Computing methodologies → Computer vision; • Software and its engineering → Software testing and debugging.