GAN Lab: Understanding Complex Deep Generative Models using Interactive Visual Experimentation

Kahng, Minsuk; Thorat, Nikhil; Chau, Duen Horng; Viégas, Fernanda B.; Wattenberg, Martin

doi:10.1109/tvcg.2018.2864500

Cited by 138 publications

(77 citation statements)

References 33 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Regarding the former, various visual analytic approaches have been proposed for convolutional neural networks mainly computer vision domains [2,6,12,13,19,34] and RNNs in NLP domains [5,11,17,23,24]. Visual analytic approaches have also been integrated with other advanced neural network architectures, such as generative adversarial networks [9,30], deep reinforcement learning [29]. Among them, Strobelt et al [22] developed a visual analytic system for RNN-based attention models, mainly for the exploration and understanding of sequence-to-sequence modeling tasks.…”

Section: Related Workmentioning

confidence: 99%

SANVis: Visual Analytics for Understanding Self-Attention Networks

Park¹,

Choo²,

Na³

et al. 2019

2019 IEEE Visualization Conference (VIS)

View full text Add to dashboard Cite

Figure 1: Overview of SANVis. (A) The network view displays multiple attention patterns for each layer according to three type of visualization options: (A-1) the attention piling option, (A-2) the Sankey diagram option, and (A-3) the small multiples option. (A-4) The bar chart shows the average attention weights for all heads (each colored with its corresponding hue) per each layer. (B) The HeadLens view helps the user analyze what the attention head learned by showing representative words and by providing statistical information of part-of-speech tags and positions. ABSTRACTAttention networks, a deep neural network architecture inspired by humans' attention mechanism, have seen significant success in image captioning, machine translation, and many other applications. Recently, they have been further evolved into an advanced approach called multi-head self-attention networks, which can encode a set of input vectors, e.g., word vectors in a sentence, into another set of vectors. Such encoding aims at simultaneously capturing diverse syntactic and semantic features within a set, each of which corresponds to a particular attention head, forming altogether multi-head attention. Meanwhile, the increased model complexity prevents users from easily understanding and manipulating the inner workings of models. To tackle the challenges, we present a visual analytics system called SANVis, which helps users understand the behaviors and the characteristics of multi-head self-attention networks. Using a state-of-the-art self-attention model called Transformer, we demonstrate usage scenarios of SANVis in machine translation tasks. Our system is available at http://short.sanvis.org.

show abstract

Section: Related Workmentioning

confidence: 99%

SANVis: Visual Analytics for Understanding Self-Attention Networks

Park¹,

Choo²,

Na³

et al. 2019

2019 IEEE Visualization Conference (VIS)

View full text Add to dashboard Cite

show abstract

“…BOOSTVis [36] and iForest [70] also focus on explaining tree ensemble models through the use of multiple coordinated views to help explain and explore decision paths. Similarly, recent visual analytics work on deep learning [24,25,30,34,44,49,55,[63][64][65]68] tackles the issue of the low interpretability of neural network structures and supports revealing the internal logic of the training and prediction processes.…”

Section: Explainable Artificial Intelligence -Xaimentioning

confidence: 99%

Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Xie

et al. 2020

IEEE Trans. Visual. Comput. Graphics

View full text Add to dashboard Cite

1 2 Recall: 0.81 in the poisoned model 0.90 in the victim model 7 5 4 B A C E D F G 3 G.1 6 G.2 Fig. 1. Reliability attack on spam filters.(1) Poisoning instance #40 has the largest impact on the recall value, which is (2) also depicted in the model overview.(3) There is heavy overlap among instances in the two classes as well the poisoning instances. (4) Instance #40 has been successfully attacked causing a number of innocent instances to have their labels flipped. (5) The flipped instances are very close to the decision boundary. (6) On the feature of words "will" and "email", the variances of poisoning instances are large. (7) A sub-optimal target (instance #80) has less impact on the recall value, but the cost of insertions is 40% lower than that of instance #40.Abstract-Machine learning models are currently being deployed in a variety of real-world applications where model predictions are used to make decisions about healthcare, bank loans, and numerous other critical tasks. As the deployment of artificial intelligence technologies becomes ubiquitous, it is unsurprising that adversaries have begun developing methods to manipulate machine learning models to their advantage. While the visual analytics community has developed methods for opening the black box of machine learning models, little work has focused on helping the user understand their model vulnerabilities in the context of adversarial attacks. In this paper, we present a visual analytics framework for explaining and exploring model vulnerabilities to adversarial attacks. Our framework employs a multi-faceted visualization scheme designed to support the analysis of data poisoning attacks from the perspective of models, data instances, features, and local structures. We demonstrate our framework through two case studies on binary classifiers and illustrate model vulnerabilities with respect to varying attack strategies.

show abstract

“…The generator takes a random noise vector z (following a Gaussian distribution) as input and outputs a generated sample G(z) without any access to real samples. The discriminator takes both a real sample P data and a generated sample P g as input and predicts the probability of D(x) or D(G(x)) [39,52], as shown in Figure 1.…”

Section: Fully Connected Ganmentioning

confidence: 99%

“…GANs have been used to make promising contributions in variety of difficult generative tasks [35], e.g., text-to-photo translation [18], image generation [36], image composition [37], and image-to-image translation [38]. Although GANs are one type of powerful deep generative models, the training of GANs suffers from several issues, such as mode collapse and training instability [39], as discussed in Section 7.1.…”

Section: Introductionmentioning

confidence: 99%

Deep Generative Adversarial Networks for Image-to-Image Translation: A Review

Alotaibi

2020

Symmetry

View full text Add to dashboard Cite

Many image processing, computer graphics, and computer vision problems can be treated as image-to-image translation tasks. Such translation entails learning to map one visual representation of a given input to another representation. Image-to-image translation with generative adversarial networks (GANs) has been intensively studied and applied to various tasks, such as multimodal image-to-image translation, super-resolution translation, object transfiguration-related translation, etc. However, image-to-image translation techniques suffer from some problems, such as mode collapse, instability, and a lack of diversity. This article provides a comprehensive overview of image-to-image translation based on GAN algorithms and its variants. It also discusses and analyzes current state-of-the-art image-to-image translation techniques that are based on multimodal and multidomain representations. Finally, open issues and future research directions utilizing reinforcement learning and three-dimensional (3D) modal translation are summarized and discussed.

show abstract

GAN Lab: Understanding Complex Deep Generative Models using Interactive Visual Experimentation

Cited by 138 publications

References 33 publications

SANVis: Visual Analytics for Understanding Self-Attention Networks

SANVis: Visual Analytics for Understanding Self-Attention Networks

Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Deep Generative Adversarial Networks for Image-to-Image Translation: A Review

Contact Info

Product

Resources

About