Adversarially Learned One-Class Classifier for Novelty Detection

Sabokrou, Mohammad; Khalooei, Mohammad; Fathy, Mahmood; Adeli, Ehsan

doi:10.1109/cvpr.2018.00356

Cited by 662 publications

(505 citation statements)

References 47 publications

Supporting

Mentioning

503

Contrasting

Order By: Relevance

“…We evaluate our method on the UCSD Ped2 dataset [8], which is a popular dataset for this task. We follow the evaluation criteria of [51]. Similar to [51], the frame-level accuracy is reported as the performance metric.…”

Section: Video Anomaly Detectionmentioning

confidence: 99%

See 1 more Smart Citation

Self-Supervised Representation Learning via Neighborhood-Relational Encoding

Sabokrou

Khalooei

Adeli

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Self Cite

View full text Add to dashboard Cite

In this paper, we propose a novel self-supervised representation learning by taking advantage of a neighborhoodrelational encoding (NRE) among the training data. Conventional unsupervised learning methods only focused on training deep networks to understand the primitive characteristics of the visual data, mainly to be able to reconstruct the data from a latent space. They often neglected the relation among the samples, which can serve as an important metric for self-supervision. Different from the previous work, NRE aims at preserving the local neighborhood structure on the data manifold. Therefore, it is less sensitive to outliers. We integrate our NRE component with an encoder-decoder structure for learning to represent samples considering their local neighborhood information. Such discriminative and unsupervised representation learning scheme is adaptable to different computer vision tasks due to its independence from intense annotation requirements. We evaluate our proposed method for different tasks, including classification, detection, and segmentation based on the learned latent representations. In addition, we adopt the auto-encoding capability of our proposed method for applications like defense against adversarial example attacks and video anomaly detection. Results confirm the performance of our method is better or at least comparable with the state-of-the-art for each specific application, but with a generic and self-supervised approach.

show abstract

Section: Video Anomaly Detectionmentioning

confidence: 99%

“…We follow the evaluation criteria of [51]. Similar to [51], the frame-level accuracy is reported as the performance metric. In frame-level measure, a frame is considered as anomaly, if at least one of its pixels is detected as anomaly.…”

Section: Video Anomaly Detectionmentioning

confidence: 99%

Self-Supervised Representation Learning via Neighborhood-Relational Encoding

Sabokrou

Khalooei

Adeli

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Self Cite

View full text Add to dashboard Cite

show abstract

“…This bad estimation significantly affected the error map though the three cars running on other way were correctly determined. The results may thus be im- Method Belleview Train GANomaly [2] 0.735 0.194 AEs + local feature [35] 0.748 0.171 AEs + global feature [35] 0.776 0.216 ALOCC D(X) [40] 0.734 0.182 ALOCC D(R(X)) [40] 0.805 0.237 Our proposed method 0.751 0.490 SSIM on appearance stream 0.830 0.798 Table 3. The average precision of frame-level anomaly detection on the Traffic-Belleview and Traffic-Train datasets.…”

Section: Traffic-belleview and Traffic-trainmentioning

confidence: 99%

Anomaly Detection in Video Sequence With Appearance-Motion Correspondence

Nguyen

Meunier²

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

355

148

View full text Add to dashboard Cite

Anomaly detection in surveillance videos is currently a challenge because of the diversity of possible events. We propose a deep convolutional neural network (CNN) that addresses this problem by learning a correspondence between common object appearances (e.g. pedestrian, background, tree, etc.) and their associated motions. Our model is designed as a combination of a reconstruction network and an image translation model that share the same encoder. The former sub-network determines the most significant structures that appear in video frames and the latter one attempts to associate motion templates to such structures. The training stage is performed using only videos of normal events and the model is then capable to estimate frame-level scores for an unknown input. The experiments on 6 benchmark datasets demonstrate the competitive performance of the proposed approach with respect to state-ofthe-art methods. AbstractThis supplementary material provides these contents:• ROC curves of our frame-level scores on the CUHK Avenue and UCSD Ped2 datasets, and Precision-Recall (PR) curves on the traffic datasets.

show abstract

“…Deep approaches to anomaly detection for image data often use a convolutional autoencoder (CAE) which include convolutional layers in the AE architecture [24,31]. Another line of work uses Generative Adversarial Networks (GAN) for this task [8,27,29]. This two-step process is also used to make the density estimation task easier by learning low-dimensional representations.…”

Section: Deep Learning For Anomaly Detectionmentioning

confidence: 99%

Fast Distance-Based Anomaly Detection in Images Using an Inception-Like Autoencoder

Sarafijanovic-Djukic

Davis

2019

Discovery Science

View full text Add to dashboard Cite

The goal of anomaly detection is to identify examples that deviate from normal or expected behavior. We tackle this problem for images. We consider a two-phase approach. First, using normal examples, a convolutional autoencoder (CAE) is trained to extract a low-dimensional representation of the images. Here, we propose a novel architectural choice when designing the CAE, an Inception-like CAE. It combines convolutional filters of different kernel sizes and it uses a Global Average Pooling (GAP) operation to extract the representations from the CAE's bottleneck layer. Second, we employ a distanced-based anomaly detector in the low-dimensional space of the learned representation for the images. However, instead of computing the exact distance, we compute an approximate distance using product quantization. This alleviates the high memory and prediction time costs of distance-based anomaly detectors. We compare our proposed approach to a number of baselines and state-of-the-art methods on four image datasets, and we find that our approach resulted in improved predictive performance.

show abstract

Adversarially Learned One-Class Classifier for Novelty Detection

Cited by 662 publications

References 47 publications

Self-Supervised Representation Learning via Neighborhood-Relational Encoding

Self-Supervised Representation Learning via Neighborhood-Relational Encoding

Anomaly Detection in Video Sequence With Appearance-Motion Correspondence

Fast Distance-Based Anomaly Detection in Images Using an Inception-Like Autoencoder

Contact Info

Product

Resources

About