Semantic Perceptual Image Compression Using Deep Convolution Networks

Prakash, Aaditya; Moran, Nick; Garber, Solomon; DiLillo, Antonella; Storer, James A.

doi:10.1109/dcc.2017.56

Cited by 68 publications

(43 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Table 1 The JPEG quantization level was chosen to achieve the highest accuracy. For MSROI, we applied the technique described in [17] and set the quantization level to keep the final image quality as close as possible to the JPEG version. The results are shown in Figure 2…”

Section: Methodsmentioning

confidence: 99%

“…Saliency detection techniques can solve these issues, but such techniques are limited in their ability to detect multiple objects, and the identified salient region may only contain a limited subset of the objects in the image. To address these shortcomings, we utilize MSROI [17], a CNN designed to retrieve all salient regions and provide a soft boundary over the image.…”

Section: Semantic Quantizationmentioning

confidence: 99%

See 1 more Smart Citation

Protecting JPEG Images Against Adversarial Attacks

Prakash

Moran

Garber

et al. 2018

2018 Data Compression Conference

Self Cite

View full text Add to dashboard Cite

As deep neural networks (DNNs) have been integrated into critical systems, several methods to attack these systems have been developed. These adversarial attacks make imperceptible modifications to an image that fool DNN classifiers. We present an adaptive JPEG encoder which defends against many of these attacks. Experimentally, we show that our method produces images with high visual quality while greatly reducing the potency of state-of-the-art attacks. Our algorithm requires only a modest increase in encoding time, produces a compressed image which can be decompressed by an off-theshelf JPEG decoder, and classified by an unmodified classifier.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Semantic Quantizationmentioning

confidence: 99%

Protecting JPEG Images Against Adversarial Attacks

Prakash

Moran

Garber

et al. 2018

2018 Data Compression Conference

Self Cite

View full text Add to dashboard Cite

show abstract

“…To the best of our knowledge, there are no previous scientific works that propose to learn a mapping of the pixel coordinates to the corresponding pixel color values using neural networks. However, there are numerous neural models that learn a mapping from image pixels to a set of classes [10,16,30,33] or from pixels to pixels [1,2,[4][5][6]12,14,18,20,21,24,28,29,35,[39][40][41][42]. The neural models that map pixels to pixels are usually applied on tasks such as image compression [1,2,4,6,21,24,35], image denoising and restoration [20,39,42], image super-resolution [5,14,18,20,28,29,39,40], image completion [12,41] and image generation [11,36].…”

Section: Related Workmentioning

confidence: 99%

“…Dumas et al [6] address image compression using sparse representations, by proposing a stochastic winner-takes-all auto-encoder in which image patches compete with one another when their sparse representation is computed. Prakash et al [24] design a technique that makes JPEG content-aware by training a deep CNN model to generate a map that highlights semantically-salient regions so that they can be encoded at higher quality as compared to background regions. Toderici et al [35] present several recurrent neural network (RNN) architectures that provide variable compression rates during deployment without requiring retraining.…”

Section: Related Workmentioning

confidence: 99%

CocoNet: A Deep Neural Network for Mapping Pixel Coordinates to Color Values

Bricman¹,

Ionescu

2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

In this paper, we propose a deep neural network approach for mapping the 2D pixel coordinates in an image to the corresponding Red-Green-Blue (RGB) color values. The neural network is termed CocoNet, i.e. coordinates-to-color network. During the training process, the neural network learns to encode the input image within its layers. More specifically, the network learns a continuous function that approximates the discrete RGB values sampled over the discrete 2D pixel locations. At test time, given a 2D pixel coordinate, the neural network will output the approximate RGB values of the corresponding pixel. By considering every 2D pixel location, the network can actually reconstruct the entire learned image. It is important to note that we have to train an individual neural network for each input image, i.e. one network encodes a single image only. To the best of our knowledge, we are the first to propose a neural approach for encoding images individually, by learning a mapping from the 2D pixel coordinate space to the RGB color space. Our neural image encoding approach has various low-level image processing applications ranging from image encoding, image compression and image denoising to image resampling and image completion. We conduct experiments that include both quantitative and qualitative results, demonstrating the utility of our approach and its superiority over standard baselines, e.g. bilateral filtering or bicubic interpolation. Our code is available at https://github.com/paubric/python-fuse-coconet.

show abstract

“…David E. Rumelhart first proposed the concept of an auto-encoder [6] and employed it to process data with large dimensions, which promoted the development of neural networks. In 2006, Hinton et al [7] improved the original shallow auto-encoder and proposed the concept of a deep learning neural network as well as its training strategy, which can be used in the signal processing field for applications such as feature extraction [8], image compression [9][10][11], classification [12,13], image denoising [14], prediction [15], and so on. Wang et al [16] proposed a rapid 3D feature learning method named a convolutional auto-encoder extreme learning machine (CAE-ELM), and the features extracted were superior to other previous deep learning methods.…”

Section: Introductionmentioning

confidence: 99%

An FPGA Implementation of a Convolutional Auto-Encoder

et al. 2018

View full text Add to dashboard Cite

Abstract:In order to simplify the hardware design and reduce the resource requirements, this paper proposes a novel implementation of a convolutional auto-encoder (CAE) in a field programmable gate array (FPGA). Instead of the traditional framework realized in a layer-by-layer way, we designed a new periodic layer-multiplexing framework for CAE. Only one layer is introduced and periodically reused to establish the network, which consumes fewer hardware resources. Moreover, by fixing the number of channels, this framework can be applicable to an image of arbitrary size. Furthermore, to effectively improve the speed of convolution calculation, the parallel convolution method is used based on shift registers. Experimental results show that the proposed CAE framework achieves good performance in image compression. It can be observed that our CAE framework has advantages in resources occupation, operation speed, and power consumption, indicating great potential for application in digital signal processing.

show abstract

Semantic Perceptual Image Compression Using Deep Convolution Networks

Cited by 68 publications

References 24 publications

Protecting JPEG Images Against Adversarial Attacks

Protecting JPEG Images Against Adversarial Attacks

CocoNet: A Deep Neural Network for Mapping Pixel Coordinates to Color Values

An FPGA Implementation of a Convolutional Auto-Encoder

Contact Info

Product

Resources

About