2017 Data Compression Conference (DCC) 2017
DOI: 10.1109/dcc.2017.56
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Perceptual Image Compression Using Deep Convolution Networks

Abstract: It has long been considered a significant problem to improve the visual quality of lossy image and video compression. Recent advances in computing power together with the availability of large training data sets has increased interest in the application of deep learning cnns to address image recognition and image processing tasks. Here, we present a powerful cnn tailored to the specific task of semantic image understanding to achieve higher visual quality in lossy compression. A modest increase in complexity i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 68 publications
(43 citation statements)
references
References 24 publications
0
43
0
Order By: Relevance
“…Table 1 The JPEG quantization level was chosen to achieve the highest accuracy. For MSROI, we applied the technique described in [17] and set the quantization level to keep the final image quality as close as possible to the JPEG version. The results are shown in Figure 2…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Table 1 The JPEG quantization level was chosen to achieve the highest accuracy. For MSROI, we applied the technique described in [17] and set the quantization level to keep the final image quality as close as possible to the JPEG version. The results are shown in Figure 2…”
Section: Methodsmentioning
confidence: 99%
“…Saliency detection techniques can solve these issues, but such techniques are limited in their ability to detect multiple objects, and the identified salient region may only contain a limited subset of the objects in the image. To address these shortcomings, we utilize MSROI [17], a CNN designed to retrieve all salient regions and provide a soft boundary over the image.…”
Section: Semantic Quantizationmentioning
confidence: 99%
“…To the best of our knowledge, there are no previous scientific works that propose to learn a mapping of the pixel coordinates to the corresponding pixel color values using neural networks. However, there are numerous neural models that learn a mapping from image pixels to a set of classes [10,16,30,33] or from pixels to pixels [1,2,[4][5][6]12,14,18,20,21,24,28,29,35,[39][40][41][42]. The neural models that map pixels to pixels are usually applied on tasks such as image compression [1,2,4,6,21,24,35], image denoising and restoration [20,39,42], image super-resolution [5,14,18,20,28,29,39,40], image completion [12,41] and image generation [11,36].…”
Section: Related Workmentioning
confidence: 99%
“…Dumas et al [6] address image compression using sparse representations, by proposing a stochastic winner-takes-all auto-encoder in which image patches compete with one another when their sparse representation is computed. Prakash et al [24] design a technique that makes JPEG content-aware by training a deep CNN model to generate a map that highlights semantically-salient regions so that they can be encoded at higher quality as compared to background regions. Toderici et al [35] present several recurrent neural network (RNN) architectures that provide variable compression rates during deployment without requiring retraining.…”
Section: Related Workmentioning
confidence: 99%
“…David E. Rumelhart first proposed the concept of an auto-encoder [6] and employed it to process data with large dimensions, which promoted the development of neural networks. In 2006, Hinton et al [7] improved the original shallow auto-encoder and proposed the concept of a deep learning neural network as well as its training strategy, which can be used in the signal processing field for applications such as feature extraction [8], image compression [9][10][11], classification [12,13], image denoising [14], prediction [15], and so on. Wang et al [16] proposed a rapid 3D feature learning method named a convolutional auto-encoder extreme learning machine (CAE-ELM), and the features extracted were superior to other previous deep learning methods.…”
Section: Introductionmentioning
confidence: 99%