Proceedings of the Twelfth Indian Conference on Computer Vision, Graphics and Image Processing 2021
DOI: 10.1145/3490035.3490288
|View full text |Cite
|
Sign up to set email alerts
|

An encoder-decoder based deep architecture for visible to near infrared image transformation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 25 publications
0
9
0
Order By: Relevance
“…If IR and depth images can be estimated from an RGB image, a lip-reading system with a high recognition rate can be implemented by using existing RGB cameras without adding/changing any hardware. Previous studies demonstrated the feasibility of using RGB images to predict a different domain for its application-specific representation [44]. Accordingly, in this paper, we propose a method to improve the accuracy of lip-reading by using IR/depth images estimated from RGB images instead of the actually captured IR/depth images.…”
Section: Prediction Of Ir/depth Images Using An Optical Rgb Imagementioning
confidence: 99%
See 3 more Smart Citations
“…If IR and depth images can be estimated from an RGB image, a lip-reading system with a high recognition rate can be implemented by using existing RGB cameras without adding/changing any hardware. Previous studies demonstrated the feasibility of using RGB images to predict a different domain for its application-specific representation [44]. Accordingly, in this paper, we propose a method to improve the accuracy of lip-reading by using IR/depth images estimated from RGB images instead of the actually captured IR/depth images.…”
Section: Prediction Of Ir/depth Images Using An Optical Rgb Imagementioning
confidence: 99%
“…Despite the shared information between the two images, each image has unique characteristics that cannot be explained by simple dependency relationships. Therefore, the correspondence between the two images was primarily represented by nonlinear mapping rules, such as deep neural networks [38][39][40][42][43][44][45]49,53]. Typically, a CNN was adopted to estimate the IR/depth images from optical RGB images.…”
Section: Prediction Of Ir/depth Images Using An Optical Rgb Imagementioning
confidence: 99%
See 2 more Smart Citations
“…Furthermore, they improve the quality of generated thermal images by integrating gradient and intensity losses, which helps the generator learn better characteristics of the thermal images. [77] propose a visible to Near-Infrared (NIR) image translation model based on GANs. Their proposed model consists of two UNet models as generators, one for the regular image, and the other one for a lower resolution version of the image.…”
Section: Related Workmentioning
confidence: 99%