2020
DOI: 10.1145/3386569.3392403
|View full text |Cite
|
Sign up to set email alerts
|

Single image HDR reconstruction using a CNN with masked features and perceptual loss

Abstract: Digital cameras can only capture a limited range of real-world scenes' luminance, producing images with saturated pixels. Existing single image high dynamic range (HDR) reconstruction methods attempt to expand the range of luminance, but are not able to hallucinate plausible textures, producing results with artifacts in the saturated areas. In this paper, we present a novel learning-based approach to reconstruct an HDR image by recovering the saturated pixels of an input LDR image in a visually pleasing way. P… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
79
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 121 publications
(80 citation statements)
references
References 42 publications
1
79
0
Order By: Relevance
“…Jang et al [40] introduced ΔE76 color difference formula in their loss function, which was the first human perception related loss term in HDR related CNN. Santos et al [43] first introduced gram matrix [44] to measure style loss, and feature masking to emphasize global difference.…”
Section: Other Hdr Related Deep Cnnmentioning
confidence: 99%
See 3 more Smart Citations
“…Jang et al [40] introduced ΔE76 color difference formula in their loss function, which was the first human perception related loss term in HDR related CNN. Santos et al [43] first introduced gram matrix [44] to measure style loss, and feature masking to emphasize global difference.…”
Section: Other Hdr Related Deep Cnnmentioning
confidence: 99%
“…Hence, an extra task of strengthening the global comprehension of Full-scale Local Branch NL was posed. Other methods tackled this by introducing multi-group (in [56]) or multi-pass (in [8]) convolution where different kernel-size are assigned to each pass, using spatial attention mechanism (in [43], [50] and [5]), using 1-D and 2-D dynamic convolution (in [47]), and improving encoder-decoder structure (U-net) [57].…”
Section: ) Multi-group Residual Blockmentioning
confidence: 99%
See 2 more Smart Citations
“…Although each algorithm addresses an important issue, no algorithm has enough robustness to completely handle the misalignment caused by object motion in images. Concurrently, another approach attempts to infer an HDR image directly from a single LDR image using CNNs [26]- [31], which is also known as inverse tonemapping (ITM). Although the networks in this approach can recover missing details in under-and over-exposed regions, one limitation of this approach is its high dependence on a single input LDR image, thereby lacking underlying information.…”
Section: Introductionmentioning
confidence: 99%