The problem of proper identification of the color space associated with digital luma and chroma data has been widely reported by video processing professionals. The problem arises from confusing, and sometimes conflicting, statements regarding color space usage and description. Although standards allow for the carriage of descriptive metadata regarding color space, some applications do not require the presence of such metadata. Those standards typically recite assumptions on color space that should be followed in the absence of embedded color space descriptors. The unfortunate result of this approach is a state of confusion in the industry, and consequently the possibility of errors in rendering the output of decoded video and images. Our work represents the first known attempt to determine color space directly from luma/chroma pixel data, and provides an alternative to sole reliance on potentially missing or incorrect metadata, or weakly followed defaults. Although a color space is defined by many parameters, such as primary chromaticity, transfer characteristics and matrix coefficients, we chose to focus on determining which standard for matrix coefficients had been used to create a given luma/chroma image. We addressed the problem via deep convolutional neural networks (DCNNs), trained on millions of images. Our results are encouraging, and suggest that DCNNs can be used to solve this ill-posed problem.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.