Abstract:The large domain discrepancy between faces captured in polarimetric (or conventional) thermal and visible domain makes cross-domain face recognition quite a challenging problem for both human-examiners and computer vision algorithms. Previous approaches utilize a two-step procedure (visible feature estimation and visible image reconstruction) to synthesize the visible image given the corresponding polarimetric thermal image. However, these are regarded as two disjoint steps and hence may hinder the performance… Show more
“…Regarding Protocol I, we evaluate and compare the performance of the proposed method with recent state-of-theart methods [35,19,27,26,5,36]. Figure 5 shows the evaluation performance for two different experimental settings, S0 (representing conventional thermal) and Polar separately.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…It has been shown that polarimetric thermal imaging captures additional geometric and textural facial details compared to conventional thermal imaging [10]. Hence, the polarization-state information has been used to improve the performance of cross-spectrum face recognition [10,27,30,35,26,5]. A polarimetric, referred to as Stokes images, is composed of three channels: S0, S1 and S2.…”
Section: Introductionmentioning
confidence: 99%
“…The large domain discrepancy between these images makes the cross-spectrum matching problem very challenging. Various methods have been proposed in the literature for crossspectrum matching [10,27,30,35,26,16,21,18,2,28,24].…”
Section: Introductionmentioning
confidence: 99%
“…Various synthesis-based methods have also been proposed in the literature [26,35,38,27] to perform crossmodal mapping at the image level for direct use in existing visible-based matchers. Riggan et al [27] trained a regression network to estimate the mapping between features from both visible and thermal then reconstruct the visible face based on the estimated features.…”
Section: Introductionmentioning
confidence: 99%
“…Riggan et al [27] trained a regression network to estimate the mapping between features from both visible and thermal then reconstruct the visible face based on the estimated features. Zhang et al [35,36] leveraged generative adversarial networks (GANs) to synthesize visible images from polarimetric thermal images. Riggan et al [26] proposed a global and local region-based synthesis network to transform the thermal image into the visible spectrum.…”
Polarimetric thermal to visible face verification entails matching two images that contain significant domain differences. Several recent approaches have attempted to synthesize visible faces from thermal images for cross-modal matching. In this paper, we take a different approach in which rather than focusing only on synthesizing visible faces from thermal faces, we also propose to synthesize thermal faces from visible faces. Our intuition is based on the fact that thermal images also contain some discriminative information about the person for verification. Deep features from a pre-trained Convolutional Neural Network (CNN) are extracted from the original as well as the synthesized images. These features are then fused to generate a template which is then used for verification. The proposed synthesis network is based on the self-attention generative adversarial network (SAGAN) which essentially allows efficient attention-guided image synthesis. Extensive experiments on the ARL polarimetric thermal face dataset demonstrate that the proposed method achieves state-of-the-art performance.
“…Regarding Protocol I, we evaluate and compare the performance of the proposed method with recent state-of-theart methods [35,19,27,26,5,36]. Figure 5 shows the evaluation performance for two different experimental settings, S0 (representing conventional thermal) and Polar separately.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…It has been shown that polarimetric thermal imaging captures additional geometric and textural facial details compared to conventional thermal imaging [10]. Hence, the polarization-state information has been used to improve the performance of cross-spectrum face recognition [10,27,30,35,26,5]. A polarimetric, referred to as Stokes images, is composed of three channels: S0, S1 and S2.…”
Section: Introductionmentioning
confidence: 99%
“…The large domain discrepancy between these images makes the cross-spectrum matching problem very challenging. Various methods have been proposed in the literature for crossspectrum matching [10,27,30,35,26,16,21,18,2,28,24].…”
Section: Introductionmentioning
confidence: 99%
“…Various synthesis-based methods have also been proposed in the literature [26,35,38,27] to perform crossmodal mapping at the image level for direct use in existing visible-based matchers. Riggan et al [27] trained a regression network to estimate the mapping between features from both visible and thermal then reconstruct the visible face based on the estimated features.…”
Section: Introductionmentioning
confidence: 99%
“…Riggan et al [27] trained a regression network to estimate the mapping between features from both visible and thermal then reconstruct the visible face based on the estimated features. Zhang et al [35,36] leveraged generative adversarial networks (GANs) to synthesize visible images from polarimetric thermal images. Riggan et al [26] proposed a global and local region-based synthesis network to transform the thermal image into the visible spectrum.…”
Polarimetric thermal to visible face verification entails matching two images that contain significant domain differences. Several recent approaches have attempted to synthesize visible faces from thermal images for cross-modal matching. In this paper, we take a different approach in which rather than focusing only on synthesizing visible faces from thermal faces, we also propose to synthesize thermal faces from visible faces. Our intuition is based on the fact that thermal images also contain some discriminative information about the person for verification. Deep features from a pre-trained Convolutional Neural Network (CNN) are extracted from the original as well as the synthesized images. These features are then fused to generate a template which is then used for verification. The proposed synthesis network is based on the self-attention generative adversarial network (SAGAN) which essentially allows efficient attention-guided image synthesis. Extensive experiments on the ARL polarimetric thermal face dataset demonstrate that the proposed method achieves state-of-the-art performance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.