In this study, the authors consider the problem of image super-resolution (SR) in terms of the perceptual criteria. Existing SR methods treat the traditional mean-squared error (MSE) as an irreplaceable objective function. However, MSE has been widely criticised since it is inconsistent with visual perception of human beings. The perceptual criteria, including the structural similarity (SSIM) index and feature similarity (FSIM) index, have been reported to be more effective in assessing image quality. Therefore SSIM and FSIM are included for the SR task in this study. Specifically, the authors first propose to reform principal component analysis (PCA), which is named as visual perceptual PCA (VP-PCA), by adopting SSIM as the object function. Subsequently, to accomplish the SR task, the authors cluster the training data and perform VP-PCA on each cluster to calculate the coefficients. Finally, based on the principle of FSIM, the traditional SR results and the SR results using VP-PCA are combined to form our fused results. Experimental results are provided to show the superiority of the proposed method over several state-of-the-art methods in both quantitative and visual comparisons.