Research on viewpoint in 3D scenes has been a cutting-edge subject in computer vision and 3D scene understanding. Calculating a high quality 3D viewpoint in virtual scenes is conducive to a more comprehensive analysis of 3D graphic structure, which can contribute to an overall understanding of virtual environments. 3D viewpoint research can also be applied to the analysis of structural relationships in virtual scenes and to exploit hidden hierarchical structures, thus it plays a role in fields such as scientific visualization computing or image-based scene modeling. Viewpoint research combined with relevant research results of human visual perception or visual psychology can be used to analyze the visual interest and focus areas of 3D objects in human vision, thus effectively improving the quality of mesh simplification and rendering efficiency, as well as providing an optimization basis for highly complex scenes. In fields such as 3D computer games, virtual reality, and landscape animation, research on high-quality viewpoints can be adopted to optimize global lighting and illumination. Additionally, practical applications can be realized through user attention analysis and aesthetic improvement strategies based on high-quality viewpoints, such as intelligent modeling and scene optimization calculations. In common fields such as 3D model retrieval, practical applications can be realized for saliency research, scientific visualization computing, and medical 3D imaging. In-depth research has been performed and extensive applications have been developed that make use of viewpoint analysis. In this paper, we introduce an optimization strategy to calculate high-quality virtual viewpoints for aesthetic images. A novel framework is proposed for modifying and optimizing the viewpoint calculation model by combining a multibranch CNN and a viewpoint correction method, thus realizing rendered images with a higher aesthetic quality. By attempting to fully integrate the visual perception with the geometric information calculation, we expect this method to achieve more comprehensive applications in many practice areas in the future, such as the realization of aesthetically based virtual camera path planning and analysis of the aesthetic characteristics of virtual environments.