Over the past few years, 3D display technology has improved so much that it is being used in many fields, such as education, medicine and the military. 3D holographic display is seen as the ultimate solution to 3D display, but one of the problems is that there is not enough 3D content available. Conventional methods to obtain 3D content from real scenes using lightfield cameras or RGB-D cameras are complicated. Here, we proposed a 3D scene acquisition and reconstruction system based on optical axial scanning. First an electrically tunable lens (ETL) was used for high-speed focus shift (up to 2.5 ms). A CCD camera is synchronized with the ETL to acquire multi-focused image sequence of real scene. Then, The Tenengrad operator was used to obtain the focusing area of each multi-focused image, and the 3D image were obtained. Finally, the Computer-Generated Hologram (CGH) can be obtained by the layer-based diffraction algorithm. The CGH was loaded onto the space light modulator (SLM) to reconstruct the 3D holographic image. The experimental results verify the feasibility of the system. This method will expand the application of 3D holographic display in the field of education, advertising, entertainment, and other fields.