Based on the good performance of deep reinforcement learning (DRL) in policy optimization, a stereoscopic projection policy optimization method is proposed, which combines the simulation experiment method with the DRL method. On the basis of policy optimization research, a deep learning framework is selected according to the research problems, and a DRL stereoscopic project policy model based on the asynchronous advantage actor–critic (A3C) algorithm, which uses two groups of neural networks, is constructed. The optimized stereoscopic projection policy is obtained by the interactive learning between the DRL model and the simulation. The effectiveness of the cooperative optimization policy between the DRL and the simulation experiment is verified.