The optimization of aircraft is a typical multidisciplinary and multi-objective problem. To solve this problem, the difficulty lies not only in the high cost of discipline performance evaluation but also in the complex coupling relationship between different disciplines. To improve the optimization efficiency, a new optimization method is proposed, including two new algorithms: conditional generative adversarial nets with vector similarity (VS-CGAN) and distributed single-step deep reinforcement learning with transfer learning (TL-DSDRL). For low-cost disciplines, VS-CGAN learns the relationship between variables and objectives through presampling to compress the variable domains. The cosine function is used to evaluate the similarity between the random noise and generated variables to avoid mode collapse. For high-cost disciplines, TL-DSDRL improves optimization efficiency through pretraining. The newly designed reward function and multi-agent cooperation mechanism enhance the multi-objective search ability of reinforcement learning.