Detecting aircraft from remote sensing image (RSI) is an important but challenging task due to the variations of aircraft type, size, pose, angle, complex background and small size of aircraft in RSIs. An aircraft detection method is proposed based on multi-scale convolution neural network with attention (MSCNNA), consisting of encoder, decoder, attention and classification. In MSCNNA, the multiple convolutional and pooling kernels with different sizes are utilized to learn the multi-scale discriminant features, and the global attention mechanism (GAM) is employed to capture the spatial and channel dependencies and adaptively preserve the relationships of the entire image. Compared with the standard deep CNN, multi-scale convolution neural networks (CNN) and GAM are integrated to learn the multi-scale features for aircraft detection, especially small aircrafts. Experiment results on the aircraft image dataset of the public EORSSD dataset show that the proposed method outperforms the state-of-the-art method on the same dataset and the obtained multi-size aircraft edge is clearer.