Conventional acoustic echo cancellation works by using an adaptive algorithm to identify the impulse response of the echo path. In this paper, we use the CNN(convolutional neural network) filter to remove the echo signal from the microphone input signal, so that only the speech signal is transmitted to the far-end. Using the neural network filter, weights are well converged by the general speech signal. Especially, it shows the ability to perform stable operation without divergence even in the double-talk state, in which both parties speak simultaneously. As a result of simulation, this system showed better performance and stable operation compared to the echo canceler of the adaptive filter structure. And, in double-talk, we showed the ERLE in the CNN is about 3 [dB] better than in the general neural network.