To address the low accuracy of current one-dimensional signal recognition for coal-rock cutting vibration and the low efficiency of traditional static neural networks, this paper proposes an interpretable recognition method that combines Markov Transition Field (MTF) and channel-selective neural networks. Firstly, by decomposing and reconstructing the energy of wavelet packets and denoising the signal, it is transformed into MTF images with temporal correlation to improve signal processing efficiency. Then, a channel selective module is proposed to replace traditional convolutional layers, enhancing the model's ability to extract data features. A spatial shift mechanism is improved to alleviate the problem of weight degradation and improve the model's generalization capability. The test results on a self-constructed experimental dataset show that this method achieves a recognition accuracy of up to 97.375% without increasing additional parameters, significantly outperforming traditional methods and maintaining good robustness in noisy environments. This study improves the efficiency and accuracy of coal-rock cutting state recognition and provides new ideas for signal processing in environments with limited underground computing resources.