Psychological stress cannot be ignored in today's society, and there is an urgent need for an objective and cost-effective method to detect it. However, traditional machine learning methods rely on manual feature extraction, which is time-consuming and does not guarantee accuracy. In this paper, we establish a four-category multimodal stress dataset by collecting EEG and ECG signals from 24 subjects performing mental arithmetic tasks of different difficulty levels, and we propose a multimodal decision fusion model based on convolutional neural networks (CNNs) to classify the data. The prediction probabilities of the EEG and ECG signals for the four stress categories are first extracted by two models each and then fused in the decision model for the final classification. In 5-fold cross-validation and Leave-Three-Subjects-Out experiments, the model achieves 91.14% and 91.97% accuracy, respectively. In addition, the features of the convolutional layer are visualized using the 1D-Grad-CAM method to improve the interpretability of the model.
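
As a rough illustration of the decision-level fusion and subject-wise evaluation described above, the following minimal sketch shows how per-model class probabilities could be combined into one input for a decision model, and how subjects could be held out in groups of three. It is not the implementation used in this work. The function names `fuse_probabilities` and `leave_three_subjects_out` are hypothetical, the sketch reads "two models each" as two models per modality (four probability vectors per sample), and it assumes the 24 subjects are split into 8 disjoint held-out groups; the abstract does not confirm any of these details.

```python
import numpy as np

N_CLASSES = 4  # four stress categories, as stated in the abstract


def fuse_probabilities(prob_list):
    """Concatenate per-model class probabilities into one feature row per sample.

    prob_list: list of arrays of shape (n_samples, N_CLASSES), one per model.
    Assumption: simple concatenation; the actual fusion scheme may differ.
    """
    for p in prob_list:
        if p.ndim != 2 or p.shape[1] != N_CLASSES:
            raise ValueError("each array must have shape (n_samples, 4)")
    return np.concatenate(prob_list, axis=1)


def leave_three_subjects_out(subject_ids, group_size=3):
    """Yield (train_idx, test_idx) pairs, holding out disjoint groups of subjects.

    Assumption: subjects are grouped in sorted order into non-overlapping
    blocks of `group_size`; with 24 subjects this gives 8 folds.
    """
    subject_ids = np.asarray(subject_ids)
    subjects = np.unique(subject_ids)
    for start in range(0, len(subjects), group_size):
        held_out = subjects[start:start + group_size]
        test_mask = np.isin(subject_ids, held_out)
        yield np.where(~test_mask)[0], np.where(test_mask)[0]
```

With four models, `fuse_probabilities` returns an array of shape `(n_samples, 16)` that a small classifier could take as input. Splitting by subject rather than by sample keeps all recordings of a held-out subject out of the training set, which is the point of a subject-wise evaluation.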