Machine health monitoring has become increasingly important in modern manufacturing because it reduces machine downtime and lowers production cost. The large volumes of signals acquired from machinery can reveal current working conditions when analyzed in depth with data-driven methods. In traditional methods, hand-crafted feature extraction and representation are essential but daunting tasks, and these methods may not scale to such massive data. In contrast, deep learning methods learn suitable feature combinations during model training without manual intervention, which makes machine health monitoring easier, more efficient, and more effective; however, their training cost and training time hamper their application. In this work, the short-time Fourier transform is adopted as the data preprocessing method to cut down the training cost and speed up training. Inspired by the success of ResNet, a new optimized model based on ResNet is proposed that reduces the dimensions of the feature maps layer by layer. The proposed model also avoids the information loss associated with conventional pooling layers. All candidate model blocks are introduced and compared, and the best-performing one is selected for the final model. The selected block is stacked repeatedly to learn feature combinations and is followed by two fully connected layers that produce the final targets. The proposed method is validated by experiments on bearing fault diagnosis and tool wear prediction datasets. The results show that the proposed model achieves the best accuracy in the classification task and the lowest root mean squared error in the prediction task.
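To make the preprocessing step concrete, the following is a minimal sketch of how a short-time Fourier transform turns a one-dimensional signal into a time-frequency magnitude map that a convolutional model could take as input. It is an illustration under stated assumptions, not the configuration used in this work: the function name `stft_magnitude`, the Hann window, the frame length of 64, the hop of 32, and the synthetic tone are all hypothetical choices, and the naive DFT stands in for the FFT that a practical pipeline would use.

```python
import cmath
import math


def stft_magnitude(signal, frame_len=64, hop=32):
    """Return a time-frequency magnitude map as a list of frames.

    Hypothetical helper for illustration; window type, frame length,
    and hop size are assumptions, not values taken from the paper.
    """
    # Periodic Hann window to reduce spectral leakage at frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # one-sided spectrum of a real signal
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        segment = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(n_bins):
            # Naive DFT, O(frame_len^2); a real pipeline would use an FFT.
            acc = sum(segment[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Synthetic stand-in for a vibration signal: a pure tone centred on bin 8.
frame_len, hop = 64, 32
tone = [math.sin(2 * math.pi * 8 * n / frame_len) for n in range(256)]
spec = stft_magnitude(tone, frame_len, hop)
```

With these assumed settings, a 256-sample signal yields 7 overlapping frames of 33 frequency bins each, and the energy of the synthetic tone concentrates in bin 8 of every frame. The resulting two-dimensional map is much smaller than a stack of raw waveform windows of comparable duration, which is one plausible reason such preprocessing can reduce training cost.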