Mechanical defects and partial discharge (PD) defects can appear in the indoor switchgear of substations or distribution stations, making the switchgear a safety hazard. However, traditional acoustic methods detect and identify these two types of defects separately, ignoring the general recognition of audio signals. In addition, the process of using testing equipment is complex and costly, which is not conducive to timely testing and widespread application. To assist technicians in making a quick preliminary diagnosis of defect types for switchgear, improve the efficiency of the subsequent overhaul, and reduce the cost of detection, this paper proposes a general audio recognition method for identifying defects in switchgear using a smartphone. Using this method, we can analyze and identify audio and video files recorded with smartphones and synchronously distinguish background noise, mechanical vibration, and PD audio signals, which have good applicability within a certain range. When testing the feasibility of using smartphones to identify three types of audio signal, through characterizing 12 sets of live audio and video files provided by technicians, it was found that there were similarities and differences in these characteristics, such as the autocorrelation, density, and steepness of the waveforms in the time domain, and the band energy and harmonic components of the frequency spectrum, and new combinations of features were proposed as applicable. To compare the recognition performance for features in the time domain, frequency band energy, Mel-frequency cepstral coefficient (MFCC), and this method, feature vectors were input into a support vector machine (SVM) for a recognition test, and the recognition results showed that the the present method had the highest recognition accuracy. Finally, a set of mechanical defects and PD defects were set up for a switchgear, for practical verification, which proved that this method was general and effective.