Gearbox failures have a detrimental effect on the machine performance that affects the production capacity and economic benefits of a manufacturing industry in an adverse manner. Early detection of faults in gears helps to prevent sudden machine failures. Previously, many signal processing and artificial intelligence techniques have been successfully applied to diagnose gear faults by analyzing the machine vibrations. These techniques have repeatedly emphasized on the need to extract quality features from gear vibration signals and using artificial intelligence techniques that can identify multiple types of faults simultaneously. As such, in this article, new features called as visibility graph features based on complex network theory have been proposed to extract fault information from vibration signals. In addition, a new artificial intelligence framework for multifault classification known as error-correcting output codes-multiclass support vector machine is proposed to detect gear faults. To the best of authors’ knowledge, these features along with the error-correcting output codes-artificial intelligence framework have been rarely utilized for fault diagnosis purpose. The proposed approach has been applied on experimental gear vibration data to validate its effectiveness. First, the vibration signals are transformed to graphs, and various graph properties are computed to serve as fault features. Then, the fault features are supplied to train the error-correcting output codes-multiclass support vector machine model and learn the fault patterns. Finally, the gear fault classification accuracies are determined for various gear test conditions and compared with those of existing methods. It is observed that the proposed approach provides an average improvement of 4.22%, 9.93% and 11.8% over the time-domain features, time–frequency domain features and voting-based multiclass support vector machine classifier, respectively, for different operating conditions. Furthermore, the suggested technique augments the classification accuracies by 10%, 7.5% and 6.7%, respectively, as compared with the deep-learning models, namely standard stacked sparse autoencoder, deep neural network and convolution neural network.