Transformer oil can dissolve a certain amount of gas, which can provide an important basis for transformer fault diagnosis. The relationship between the characteristic amount of dissolved gas in transformers and the type of transformer fault was analyzed, and the main influencing factors were extracted. We studied a transformer fault diagnosis method based on the improved random forest algorithm and combined it with the identification requirements of abnormal data in multiple parts of transformers. We improved the random forest from four aspects: dataset construction, bootstrap sampling, decision tree generation, and multi-node voting. We established an improved random forest diagnosis model with oil chromatogram feature gas ratio as the characteristic parameter and analyzed different input parameter combinations. The effectiveness of improving the random forest diagnosis model under different training times was compared with various machine learning methods. The experimental results show that the diagnostic accuracy of the improved random forest diagnostic model is significantly improved compared to other machine learning methods.