An axle box bearing is one of the most important components of high-speed EMUs (electric multiple units), which runs at a very fast speed, suffers a heavy load, and operates under various complex working conditions. Once a bearing fault occurs, it not only has an enormous impact on the railway system, but also poses a threat to personal safety. Therefore, there is significant value in studying a real-time fault early warning of a high-speed EMU axle box bearing. However, to our best knowledge, there are three obvious defects in the existing fault early warning methods used for high-speed EMU axle box bearings: (1) these methods based on vibration are extremely mature, but there are no vibration sensors installed in high-speed EMU axle box because it will greatly increase the manufacturing cost; (2) a TADS (trackside acoustic device system) can effectively detect early failures, but only a portion of railways are equipped with such a facility; and (3) an EMU-ODS (electric multiple unit onboard detection system) has reported numerous untimely warnings, along with warnings of frequent occurrence being missed. Whereupon, a method is proposed to realize the fault early warning of an axle box bearing without installing a vibration sensor on the high-speed EMU in service, namely a MLSTM-iForest (multilayer long short-term memory–isolation forest). First, the time-series data of the temperature-related variables of the axle box bearing is used as the input of MLSTM to predict the axle box bearing temperature in the future. Then, the deviation index of the predicted axle box bearing temperature is calculated. Finally, the deviation index is input into an iForest algorithm for unsupervised classification to realize the fault early warning of an axle box bearing. Experimental results on high-speed EMU operation data sets demonstrated the availability and feasibility of the presented method toward achieving early fault warnings of a high-speed EMU axle box bearing.