Canopy height is an important forest structure parameter for understanding forest ecosystem and improving global carbon stock quantification accuracy. Light detection and ranging (LiDAR) can provide accurate canopy height measurements, but its application at large scales is limited. Using LiDAR-derived canopy height as ground truth to train the Random Forest (RF) algorithm and therefore predict canopy height from other remotely sensed datasets in areas without LiDAR coverage has been one of the most commonly used method in large-scale canopy height mapping. However, how variances in location, vegetation type, and spatial scale of study sites influence the RF modelling results is still a question that needs to be addressed. In this study, we selected 16 study sites (100 km2 each) with full airborne LiDAR coverage across the United States, and used the LiDAR-derived canopy height along with optical imagery, topographic data, and climate surfaces to evaluate the transferability of the RF-based canopy height prediction method. The results show a series of findings from general to complex. The RF model trained at a certain location or vegetation type cannot be transferred to other locations or vegetation types. However, by training the RF algorithm using samples from all sites with various vegetation types, a universal model can be achieved for predicting canopy height at different locations and different vegetation types with self-predicted R2 higher than 0.6 and RMSE lower than 6 m. Moreover, the influence of spatial scales on the RF prediction accuracy is noticeable when spatial extent of the study site is less than 50 km2 or the spatial resolution of the training pixel is finer than 500 m. The canopy height prediction accuracy increases with the spatial extent and the targeted spatial resolution.