In large‐scale watersheds, only the calibration of parameters for the outlet may ignore spatial heterogeneity and fail to meet the simulation accuracy of local areas. Based on the runoff observation data of three hydrological stations (Qingyang, Yangjiaping and Zhangjiashan) in the Jinghe River Watershed, single‐ and multi‐site calibration strategies were designed to investigate the performance and applicability of the Soil and Water Assessment Tool (SWAT) in predicting the monthly runoff of nested catchments and evaluate the differences in parameter sensitivity between the upstream and downstream. (i) The two calibration strategies showed little difference in the simulation accuracy of the watershed outlet; the Nash–Sutcliffe efficiency (NSE) and determination coefficient (R2) were approximately 0.8 and 0.6, respectively, and the percentage of bias (PBIAS) was mostly <15%. The single‐site one‐by‐one calibration was significantly better than the multi‐site joint calibration at the interior points of the watershed. Among them, Qingyang decreased most obviously in the joint calibration with Zhangjiashan, with both NSE and R2 < 0.5, whereas Yangjiaping was >0.5 in combination with the other two sites. (ii) The sensitive parameters obtained by the two calibration strategies were different, and different parameters were effective for different‐sized catchments because hydrological processes exhibit astounding variability at all scales. The single‐site one‐by‐one calibration was mainly affected by the hydrological processes of the local catchment, whereas the multi‐site joint calibration was largely constrained by the sensitive parameters of the single‐site calibration. (iii) Parameters at different scales and locations were related to the problem of scaling and regionalization. The multi‐site joint calibration did not significantly improve the model performance of the watershed outlet but increased the capacity for synchronous identification of local sensitive parameters when compared to single‐site calibration. These findings are expected to provide experience for parameter calibration in other similar large‐scale watersheds and to lay a good foundation for the accurate identification of other related hydrological behaviours at watershed scales.