The Agricultural Policy Environmental eXtender (APEX) model is capable of estimating edge-of-field water, nutrient, and sediment transport and is used to assess the environmental impacts of management practices. The current practice is to fully calibrate the model for each site simulation, a task that requires resources and data not always available. The objective of this study was to compare model performance for flow, sediment, and phosphorus transport under two parameterization schemes: a best professional judgment (BPJ) parameterization based on readily available data and a fully calibrated parameterization based on site-specific soil, weather, event flow, and water quality data. The analysis was conducted using 12 datasets at four locations representing poorly drained soils and row-crop production under different tillage systems. Model performance was based on the Nash-Sutcliffe efficiency (NSE), the coefficient of determination (r 2 ) and the regression slope between simulated and measured annualized loads across all site years. Although the BPJ model performance for flow was acceptable (NSE = 0.7) at the annual time step, calibration improved it (NSE = 0.9). Acceptable simulation of sediment and total phosphorus transport (NSE = 0.5 and 0.9, respectively) was obtained only after full calibration at each site. Given the unacceptable performance of the BPJ approach, uncalibrated use of APEX for planning or management purposes may be misleading. Model calibration with water quality data prior to using APEX for simulating sediment and total phosphorus loss is essential. (P) and sediment loss from agricultural fields continues to degrade fresh water quality despite decades of efforts to understand loss processes and implement management practices to reduce nonpoint pollution (e.g., Sharpley et al., 1994Sharpley et al., , 2015Sims and Kleinman, 2005;Jarvie et al., 2013). Failure of conservation practices to produce expected improvements in water quality has renewed appreciation for the complexity of P movement within landscapes and to waterbodies Sharpley et al., 2013). Processbased watershed-and field-scale models offer the prospect of integrating knowledge to assess and quantify impacts of conservation and management practices.The P Index was developed in the mid-1990s to assess risk of P loss from agricultural land (Lemunyon and Gilbert, 1993) and is now an integral part of the NRCS 590 Nutrient Management Standard and other state and federal programs (Sharpley et al., 2003(Sharpley et al., , 2017. However, the diversity in P Index ratings and P management recommendations for similar conditions led and Osmond et al. (2006) to emphasize the need for science-based assessment and improvement of existing P Indices.Extensive testing of P Indices requires water quality data from field-scale watersheds from a broad range of soils and management scenarios with a sufficient number of monitoring years to estimate long-term average annual losses. Such extensive datasets are rare. Hence, computer models must ...