“…In the typical objective evaluation and intercomparison of these models, a suite of standardized statistical metrics (e.g., correlation, root-mean-squared errors) are applied to quantify differences between modeled and observed variables (e.g., Doney et al, 2009;Rose et al, 2009;Stow et al, 2009;Romanou et al, 2013Romanou et al, , 2014. With the goal of constraining future projections, statistical metrics are often used for model ranking (e.g., Anav et al, 2013), weighting of model projections (e.g., Steinacher et al, 2010) or selection of the most skillful models across a wider ensemble (e.g., Massonnet et al, 2012;Wenzel et al, 2014).…”