-2010-2015). Findings: The models were derived from weighted averages of candidate models submitted by ten international teams. Teams were led by the British Geological Survey (UK), DTU Space (Denmark), ISTerre (France), IZMIRAN (Russia), NOAA/NGDC (USA), GFZ Potsdam (Germany), NASA/GSFC (USA), IPGP (France), LPG Nantes (France), and ETH Zurich (Switzerland). Each candidate model was carefully evaluated and compared to all other models and a mean model using well-defined statistical criteria in the spectral domain and maps in the physical space. These analyses were made to pinpoint both troublesome coefficients and the geographical regions where the candidate models most significantly differ. Some models showed clear deviation from other candidate models. However, a majority of the task force members appointed by IAGA thought that the differences were not sufficient to exclude models that were well documented and based on different techniques. Conclusions: The task force thus voted for and applied an iterative robust estimation scheme in space. In this paper, we report on the evaluations of the candidate models and provide details of the algorithm that was used to derive the IGRF-12 product.