“…These studies established whether there was greater IOV in one dataset than in another, independent of the magnitude, shape or other descriptors of the volumes being delineated. They were generally undertaken to determine whether some factor resulted in greater concordance between clinicians, such as a different imaging modality [20,21,24–26,32,37,41,45,57,58,68,69,73,76–79,84,87,88,96,101,132], fiducials [16], training [36,56,62,75,122], guidelines or an atlas [46,55,74,83,97,107,108,114,126,133], or provision of autosegmented contours to edit [15,51,73,80,82,94]. For large datasets comparing only two situations, the paired t-test was commonly used to assess statistical significance…”
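
To make the two-situation comparison concrete, the following is a minimal sketch of the paired t-test statistic, assuming each case yields one IOV measure under each of two conditions (for example, the same cases contoured on two imaging modalities). The function name `paired_t_statistic` and the sample values are hypothetical illustrations, not taken from any of the cited studies.

```python
import math
import statistics


def paired_t_statistic(condition_a, condition_b):
    """Return (t, degrees_of_freedom) for a paired t-test.

    Each position in the two sequences must refer to the same case,
    measured once under each condition.
    """
    if len(condition_a) != len(condition_b):
        raise ValueError("paired samples must have the same length")

    # The test operates on the per-case differences, not on the raw values.
    diffs = [a - b for a, b in zip(condition_a, condition_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("at least two pairs are required")

    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("all differences are identical; t is undefined")

    # t = mean difference divided by its standard error.
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1


# Hypothetical per-case IOV values (arbitrary units) under two conditions.
iov_condition_a = [0.31, 0.27, 0.40, 0.35, 0.29, 0.33]
iov_condition_b = [0.25, 0.26, 0.31, 0.30, 0.27, 0.28]

t_value, dof = paired_t_statistic(iov_condition_a, iov_condition_b)
print(f"t = {t_value:.3f}, degrees of freedom = {dof}")
```

The sketch stops at the t statistic and degrees of freedom; a p-value requires the t distribution, which is available, for instance, through `scipy.stats.ttest_rel` if SciPy is installed.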