“…The size of the training set, as well as the accuracy of prior data used in training, play a very central role also in the denoising and segmentation of CT images, where the number of instances in the training set T is significantly smaller than the feature space dimension D , corresponding to the number of voxels. A problem characterized by pertains to the so-called “small-data learning challenge” [ 34 , 35 , 36 , 37 , 38 ], and represents a scenario in which ML and DL approaches are prone to quickly overfit the small training set (which in addition often also contains missig data or incorrectly labeled data) and to achieve an unsatisfactory performance on the validation set [ 39 , 40 , 41 , 42 , 43 ]. To tackle this issue, several alternative approaches have been proposed [ 44 , 45 ], with transfer learning representing one of the most powerful alternatives [ 46 ].…”