“…Firstly, the variability in study design, radiomic methods employed, texture features extracted, and recorded endpoints makes it difficult to compare any two techniques and to perform quantitative analysis. Secondly, most ML and DL algorithms used in these studies were validated only on their own datasets; therefore, without external validation, the generalizability and reproducibility of their results in other datasets and populations cannot be established [62]. Thirdly, repeatability, reproducibility, sample size, statistical power, and standardization remain vital factors to be considered in future investigations [63].…”