Siteâspecific investigations of the role of radiomics in cancer diagnosis and therapy are emerging. We evaluated the reproducibility of radiomic features extracted from 18Flourineâfluorodeoxyglucose (18FâFDG) PET images for three parameters: manual versus computerâaided segmentation methods, grayâlevel discretization, and PET image reconstruction algorithms. Our cohort consisted of pretreatment PET/CT scans from 88 cervical cancer patients. Two boardâcertified radiation oncologists manually segmented the metabolic tumor volume (MTV1 and MTV2) for each patient. For comparison, we used a graphicalâbased method to generate semiautomated segmented volumes (GBSV). To address any perturbations in radiomic feature values, we downâsampled the tumor volumes into three grayâlevels: 32, 64, and 128 from the original grayâlevel of 256. Finally, we analyzed the effect on radiomic features on PET images of eight patients due to four PET 3Dâreconstruction algorithms: maximum likelihoodâordered subset expectation maximization (OSEM) iterative reconstruction (IR) method, fourier rebinningâMLâOSEM (FOREIR), FOREâfiltered back projection (FOREFBP), and 3DâReprojection (3DRP) analytical method. We extracted 79 features from all segmentation method, grayâlevels of downâsampled volumes, and PET reconstruction algorithms. The features were extracted using grayâlevel coâoccurrence matrices (GLCM), grayâlevel size zone matrices (GLSZM), grayâlevel runâlength matrices (GLRLM), neighborhood grayâtone difference matrices (NGTDM), shapeâbased features (SF), and intensity histogram features (IHF). We computed the Dice coefficient between each MTV and GBSV to measure segmentation accuracy. Coefficient values close to one indicate high agreement, and values close to zero indicate low agreement. We evaluated the effect on radiomic features by calculating the mean percentage differences (dÂŻ) between feature values measured from each pair of parameter elements (i.e. segmentation methods: MTV1âMTV2, MTV1âGBSV, MTV2âGBSV; grayâlevels: 64â32, 64â128, and 64â256; reconstruction algorithms: OSEMâFOREâOSEM, OSEMâFOREFBP, and OSEMâ3DRP). We used false|normaldÂŻfalse| as a measure of radiomic feature reproducibility level, where any feature scored false|normaldÂŻfalse| ±SD †|25|% ± 35% was considered reproducible. We used BlandâAltman analysis to evaluate the mean, standard deviation (SD), and upper/lower reproducibility limits (U/LRL) for radiomic features in response to variation in each testing parameter. Furthermore, we proposed U/LRL as a method to classify the level of reproducibility: Highâ ±1% †U/LRL †±30%; Intermediateâ ±30% < U/LRL †±45%; Lowâ ±45 < U/LRL †±50%. We considered any feature below the low level as nonreproducible (NR). Finally, we calculated the interclass correlation coefficient (ICC) to evaluate the reliability of radiomic feature measurements for each parameter. The segmented volumes of 65 patients (81.3%) scored Dice coefficient >0.75 for all three volumes. The result outcomes revealed a tendency of higher radiomic fe...