“…Data collected is represented by direct data entry fields (manual or automatic) in the treatment planning system (e.g., tumor site, treatment intent, ready to treat dates, patient setup and desired technique employed in treatment planning), as well as tumor and normal tissue volumes delineated by the clinician and RT dose delivered using multiple DVH (Dose Volume Histogram) (Figure 1). These large-scale datasets can be employed for administrative purposes, such as capturing the number of patients on treatment that share a common histology or planning technique, but are also most relevant to computational approaches that involve artificial intelligence (AI) [3,4,[9][10][11][12][13], machine learning (ML) [2,[4][5][6][7]9,[13][14][15][16][17][18][19][20][21][22], deep learning (DL) [2,3,11,[23][24][25][26][27], ground truth [7,13] and radiogenomics [2,13,14,[28][29][30][31][32][33][34] (See Table…”