ObjectiveThe inter-rater reliability of the modified Knosp's classification was measured before the analysis. The clinical validity of the parasellar extension grading system was evaluated by investigating the extents of resection and complication rates among the grades in the endoscopic endonasal transsphenoidal surgery (EETS) for pituitary adenomas.MethodsFrom November 2008 to August 2015, of the 286 patients who underwent EETS by the senior author, 208 were pituitary adenoma cases (146 non-functioning pituitary adenomas, 10 adrenocorticotropic hormone-secreting adenomas, 31 growth hormone-secreting adenomas, 17 prolactin-secreting adenomas, and 4 thyroid-stimulating hormone-secreting adenomas; 23 microadenomas, 174 macroadenomas, and 11 giant adenomas). Two neurosurgeons and a neuroradiologist independently measured the degree of parasellar extension on the preoperative sellar MRI according to the modified Knosp's classification. Inter-rater reliability was statistically assessed by measuring the intraclass correlation coefficient. The extents of resection were evaluated by comparison of the pre- and post-operative MR images; the neurovascular complications were assessed by reviewing the patients' medical records. The extent of resection was measured in each parasellar extension grade; thereafter, their statistical differences were calculated.ResultsThe intraclass correlation coefficient value of reliability across the three raters amounted to 0.862. The gross total removal (GTR) rates achieved in each grade were 70.0, 69.8, 62.9, 21.4, 37.5, and 4.3% in Grades 0, 1, 2, 3A, 3B, and 4, respectively. A significant difference in the extent of resection was observed only between Grades 2 and 3A. In addition, significantly higher complication rates were observed in the groups above Grade 3A.ConclusionAlthough the modified Knosp's classification system appears to be complex, its inter-rater reliability proves to be excellent. Regarding the clinical validity of the parasellar extension grading system, Grades 3A, 3B, and 4 have a negative predictive value for the GTR rate, with higher complication rates.