“…Next, the stability of the raters' internal standards over time must derive from experience and skill and be objectively measured using intra-rater reliability tasks if we are to compare one subject's perceptual ranking with another, or a subject's first ranking with a subsequent ranking [17,20,21]. Although several studies describing vocal quality following LTR have addressed the issue of inter-rater reliability on a subjective assessment tool [7,10,15,16], none have reported detail about the background and years of experience of the raters, or the number of raters used. The methods used for rating (i.e., the use of a likert scale versus a visual analog scale, instructions provided to the raters) were also not consistently provided.…”