“…The core quality factors of CC have been identified in the literature; they include: synchronization delay (between caption and the actual audio), presentation speed, number of missing words between the captions and the audio transcript, number of spelling and grammar mistakes, coloring and positioning, display methods, verbatim state, speaker identification, and the inclusion of captions for non-speech sounds [4,7,8]. In this paper, we selected the five factors to be used including delay, speed, spelling and grammar, missing words, and verbatim state as these have been identified by D/HOH viewers as the five most important factors affecting quality [1,6,9,10]. The verbatim state (or verbatim accuracy) factor of CC represents the quantified accuracy of the translation from the spoken words in audio format to CC word by word [4,5].…” (S. Nam & D. Fels)
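To make two of these factors concrete, the following is a minimal Python sketch of how missing words and verbatim accuracy might be quantified by aligning caption words against the audio transcript. The excerpt does not give formulas, so everything here is an assumption for illustration: the function names `tokenize` and `caption_word_metrics` are hypothetical, missing words are counted as transcript words with no in-order match in the captions (which also counts substituted words), and verbatim accuracy is taken as the matched fraction of transcript words.

```python
from difflib import SequenceMatcher
import re


def tokenize(text):
    """Lowercase the text and split it into words, ignoring punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def caption_word_metrics(transcript, captions):
    """Compare caption words against transcript words, in order.

    Returns (missing_words, verbatim_accuracy). Both definitions are
    illustrative assumptions, not the formulas used in the quoted paper.
    """
    spoken = tokenize(transcript)
    shown = tokenize(captions)
    # Align the two word sequences; autojunk is disabled so that frequent
    # words such as "the" are never discarded by the matching heuristic.
    matcher = SequenceMatcher(a=spoken, b=shown, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    missing_words = len(spoken) - matched
    # An empty transcript is treated as trivially accurate (assumption).
    verbatim_accuracy = matched / len(spoken) if spoken else 1.0
    return missing_words, verbatim_accuracy


transcript = "the quick brown fox jumps over the lazy dog"
captions = "the quick fox jumps over the dog"
missing, accuracy = caption_word_metrics(transcript, captions)
print(missing, round(accuracy, 2))
```

In this example the captions drop "brown" and "lazy", so two of the nine spoken words go unmatched. `SequenceMatcher` is a heuristic aligner rather than an exact edit-distance computation, so a dedicated word-error-rate calculation could be substituted where stricter accuracy is required.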