“…However, spoken documents bring extra difficulties such as recognition errors, problems with spontaneous speech, and the lack of correct sentence or paragraph boundaries. To avoid redundant or incorrect parts while selecting the important and correct information, multiple recognition hypotheses, confidence scores, language model scores, and other grammatical knowledge have been utilized [3,7]. In addition, prosodic features (e.g., intonation, pitch, energy, pause duration) can serve as important clues for summarization, although reliable and efficient approaches to using these features are still under active research [8,9].…”
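
As a rough illustration of how recognizer confidence scores might enter sentence selection, the sketch below weights a simple term-frequency significance score by each word's confidence and keeps the top-scoring sentences. This is not the method of the cited works [3,7]; the `Word` type, the `score_sentence` and `summarize` functions, and the specific weighting are all hypothetical choices made for this example, and it assumes the recognizer output has already been split into sentences.

```python
import math
from collections import Counter
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    confidence: float  # assumed: recognizer confidence in [0, 1]


def score_sentence(words, term_counts):
    """Mean of log-scaled term frequency times recognizer confidence.

    Illustrative weighting only: frequent terms count as more significant,
    and low-confidence (likely misrecognized) words contribute less.
    """
    if not words:
        return float("-inf")
    total = 0.0
    for w in words:
        significance = math.log(1 + term_counts[w.text.lower()])
        total += significance * w.confidence
    return total / len(words)


def summarize(sentences, k):
    """Return the indices of the k best-scoring sentences, in document order."""
    # Term counts over the whole transcript serve as a crude significance measure.
    term_counts = Counter(w.text.lower() for s in sentences for w in s)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: score_sentence(sentences[i], term_counts),
        reverse=True,
    )
    return sorted(ranked[:k])
```

Under this weighting, a sentence made of frequent terms recognized with high confidence outranks one dominated by fillers or low-confidence words. Prosodic features and language model scores, which the passage also mentions, are left out of the sketch.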