“…By virtue of the developed techniques, a variety of functionalities were created to help distill important content from multimedia collections, or provide locations of important speech segments in a video accompanied with their corresponding transcripts, for users to listen to or to digest. Statistical language modeling (LM) (Jelinek, 1999;Jurafsky and Martin, 2008;Zhai, 2008), which manages to quantify the acceptability of a given word sequence in a natural language or capture the statistical characteristics of a given piece of text, has been proved to offer both efficient and effective modeling abilities in many practical applications of natural language processing and speech recognition (Ponte and Croft, 1998;Jelinek, 1999;Huang, et al, 2001; a ; Jurafsky and Martin, 2008;Furui et al, 2012;Liu and Hakkani-Tur, 2011).…”