“…In recent years, meeting speech recognition (Maganti et al, 2007;Nasu et al, 2011) and meeting speaker diarization (Boakye et al, 2008;Ben-Harush et al, 2009;Stolcke et al, 2010;Sun et al, 2010;Valente et al, 2010;Vijayasenan et al, 2010;Boakye et al, 2011;Stolcke, 2011;Valente et al, 2011;Yella et al, 2011;Vijayasenan et al, 2012;Zwyssig et al, 2012) have been effectively utilized to transcribe and browse meeting procedures. However, their performance is usually low at the overlapped speech segments where more than one speaker is speaking.…”