“…Through this approach, these methods aim to add or enhance missing or modified features and increase the signal similarity of whispers to normal speech. In general, these reconstruction methods can be classified into two major groups of train- These reconstruction methods (either training-based or non-training) suffer from range of disadvantages including problems in converting continuous speech (due to using phoneme switching) [20], being computationally expensive (due to using highly overlapped frames for spectral enhancement, or using jump Markov linear system for pitch and voicing parameters) [19,4], and more importantly lack of naturalness in regenerated output (due to simplified time alignment and spectral features assumptions) [21,23]. In this paper, we focus on a trainingbased approach, and propose a novel reconstruction algorithm to improve the efficiency in phonated speech regeneration.…”