To improve the robustness of distributed speech front-ends in mobile communication we introduce, in this paper, a new set of feature vector which is estimated through three steps. First, the Mel-Line Spectral Frequencies (MLSFs) coefficients are combined with conventional MFCCs, after extracted from a denoised acoustic frame using the wiener filter. Also, we optimize the stream weights of multi-stream HMMs by deploying a discriminative approach. Finally, these features are adequately transformed and reduced in a multi-stream scheme using Karhunen-Loeve Transform (KLT). Recognition experiments on the Aurora 2 connected digits database reveal that the proposed front-end leads to a significant improvement in speech recognition accuracy for highly noisy GSM.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.