A perceptual hash authentication algorithm for speech, based on MDCT coefficients, is proposed to address the heavy computation and poor real-time performance of traditional authentication algorithms when applied to compressed-domain speech. First, the algorithm extracts MDCT coefficients by partially decoding speech in MP3 format. The MDCT coefficients of each speech frame are then passed through a Mel filter in the compressed domain to form a 15-dimensional MFCC vector. Finally, a perceptual hash string is generated by the hash construction and used to authenticate the speech content. Experimental results show that the algorithm is strongly robust to content-preserving operations and has good real-time performance.
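The pipeline described in this abstract (per-frame MDCT coefficients, Mel filtering, a 15-dimensional MFCC vector, then a hash) might be sketched roughly as follows. This is only an illustration under stated assumptions: the MDCT coefficients are taken as already extracted from the MP3 stream, the filter count (20), the 576 bins per frame, and the 44.1 kHz sample rate are illustrative choices, and the hash rule (one bit per coefficient, set when the coefficient rises from one frame to the next) is a hypothetical stand-in, since the abstract does not specify the hash construction.

```python
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_bins, sample_rate):
    """Triangular filters spaced evenly on the Mel scale over n_bins MDCT bins."""
    nyquist = sample_rate / 2.0
    top = hz_to_mel(nyquist)
    # n_filters + 2 edge points, expressed as fractional bin positions
    edges = [mel_to_hz(top * i / (n_filters + 1)) / nyquist * (n_bins - 1)
             for i in range(n_filters + 2)]
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = edges[j - 1], edges[j], edges[j + 1]
        row = []
        for k in range(n_bins):
            if lo < k <= mid:
                row.append((k - lo) / (mid - lo))
            elif mid < k < hi:
                row.append((hi - k) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def frame_features(mdct, bank, n_coeffs=15):
    """Log Mel energies of one frame followed by a DCT-II, keeping coefficients 1..n_coeffs."""
    energies = [math.log(sum(w * c * c for w, c in zip(row, mdct)) + 1e-10)
                for row in bank]
    n = len(energies)
    return [sum(e * math.cos(math.pi * q * (i + 0.5) / n) for i, e in enumerate(energies))
            for q in range(1, n_coeffs + 1)]

def perceptual_hash(frames, bank):
    """Assumed hash rule: bit is 1 when a coefficient increases between adjacent frames."""
    feats = [frame_features(f, bank) for f in frames]
    bits = []
    for prev, cur in zip(feats, feats[1:]):
        bits.extend(1 if c > p else 0 for p, c in zip(prev, cur))
    return bits

def bit_error_rate(a, b):
    return sum(x != y for x, y in zip(a, b)) / len(a)

# Usage with synthetic "MDCT" frames (real input would come from a partial MP3 decode).
bank = mel_filterbank(20, 576, 44100)
frames = [[math.sin(0.37 * k * (t + 1)) + 0.5 * math.cos(1.3 * k + t)
           for k in range(576)] for t in range(6)]
hash_a = perceptual_hash(frames, bank)
hash_louder = perceptual_hash([[2.0 * c for c in f] for f in frames], bank)
```

Because the zeroth DCT coefficient is dropped, a uniform gain change shifts every log energy by the same constant and leaves the retained coefficients essentially unchanged, which is one reason a scheme of this shape can tolerate volume adjustment.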
Because traditional speech authentication algorithms are not well suited to present-day speech communication, we propose a perceptual hashing speech authentication algorithm based on immittance spectral pairs (ISP) that meets the efficiency and robustness requirements of speech authentication. First, the speech signal is pre-processed by framing and windowing, and the ISP parameters of each frame are computed and assembled into an ISP parameter matrix. Cepstral mean and variance normalization (CMVN) is then applied to this matrix, which effectively improves robustness against Gaussian white noise. Next, the normalized parameter matrix is decomposed by non-negative matrix factorization (NMF). Finally, the resulting weight matrix is quantized to obtain the perceptual hash sequence. Experiments show that the proposed algorithm is robust to content-preserving operations without sacrificing efficiency, and that it satisfies the real-time requirement of speech communication.
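The later stages of this second algorithm (normalization, NMF, quantization) can be illustrated with a minimal sketch. Several details here are assumptions, not taken from the abstract: the ISP parameter matrix is assumed to be already computed (frames by parameters), the normalized matrix is shifted by its minimum so that NMF receives non-negative input, the frames-by-rank factor is treated as the "weight matrix", and each weight is quantized to one bit by comparing it with the median of its column. The NMF itself uses the standard Lee-Seung multiplicative updates.

```python
import random

def cmvn(matrix):
    """Normalize each column (one parameter across frames) to zero mean, unit variance."""
    n = len(matrix)
    out_cols = []
    for col in zip(*matrix):
        mean = sum(col) / n
        var = sum((x - mean) ** 2 for x in col) / n
        std = var ** 0.5 or 1.0  # guard against constant columns
        out_cols.append([(x - mean) / std for x in col])
    return [list(row) for row in zip(*out_cols)]

def transpose(a):
    return [list(r) for r in zip(*a)]

def matmul(a, b):
    bt = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def nmf(v, rank, iters=200, seed=0):
    """Lee-Seung multiplicative updates: V (n x m) ~ W (n x rank) * H (rank x m)."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    eps = 1e-9
    for _ in range(iters):
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(rank)]
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(n)]
    return w, h

def isp_hash(params, rank=1):
    """CMVN -> shift to non-negative (assumption) -> NMF -> median quantization (assumption)."""
    normed = cmvn(params)
    low = min(min(r) for r in normed)
    shifted = [[x - low for x in r] for r in normed]
    w, _ = nmf(shifted, rank)
    bits = []
    for col in zip(*w):
        med = sorted(col)[len(col) // 2]
        bits.extend(1 if x > med else 0 for x in col)
    return bits

# Usage with a random stand-in for a 20-frame, 16-parameter ISP matrix.
rng = random.Random(1)
params = [[rng.uniform(0.0, 3.0) for _ in range(16)] for _ in range(20)]
bits = isp_hash(params)
```

With `rank=1` the sketch yields one hash bit per frame; a larger rank gives a longer hash at the cost of more computation per update.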
With the rapid development of portable digital audio players, copyright protection of music faces a severe challenge, and audio watermarking is an efficient way to address it. To improve real-time processing performance, a new real-time audio watermarking algorithm based on the fast MCLT is presented. The algorithm uses a quantization method, which allows blind watermark detection. Experimental results show that the algorithm, applied to copyright protection, meets real-time processing requirements. The watermarked audio shows markedly better robustness and transparency than with traditional methods. Furthermore, the watermark resists MP3 compression attacks effectively.
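Quantization-based embedding is what makes blind detection possible: the detector needs only the quantization step, not the original audio. The sketch below illustrates the idea with quantization index modulation (QIM), which is an assumption about the kind of quantization meant, since the abstract does not name the scheme. It operates on a plain list of transform coefficients, taken here as already computed (in the paper's setting these would be MCLT coefficients); the step size `delta` is a hypothetical parameter.

```python
import random

def qim_embed(coeffs, bits, delta=0.5):
    """Move each coefficient to the nearest point of the lattice selected by its bit.

    Bit 0 uses multiples of delta; bit 1 uses the same lattice shifted by delta / 2.
    """
    out = []
    for c, b in zip(coeffs, bits):
        offset = delta / 2.0 if b else 0.0
        out.append(delta * round((c - offset) / delta) + offset)
    return out

def qim_extract(coeffs, delta=0.5):
    """Blind detection: pick whichever lattice lies closer to each coefficient."""
    bits = []
    for c in coeffs:
        d0 = abs(c - delta * round(c / delta))
        d1 = abs(c - (delta * round((c - delta / 2.0) / delta) + delta / 2.0))
        bits.append(1 if d1 < d0 else 0)
    return bits

# Usage: embed 32 bits, perturb the result slightly, and recover the bits
# without access to the original coefficients.
rng = random.Random(0)
coeffs = [rng.uniform(-5.0, 5.0) for _ in range(32)]
payload = [rng.randint(0, 1) for _ in range(32)]
marked = qim_embed(coeffs, payload)
noisy = [c + rng.uniform(-0.1, 0.1) for c in marked]
recovered = qim_extract(noisy)
```

The step size trades transparency against robustness: each coefficient moves by at most `delta / 2`, and extraction stays correct as long as later distortion remains below `delta / 4`.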