This paper describes a digital cepstrum-pitch detector designed to work in real time on a 12-bit binary machine. Spectrum and cepstrum analysis of raw speech data is accomplished through the use of a single complex Fast Fourier Transform (FFT) algorithm. Pitch detection is accomplished by a cepstrum-peak scanning logic designed to locate the position of a peak corresponding to the pitch period. Since other peaks may exist in the data, logic is required to differentiate these peaks from the desired one. Because of the limited word size of the machine, special processing algorithms were developed to perform logging and to avoid the squaring of variables. Various logical operations performed on the spectrum and cepstrum functions in voicing detection and in the extraction of pitch are presented and discussed. A demonstration of the performance of this pitch detector when operated with a 2400-bit/sec digital vocoder will be presented by means of a tape recording. [Work supported under contract.]
In this paper, we describe a real-time all-digital spectrum channel vocoder implemented on a programmable signal processor. The vocoder system efficiently utilizes a radix four, 64-point FFT algorithm for spectral analysis, while the pitch extractor selects the pitch period using decision logic on the average magnitude difference function (AMDF) of the input speech. The receiver linearly interpolates both the pitch period and the synthesized impulse response which is generated by using the inverse Fourier transform. The vocoder frame updating rate is 22 msec, and the input speech sampling rate is 7040 samples/sec. The configuration of the vocoder and the logic of the AMDF pitch extractor is discussed and the system demonstrated with audio tapes.
This paper presents an SNR analysis of a bilateral log compandor that was used in a portable wide-bandwidth digital re cording system designed to generate field recordings of wide-band atmospheric noise in the MF radio spectrum [1], [2]. Log-compression amplifiers are used in the / and Q channels of the recorder receiver function to compress dynamic range from 120 to about 50 dB. Fol lowing A/D conversion, the compressed / and Q channel signals are digitally recorded on magnetic tape. The recorded data are then transcribed onto magnetic computer tape for subsequent data analysis. Inverse log compression of the transcribed samples is accomplished via a software log-expansion algorithm exactly matched to the inverse characteristics of each of the / and Q channel log-compression am plifiers.The present analysis shows that samples of the / and Q channel signals can be generated from samples of the log-compressed signals by inverse log compression without concern for aliasing error. A measure of the accuracy of the resulting sampled sequence is then developed analytically in closed form in terms of an SNR measure for the decompressed samples. INTRODUCTION T HIS PAPER presents an SNR analysis of bipolar log companding [3] to control dynamic range in a digital recording/data analysis system. While other analyses of this type have been presented previously [4]-[8], the present analysis, in addition to providing a closed-form analytical so lution for the overall compandor SNR, demonstrates a useful result in the application of nonlinear no-memory processing of analog band-limited signals prior to A/D conversion.Specifically, it is shown that a sampled representation of the unprocessed analog signal can be generated from samples of the log-compressed signal (to within a specified level of quantization accuracy) by inverse log compression without concern for aliasing error. Briefly, the system being considered here consists of an analog bipolar log-compression amplifier followed by an A/D converter and high-speed digital tape recorder. The digitally recorded samples are transcribed onto magnetic (computer) tape for subsequent analysis. Logarithmic expansion of the sampled data is applied via an inverse software ^-"décom pression" algorithm prior to data analysis. This algorithm is assumed to be an exact match to the inverse characteristic of the analog log-compression amplifier.The thrust of this paper is to present an SNR analysis of the overall log-companding process that takes place in each / and Q channel between a receiver LPF (low-pass filter) output and the output of the inverse software log-compression algorithm that provides the Ik and Qk samples used in the subsequent data analysis operations.In the present analysis, the total dynamic range of the in stantaneous signal range into the log compressor is assumed to be 120 dB. The upper 90 dB of this range falls in the loga rithmic portion of the log compressor resulting in an effective compression from 90 dB to approximately 21 dB for this range. The lower 30 dB o...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.