Fundamental frequency (F0) estimation for quasiharmonic signals is an important task in music signal processing. Many previously developed techniques have suffered from unsatisfactory performance due to ambiguous spectra, noise perturbations, wide frequency range, vibrato, and other common artifacts encountered in musical signals. In this paper a new two-way mismatch (TWM) procedure for estimating F0 is described which may lead to improved results in this area. This computer-based method uses the quasiharmonic assumption to guide a search for F0 based on the short-time spectra of an input signal. The estimated F0 is chosen to minimize discrepancies between measured partial frequencies and harmonic frequencies generated by trial values of F0. For each trial F0, mismatches between the harmonics generated and the measured partial frequencies are averaged over a fixed subset of the available partials. A weighting scheme is used to reduce the susceptibility of the procedure to the presence of noise or absence of certain partials in the spectral data. Graphs of F0 estimate versus time for several representative recorded solo musical instrument and voice passages are presented. Some special strategies for extending the TWM procedure for F0 estimations of two simultaneous voices in duet recordings are also discussed.
Automatic off-line classification and recognition of bird vocalizations has been a subject of interest to ornithologists and pattern detection researchers for many years. Several new applications, including bird vocalization classification for aircraft bird strike avoidance, will require real time classification in the presence of noise and other disturbances. The vocalizations of many common bird species can be represented using a sum-of-sinusoids model. An experiment using computer software to perform peak tracking of spectral analysis data demonstrates the usefulness of the sum-of-sinusoids model for rapid automatic recognition of isolated bird syllables. The technique derives a set of spectral features by time-variant analysis of the recorded bird vocalizations, then performs a calculation of the degree to which the derived parameters match a set of stored templates that were determined from a set of reference bird vocalizations. The results of this relatively simple technique are favorable for both clean and noisy recordings.
Audio recordings of gunshots can provide information about the gun location with respect to the microphone(s), the speed and trajectory of the projectile, and in some cases the type of firearm and ammunition. Recordings obtained under carefully controlled conditions can be well-modeled by geometrical acoustics. Special acoustic processing systems for real time gunshot detection and localization are used by the military and law enforcement agencies for sniper detection. Forensic analysis of audio recordings is also used to provide evidence in criminal and civil cases. This paper reviews the distinctive features and limitations of acoustic gunshot analysis using DSP techniques.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.