An Acoustic Source Localization Method Using a Drone-Mounted Phased Microphone Array

Go, Yeong-Ju; Choi, Jong‐Soo

doi:10.3390/drones5030075

Cited by 12 publications

(6 citation statements)

References 30 publications

(29 reference statements)


“…Then, the inflection point p i3D of the histogram H 3D is determined as the reference value, as in Equation (7).…”

Section: Hist-music-3d (mentioning)

confidence: 99%

“…In drone audition research, sound source localization (SSL) and sound source separation/enhancement (SSS/SSE) are often addressed. For example, Go et al attached four microphone arrays to the underside of a drone and localized the source of impulse sounds using beamforming [7]. Wang et al mounted a microphone array to the top of a drone and proposed a deep neural network (DNN)-based speech enhancement method [8].…”

Section: Introduction (mentioning)

confidence: 99%

“…The first is the development of microphone arrays. To increase the noise tolerance for SSL, microphone arrays consisting of different types and arrangements of microphones have been proposed [7,10-12]. The second is the development of SSL methods.…”

Section: Introduction (mentioning)

confidence: 99%


Proposal of Practical Sound Source Localization Method Using Histogram and Frequency Information of Spatial Spectrum for Drone Audition

Hoshiba, Komatsuzaki, Iwatsuki

2024

Drones


A technology to search for victims in disaster areas by localizing human-related sound sources, such as voices and emergency whistles, using a drone-embedded microphone array was researched. One of the challenges is the development of sound source localization methods. Such a sound-based search method requires a high resolution, a high tolerance for quickly changing dynamic ego-noise, a large search range, high real-time performance, and high versatility. In this paper, we propose a novel sound source localization method based on multiple signal classification for victim search using a drone-embedded microphone array to satisfy these requirements. In the proposed method, the ego-noise and target sound components are extracted using the histogram information of the three-dimensional spatial spectrum (azimuth, elevation, and frequency) at the current time, and they are separated using continuity. The direction of arrival of the target sound is estimated from the separated target sound component. Since this method is processed with only simple calculations and does not use previous information, all requirements can be satisfied simultaneously. Evaluation experiments using recorded sound in a real outdoor environment show that the localization performance of the proposed method was higher than that of the existing and previously proposed methods, indicating the usefulness of the proposed method.


“…The beamforming method consists in processing a spatial-temporal signal recorded by a microphone array. Each microphone of the acoustic array, having a specific position relative to its centre, is assigned a time delay which allows it to focus the signal beam in the direction of acoustic wave propagation [5,19,23-27]. By using a time delay, the measured signal is the same for all microphones used.…”

Section: Identification Of Main Noise Sources (mentioning)

confidence: 99%

Acoustic Signature and Impact of High-Speed Railway Vehicles in the Vicinity of Transport Routes

Polak, Korzeb

2022

Energies


In this paper, an attempt is undertaken to identify the acoustic signature of railway vehicles travelling at 200 km/h. In the framework of conducted experimental research, test fields were determined, measurement apparatus was selected and a methodology for making measurements was specified, including the assessment of noise emission on curved and straight track for electric multiple units of Alstom type ETR610-series ED250, the so-called Pendolino. The measurements were made with the use of an acoustic camera and a 4 × 2 microphone array, including four equipped measurement points and two microphones located at the level of the head of the rail and at a height of 4 m above this level. As a result of the conducted experimental research, the dominant noise sources were identified and amplitude–frequency characteristics for these sources were determined by dividing the spectrum into one-third octave bands in the range from 20 Hz to 20 kHz. The paper also considers issues related to the verification of selected models of noise assessment in terms of their most accurate reflection of the phenomenon of propagation in close surroundings. On the basis of conducted experimental studies, the behaviour of selected models describing the change of sound level with frequency division into one-third octave bands as a function of variable distance of observer from the railway line on which high-speed railway vehicles are operated was verified. In addition, the author’s propagation model is presented together with a database built within the scope of the study, containing the actual waveforms in the time and frequency domain.
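The citation statement for this entry describes beamforming as assigning each microphone a time delay, based on its position relative to the array centre, so that the signals line up for a chosen direction. A minimal delay-and-sum sketch of that idea follows. It is an illustration only, not the method of any paper listed here. Everything in it is an assumption made for the example: the four-element linear array, the far-field plane-wave model, the 343 m/s speed of sound, the 1 kHz test pulse, and all function names.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at about 20 degrees C


def steering_delays(mic_x, angle_deg, c=SPEED_OF_SOUND):
    """Arrival delay (s) at each microphone, relative to the array centre,
    for a far-field plane wave from angle_deg (0 = broadside)."""
    s = math.sin(math.radians(angle_deg))
    return [-x * s / c for x in mic_x]


def sample_at(signal, index):
    """Linearly interpolate a sampled signal at a fractional index."""
    i = math.floor(index)
    if i < 0 or i + 1 >= len(signal):
        return 0.0  # treat samples outside the record as silence
    frac = index - i
    return (1.0 - frac) * signal[i] + frac * signal[i + 1]


def delay_and_sum_power(channels, mic_x, angle_deg, fs):
    """Mean power of the averaged output when the array is steered to angle_deg."""
    delays = steering_delays(mic_x, angle_deg)
    n = len(channels[0])
    total = 0.0
    for k in range(n):
        # Shift each channel by its steering delay so a wave arriving from
        # angle_deg is time-aligned across microphones before summation.
        y = sum(sample_at(ch, k + d * fs) for ch, d in zip(channels, delays))
        total += (y / len(channels)) ** 2
    return total / n


def localize(channels, mic_x, fs, angles=range(-90, 91)):
    """Return the scan angle (degrees) with the highest steered output power."""
    return max(angles, key=lambda a: delay_and_sum_power(channels, mic_x, a, fs))


def simulate(mic_x, source_angle_deg, fs, n_samples, tone_hz=1000.0):
    """Hypothetical test data: a Gaussian-windowed tone seen by each microphone."""
    def pulse(t):
        envelope = math.exp(-(((t - 0.005) / 0.0015) ** 2))
        return envelope * math.sin(2.0 * math.pi * tone_hz * t)

    delays = steering_delays(mic_x, source_angle_deg)
    return [[pulse(k / fs - d) for k in range(n_samples)] for d in delays]


fs = 48000
mic_x = [-0.15, -0.05, 0.05, 0.15]  # microphone x-positions in metres
channels = simulate(mic_x, 30.0, fs, 480)
estimate = localize(channels, mic_x, fs)
print("estimated direction (deg):", estimate)
```

With the simulated source at 30 degrees, the scan should peak close to that angle. The main lobe of such a small array is wide, so the estimate is only reliable to within a few degrees. Real recordings would add ego-noise and reflections that this sketch ignores.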


“…Uni-modal approaches, such as spatial likelihood methods and spatial filtering methods, use the microphone signal only. Spatial likelihood methods are based on traditional source localization algorithms, such as steered response power (SRP) and steered response power with phase transform (SRP-PHAT) [3], [15]-[18], and multiple signal classification (MUSIC) [9], [19], [20]. While being widely used for ground robot audition, the performance of these algorithms typically degrades with drone platforms due to the strong ego-noise and hence the low SNR [21].…”

Section: Introduction (mentioning)

confidence: 99%

Deep-Learning-Assisted Sound Source Localization From a Flying Drone

Wang, Cavallaro

2022

IEEE Sensors J.


Sound source localization from a flying drone is a challenging task due to the strong ego-noise from rotating motors and propellers as well as the movement of the drone and the sound sources. To address this challenge, we propose a deep learning-based framework that integrates single-channel noise reduction and multi-channel source localization. In this framework we suppress the ego-noise and estimate a time-frequency soft ratio mask with a single-channel deep neural network (DNN). Then we design two downstream multi-channel source localization algorithms, based on Steered Response Power (SRP-DNN) and Time-Frequency Spatial filtering (TFS-DNN). The main novelty lies in the proposed TFS-DNN approach, which estimates the presence probability of the target sound at individual time-frequency bins by combining the DNN-inferred soft ratio mask and the instantaneous direction of arrival of the sound received by the microphone array. The time-frequency presence probability of the target sound is then used to design a set of spatial filters to construct a spatial likelihood map for source localization. By jointly exploiting spectral and spatial information, TFS-DNN robustly processes signals in short segments (e.g. 0.5 seconds) in dynamic and low signal-noise-ratio scenarios (e.g. SNR -20 dB). Results on real and simulated data in a variety of scenarios (static sources, moving sources and moving drones) indicate the advantage of TFS-DNN over competing methods, including SRP-DNN and the state-of-the-art time-frequency spatial filtering.
