2022
DOI: 10.3390/s22124650
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Sound Source Localization for Low-Power IoT Devices Based on Multi-Stream CNN

Abstract: Voice-activated artificial intelligence (AI) technology has advanced rapidly and is being adopted in various devices such as smart speakers and display products, which enable users to multitask without touching the devices. However, most devices equipped with cameras and displays lack mobility; therefore, users cannot avoid touching them for face-to-face interactions, which contradicts the voice-activated AI philosophy. In this paper, we propose a deep neural network-based real-time sound source localization (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…The authors investigated voice-activated AI technology and proposed a deep-neural-network-based real-time sound source localization (SSL) model for low-power IoT devices. The authors used multichannel acoustic data to parallelize convolutional neural network layers in the form of multiple streams in order to capture unique delay patterns in the low-, mid-, and high-frequency ranges and estimate the fine and coarse location of voices [ 58 ].…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors investigated voice-activated AI technology and proposed a deep-neural-network-based real-time sound source localization (SSL) model for low-power IoT devices. The authors used multichannel acoustic data to parallelize convolutional neural network layers in the form of multiple streams in order to capture unique delay patterns in the low-, mid-, and high-frequency ranges and estimate the fine and coarse location of voices [ 58 ].…”
Section: Literature Reviewmentioning
confidence: 99%
“… Classification of the reviewed literature, with emphasis on the learning algorithms used [ 28 , 29 , 31 , 32 , 33 , 34 , 37 , 38 , 39 , 40 , 41 , 43 , 44 , 46 , 47 , 49 , 50 , 51 , 52 , 56 , 57 , 58 , 59 , 60 , 61 , 62 ]. …”
Section: Figurementioning
confidence: 99%
“…As an alternative to signal processing based DoA estimation methods, end-to-end ML models eliminate feature extraction step and enable lowcomputation DoA prediction. However, the best offerings of the current literature focus on platforms with significantly higher power consumption than battery-powered nodes used in low-cost IoT-based noise monitoring systems [45], [46].…”
Section: Related Workmentioning
confidence: 99%
“…The flying drone localizes the sound source from ambient noise and frequent movement [ 19 ]. The low-power device with a microphone array derives the AoA for the sound source in the application of various human interaction [ 20 ]. The complex environment from the indoor condition is challenged for the SSL system by Machhamer [ 21 ] and Zhang [ 22 ].…”
Section: Introductionmentioning
confidence: 99%