Voice activation system using acoustic event detection and keyword/speaker recognition

Cho, Namgook; Kim, Taeyoon; Shin, Sangwook; Kim, Eunkyoung

doi:10.1109/icce.2011.5722550

Cited by 3 publications

(1 citation statement)

References 2 publications


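“…Therefore, speech-recognition-based methods of this kind are more stable and easier to extend, and have attracted more attention. For example, pre-screening at the front end can discard abnormal speech segments [20], which lightens the load on the back-end recognizer and improves efficiency; to the acoustic features [11, 17–19] or prosodic features [21] [22] of speech recognition. Besides delay-and-sum beamforming, there are several fixed beamforming methods with fixed directivity [23,24], which in certain situations suppress the effects of reverberation more effectively.…”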

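Section: This experiment fully simulates real far-field speech conditions, such as a home scenario, with two types of noise each acting as interference Amr2unclassified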

Speech-picking technology with selective attention ability (具有选择注意能力的语音拾取技术)

Guo, Wu, Fu et al. 2015

Sci. Sin.-Inf.


A natural speech-picking mode is currently badly needed in speech communication and in human-computer interaction systems. However, speech is usually corrupted by attenuation, multi-path propagation, and various interferences before it is received, especially when several speech systems and users are present. Practical speech systems therefore need to pick the correct speech signal in complex environments. In this paper, the mechanism of auditory attention is simulated through a target speech-picking system in which a priori knowledge of the target speech and of the interfering sound sources is used to detect and enhance the target speech. Microphone arrays, wake-up words, target speech detection, speech enhancement, and dereverberation are combined in this strategy to achieve robust target speech-picking.



Deep neural network based wake-up-word speech recognition with two-stage detection

2017

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)


Audio Scenarios Detection Technique

Kadam, Kagalkar 2015

IJCA


The research objective is to develop a framework for automatic sound recognition. In this framework the main task is to take any input audio stream, analyze it, and predict the likelihood that different sounds appear in it. A further aim is to develop and commercially deploy a flexible audio search engine. The algorithm is noise- and distortion-resistant, computationally efficient, and massively scalable, capable of quickly identifying a short segment of an audio stream captured through a phone microphone in the presence of foreground voices and other dominant noise, and through voice codec compression, out of a database of accessible tracks. The algorithm uses a combinatorially hashed time-frequency constellation analysis of the audio, yielding properties such as transparency, in which multiple tracks mixed together may each be identified.
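The combinatorial hashing step named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the spectral peaks (the "constellation") have already been extracted as (time frame, frequency bin) pairs, and the function names `pair_hashes` and `best_offset`, the parameters `fan_out` and `max_dt`, and the toy data are all hypothetical.

```python
from collections import Counter

def pair_hashes(peaks, fan_out=3, max_dt=50):
    """Pair each anchor peak with up to `fan_out` later peaks.

    Returns a list of ((f1, f2, dt), t1) entries, where the tuple
    (f1, f2, dt) acts as the hash and t1 is the anchor time.
    """
    peaks = sorted(peaks)  # sort by time, then by frequency
    out = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt > max_dt:
                break      # later peaks are even further away
            if dt <= 0:
                continue   # skip peaks in the same time frame
            out.append(((f1, f2, dt), t1))
            paired += 1
            if paired == fan_out:
                break
    return out

def best_offset(db_hashes, query_hashes):
    """Vote on the time offset between matching hashes.

    Returns (offset, votes) for the most common offset,
    or (None, 0) when no hash matches.
    """
    index = {}
    for h, t in db_hashes:
        index.setdefault(h, []).append(t)
    votes = Counter(
        t_db - t_q
        for h, t_q in query_hashes
        for t_db in index.get(h, [])
    )
    return votes.most_common(1)[0] if votes else (None, 0)

# Toy constellation for one track: (time frame, frequency bin).
track = [(0, 10), (5, 20), (9, 15), (14, 30), (20, 12), (26, 25)]
# Query: the same peaks from frame 5 onward, re-timed to start at 0.
query = [(t - 5, f) for t, f in track if t >= 5]

offset, votes = best_offset(pair_hashes(track), pair_hashes(query))
```

Because each hash depends only on two frequencies and their time difference, a query excerpt matches the stored track wherever it starts, and the consistent offset among the matches indicates where the excerpt sits in the track.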
