2021
DOI: 10.1016/j.csl.2020.101131

LIS-Net: An end-to-end light interior search network for speech command recognition

Cited by 7 publications
(3 citation statements)
References 12 publications
“…MS is used in conjunction with CNN, and it can distinguish vowel sounds, although the aggregate dataset is more complex due to many dimensions, such as various noises, ages, accents, environments, and physical characteristics (i.e., female vs. male voices). In the same way [46], MS was applied to the speech command recognition (SCR) task and achieved good performance. MS images with a feature size of 125 × 80 × 1 were used as acoustic features.…”
Section: Discussion (mentioning)
confidence: 99%
“…Mel spectrograms (MS) were converted from the raw speech signal (16 kHz) and then applied to the speech command recognition (SCR) task [46]. MS images with the feature size of 125 × 80 × 1 were used as acoustic features.…”
Section: Acoustic Features (mentioning)
confidence: 99%
“…Speech command-based applications are widely used in various fields and have significantly enhanced human-computer interaction [6]. Speech recognition interfaces are integrated into digital devices, e-commerce, e-learning, the Internet of Things, robotics, and medical equipment to facilitate control and monitor through speech input [7,8].…”
Section: Introduction (mentioning)
confidence: 99%