ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053367
|View full text |Cite
|
Sign up to set email alerts
|

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning

Abstract: In this work, we investigated the teacher-student training paradigm to train a fully learnable multi-channel acoustic model for far-field automatic speech recognition (ASR). Using a large offline teacher model trained on beamformed audio, we trained a simpler multi-channel student acoustic model used in the speech recognition system. For the student, both multi-channel feature extraction layers and the higher classification layers were jointly trained using the logits from the teacher model. In our experiments… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…The acoustic beamformers in [48] are obtained via semisupervised learning (SSL), where both labeled and unlabeled data are used. When a small set of labeled data are available in addition to a large volume of unlabeled data, using both sets in SSL is more advantageous than SL alone.…”
Section: Bb Rf Cmentioning
confidence: 99%
“…The acoustic beamformers in [48] are obtained via semisupervised learning (SSL), where both labeled and unlabeled data are used. When a small set of labeled data are available in addition to a large volume of unlabeled data, using both sets in SSL is more advantageous than SL alone.…”
Section: Bb Rf Cmentioning
confidence: 99%