ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746271
|View full text |Cite
|
Sign up to set email alerts
|

Modeling of Pre-Trained Neural Network Embeddings Learned From Raw Waveform for COVID-19 Infection Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…In this paper, we focus on the first track of the challenge, ExVo Multi-Task learning. Taking inspirations from recent works on the use of embeddings from the pre-trained networks for various speech procession tasks including paralinguistic tasks (Keesing et al, 2021;Yang et al, 2021;Mostaani et al, 2022;Srinivasan et al, 2022), we investigate the utility of neural embeddings for speakers' emotion intensity, native country and age estimation. In that regard, as illustrated in Figure 1, we compare two types of neural embedding extraction approaches: (a) neural embeddings extracted from neural networks trained in self-supervised learning (SSL) setting and (b) neural embeddings extracted from neural networks trained on auxiliary out-of-domain tasks such as, SER, phone classification and on in-domain ExVo challenge task.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we focus on the first track of the challenge, ExVo Multi-Task learning. Taking inspirations from recent works on the use of embeddings from the pre-trained networks for various speech procession tasks including paralinguistic tasks (Keesing et al, 2021;Yang et al, 2021;Mostaani et al, 2022;Srinivasan et al, 2022), we investigate the utility of neural embeddings for speakers' emotion intensity, native country and age estimation. In that regard, as illustrated in Figure 1, we compare two types of neural embedding extraction approaches: (a) neural embeddings extracted from neural networks trained in self-supervised learning (SSL) setting and (b) neural embeddings extracted from neural networks trained on auxiliary out-of-domain tasks such as, SER, phone classification and on in-domain ExVo challenge task.…”
Section: Introductionmentioning
confidence: 99%