2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI) 2022
DOI: 10.1109/bhi56158.2022.9926795
|View full text |Cite
|
Sign up to set email alerts
|

Investigating Graph-based Features for Speech Emotion Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 32 publications
0
9
0
Order By: Relevance
“…Specifically, our speech analysis relies on the graph-based theory, including structural and statistical information of a time series. Thus, we firstly introduce the extraction of the structural graph-based speech representation through the Visibility Graph (VG) theory [13]. After that, we extend our description to the statistical information computational approach.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Specifically, our speech analysis relies on the graph-based theory, including structural and statistical information of a time series. Thus, we firstly introduce the extraction of the structural graph-based speech representation through the Visibility Graph (VG) theory [13]. After that, we extend our description to the statistical information computational approach.…”
Section: Methodsmentioning
confidence: 99%
“…The VG theory has been shown to be an appropriate tool for investigating and further determining the structural interrelations among the samples of a time series, which consists of positive defined elements [13]. Inspired by [12], we have applied VG theory to the SER problem in our previous work [13], which provided evidence regarding the effectiveness of VG theory in the analysis of speech signals. However, as discussed in [12], the VG has two disadvantages: first, it does not consider the effect of uneven sampling; second, it cannot capture the time series changes below a zero baseline.…”
Section: Structural Graph-based Speech Informationmentioning
confidence: 99%
See 2 more Smart Citations
“…At present, the application of graph neural networks in the field of speech technology still has some limitations [25], but some scholars have verified the advantages of graph convolution in the field of speech technology and the possibility of being widely used through research, such as conversational speech recognition [26], sentence-level [27] / conversation-level speech emotion recognition [28], speech enhancement [29], and Q &A rewriting [30]. The methods of graph construction can be divided into sample point-based, frame-based, speech channel-based, and historical dialogue-based approaches, as shown in Fig.…”
Section: Ser Based On Gnnsmentioning
confidence: 99%