2023
DOI: 10.1016/j.inffus.2022.09.006
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal audio-visual information fusion using canonical-correlated Graph Neural Network for energy-efficient speech enhancement

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 24 publications
(3 citation statements)
references
References 33 publications
0
3
0
Order By: Relevance
“…Battery life is also an important issue in the design of multi modal hearing aids as different components of these devices needs to be charged when used for an extended period of service. Though recent work from [17,18] has focused on developing energy efficient machine learning models, the issue of cost is very important for deploying AV HAT technologies.…”
Section: Cost and Battery Lifementioning
confidence: 99%
See 1 more Smart Citation
“…Battery life is also an important issue in the design of multi modal hearing aids as different components of these devices needs to be charged when used for an extended period of service. Though recent work from [17,18] has focused on developing energy efficient machine learning models, the issue of cost is very important for deploying AV HAT technologies.…”
Section: Cost and Battery Lifementioning
confidence: 99%
“…• e) safeguard users' rights and provide user control (art. [14][15][16][17][18][19][20][21] that allows them to request for the removal of personal data, have it transferred to another organisation or object to the processing of their personal data for certain reasons;…”
Section: Privacy and Security Concernsmentioning
confidence: 99%
“…Traditional SR systems rely on audio signals captured by microphones, which can be affected by background noise, distance, and some environmental factors [2], [3]. SR using audio-visual signals, which involves combining information from both audio and visual cues to identify or verify the speaker's identity, has gained increasing attention in recent years [4], [5]. While SR using audio-visual signals has the potential to improve the accuracy and reliability of SR systems [6]- [8], it is important to consider the limitations of this technology [9].…”
Section: Introductionmentioning
confidence: 99%