2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2021
DOI: 10.1109/asru51503.2021.9688052
|View full text |Cite
|
Sign up to set email alerts
|

Target Language Extraction at Multilingual Cocktail Parties

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…Reproducing a similar mechanism would require TSE systems that operate with semantic clues, which introduces novel challenges concerning how to represent semantic information and exploit it within a TSE system. Some works have started to explore this direction, such as conditioning on languages [59] or more abstract concepts [60].…”
Section: Exploring Other Cluesmentioning
confidence: 99%
“…Reproducing a similar mechanism would require TSE systems that operate with semantic clues, which introduces novel challenges concerning how to represent semantic information and exploit it within a TSE system. Some works have started to explore this direction, such as conditioning on languages [59] or more abstract concepts [60].…”
Section: Exploring Other Cluesmentioning
confidence: 99%
“…Research on target speech extraction has conventionally focused on extracting the speech of a target speaker in a mixture of overlapping speakers by exploiting physical clues such as pre-recorded enrollment utterances [46], direction information [12], or video [1,9,28,40] to identify the target speaker. Meanwhile, we can use semantic clues, such as language [3,42] or content of speech, to focus our attention on the conversation we want to hear. For example, if our name is mentioned or the topic of a conversation nearby interests us, we turn our attention to that speaker.…”
Section: Introductionmentioning
confidence: 99%