2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01135
|View full text |Cite
|
Sign up to set email alerts
|

Aligning Subtitles in Sign Language Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 42 publications
0
11
0
Order By: Relevance
“…With the advancement of computer vision techniques, there is increasing attention on collecting real-life SLT datasets. Many such datasets (Camgoz et al, 2018(Camgoz et al, , 2021Albanie et al, 2021) are drawn from TV programs accompanied by sign language interpretation. Despite being highly realistic compared to studio datasets, they are generally limited to a specific domain.…”
Section: Datasets For Sltmentioning
confidence: 99%
See 3 more Smart Citations
“…With the advancement of computer vision techniques, there is increasing attention on collecting real-life SLT datasets. Many such datasets (Camgoz et al, 2018(Camgoz et al, , 2021Albanie et al, 2021) are drawn from TV programs accompanied by sign language interpretation. Despite being highly realistic compared to studio datasets, they are generally limited to a specific domain.…”
Section: Datasets For Sltmentioning
confidence: 99%
“…For example, the popular Phoenix-2014T DGS-German benchmark contains signed German weather forecasts and includes only 11 hours of signing videos from 9 signers. The largest real-world sign language corpus we are aware of is BOBSL (Albanie et al, 2021), which consists of 1,467 hours of BBC broadcasts from 39 signers interpreted into British Sign Language (BSL). However, access to the videos is restricted, and the data cannot be used by independent researchers or commercial organizations.…”
Section: Datasets For Sltmentioning
confidence: 99%
See 2 more Smart Citations
“…Visual grounding. Our work is also related to tasks such as natural language grounding in videos [14,24,25,29,40,68,71,72] and subtitle alignment in sign language clips [12]. Transformers.…”
Section: Related Workmentioning
confidence: 99%