2022
DOI: 10.48550/arxiv.2203.04287
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation

Abstract: This paper proposes a simple transfer learning baseline for sign language translation. Existing sign language datasets (e.g. PHOENIX-2014T, CSL-Daily) contain only about 10K-20K pairs of sign videos, gloss annotations and texts, which are an order of magnitude smaller than typical parallel data for training spoken language translation models. Data is thus a bottleneck for training effective sign language translation models. To mitigate this problem, we propose to progressively pretrain the model from generaldo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 30 publications
(90 reference statements)
0
2
0
Order By: Relevance
“…representations to predict the action category. In contrast, many practical applications, e.g., sign language translation [4,5,13], robotic imitation learning [29,36], action alignment [6,21,23] and phase classification [16,27,37,50] require algorithms having ability to model long videos with hundreds of frames and extract frame-wise representations rather than the global features (Fig. 1).…”
Section: Liquid Starts Exitingmentioning
confidence: 99%
“…representations to predict the action category. In contrast, many practical applications, e.g., sign language translation [4,5,13], robotic imitation learning [29,36], action alignment [6,21,23] and phase classification [16,27,37,50] require algorithms having ability to model long videos with hundreds of frames and extract frame-wise representations rather than the global features (Fig. 1).…”
Section: Liquid Starts Exitingmentioning
confidence: 99%
“…Authors of [16] apply transfer learning for end-to-end sign language translation on two different datasets. They used GPT2 pre-trained model on the German Sign Language corpus and got remarkable results.…”
Section: Related Workmentioning
confidence: 99%