2008
DOI: 10.1007/s12193-008-0007-z
|View full text |Cite
|
Sign up to set email alerts
|

Speech and sliding text aided sign retrieval from hearing impaired sign news videos

Abstract: The objective of this study is to automatically extract annotated sign data from the broadcast news recordings for the hearing impaired. These recordings present an excellent source for automatically generating annotated data: In news for the hearing impaired, the speaker also signs with the hands as she talks. On top of this, there is also corresponding sliding text superimposed on the video. The video of the signer can be segmented via the help of either the speech or both the speech and the text, generating… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2009
2009
2012
2012

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…We threshold and scale the probability of the model to obtain a 256 × 256 × 256 look-up-table with values from 0 to 255. We were inspired by the work [15] which we refer to for further details.…”
Section: Skin Color Segmentationmentioning
confidence: 99%
“…We threshold and scale the probability of the model to obtain a 256 × 256 × 256 look-up-table with values from 0 to 255. We were inspired by the work [15] which we refer to for further details.…”
Section: Skin Color Segmentationmentioning
confidence: 99%
“…Therefore studies in the literature make use of different cues and restrictions to perform hand segmentation. Popular methods for hand segmentation include using skin color cues [5], motion cues [37], shape cues [9] or depth information [39].…”
Section: Fingerspelling Recognitionmentioning
confidence: 99%
“…We convert the first frame of each video clip into a grey-level image and use its horizontal projection to detect the vertical position of the 20 pixel high 352 pixel wide text band. As proposed in [4], this band lies between the hills of the horizontal projection which is a simple summation of the rows of the grey-level image. Text band hosts a logo towards the left end of the region.…”
Section: A Preprocessingmentioning
confidence: 99%
“…[3] proposed multi-frame combination technique as a robust video text feature extraction method. [4], [5] finds the text location from the horizontal projection of the grey-level image.…”
Section: Introductionmentioning
confidence: 99%