2015 IEEE International Conference on Computer Vision (ICCV) 2015
DOI: 10.1109/iccv.2015.306
|View full text |Cite
|
Sign up to set email alerts
|

Common Subspace for Model and Similarity: Phrase Learning for Caption Generation from Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
24
0
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 56 publications
(26 citation statements)
references
References 25 publications
1
24
0
1
Order By: Relevance
“…A constrained language model is then used to generate from this representation. A conceptually related approach is pursued by Ushiku et al (2015): the authors use a common subspace model which maps all feature vectors associated with the same phrase into nearby regions of the space. For generation, a beamsearch based decoder or templates are used.…”
Section: Description As a Retrieval In Multimodal Spacementioning
confidence: 99%
“…A constrained language model is then used to generate from this representation. A conceptually related approach is pursued by Ushiku et al (2015): the authors use a common subspace model which maps all feature vectors associated with the same phrase into nearby regions of the space. For generation, a beamsearch based decoder or templates are used.…”
Section: Description As a Retrieval In Multimodal Spacementioning
confidence: 99%
“…The task of generating semantic descriptions based on images can be collectively referred to as image caption. Early work [13,30,42] to generate captions is based on templates. A group of visual concepts are detected in the image and then expressed by templates.…”
Section: Scene Understandingmentioning
confidence: 99%
“…Similarly, in [15], the Conditional Random Field (CRF) model is implemented with the correspondence attributes, objects, and prepositions prediction to construct the sentence according to predefined templates. A new subspace embedded approach is suggested for image caption generation, called Common Subspace for Model and Similarity (CoSMoS) in [16].…”
Section: Traditional Image Captioning Modelsmentioning
confidence: 99%