2017
DOI: 10.48550/arxiv.1705.04358
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Object-Level Context Modeling For Scene Classification with Context-CNN

Abstract: Convolutional Neural Networks (CNNs) have been used extensively for computer vision tasks and produce rich feature representation for objects or parts of an image. But reasoning about scenes requires integration between the low-level feature representations and the high-level semantic information.We propose a deep network architecture which models the semantic context of scenes by capturing object-level information. We use Long Short Term Memory(LSTM) units in conjunction with object proposals to incorporate o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
4
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 41 publications
1
4
0
Order By: Relevance
“…It is also worth mentioning the work of Javed and Nelakanti [2017], a recent proposal for scene recognition that also assumes object features as an ideal source of information to recognize scenes, reinforcing our premise. This work showed that with a small fraction of a large-scale dataset, one can be competitive with the state of the Figure 3.6: Hierarchical recurrent approach proposed by Zuo et al [2016].…”
Section: Recurrent Neural Networksupporting
confidence: 70%
See 3 more Smart Citations
“…It is also worth mentioning the work of Javed and Nelakanti [2017], a recent proposal for scene recognition that also assumes object features as an ideal source of information to recognize scenes, reinforcing our premise. This work showed that with a small fraction of a large-scale dataset, one can be competitive with the state of the Figure 3.6: Hierarchical recurrent approach proposed by Zuo et al [2016].…”
Section: Recurrent Neural Networksupporting
confidence: 70%
“…. 3.7 Methodology proposed by Javed and Nelakanti [2017]. A combination of CNN and RNN architectures to model spatial context for scene recognition.…”
Section: List Of Figuresmentioning
confidence: 99%
See 2 more Smart Citations
“…Herein, COOR adapts the cooccurring frequency to represent object-to-object relations, while SOOR is encoded with sequential model via regarding object sequences as sentences. Rooted in the work of Javed et al [193], Laranjeira et al [36] proposed a bidirectional LSTM to capture the contextual relations of regions of interest, as shown in Fig. 11 (a).…”
Section: Contextual Strategymentioning
confidence: 99%