Using Semantic Features for Scene Classification: How Good Do They Need to Be?

Boutell, Matthew; Choudhury, Anustup; Luo, Jiebo; Brown, Christopher M.

doi:10.1109/icme.2006.262955

Cited by 12 publications

(18 citation statements)

References 10 publications

“…To obtain multiclass classification, we trained a SVM for each class to distinguish it from all others, and classified the image with the class whose SVM gave the maximum output. Further details are given in [3].…”

Section: Discriminative Approach

mentioning

confidence: 99%
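The one-vs-all decision rule quoted above (one SVM per class, pick the class with the maximum output) can be sketched as follows. This is a minimal illustration, not the cited authors' implementation: it assumes linear SVMs, and the class names, weights, and biases are hypothetical hand-picked values standing in for trained models.

```python
from typing import Dict, Sequence, Tuple

# A binary linear SVM is represented here as (weights, bias).
# Assumption: linear kernels; the quoted text does not state the kernel.
LinearSVM = Tuple[Sequence[float], float]


def svm_output(model: LinearSVM, x: Sequence[float]) -> float:
    """Signed decision value w . x + b of one binary linear SVM."""
    weights, bias = model
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def classify_one_vs_all(models: Dict[str, LinearSVM], x: Sequence[float]) -> str:
    """Return the class whose one-vs-all SVM gives the maximum output."""
    return max(models, key=lambda label: svm_output(models[label], x))


# Hypothetical parameters; in practice each pair would come from training
# one SVM to separate its class from all the others.
models = {
    "beach": ([1.0, -0.5], 0.0),
    "field": ([-0.5, 1.0], 0.0),
    "urban": ([-1.0, -1.0], 0.5),
}

print(classify_one_vs_all(models, [2.0, 0.0]))
```

Taking the maximum of the raw decision values assumes the per-class outputs are comparable in scale, which is the usual working assumption for this scheme.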

“…The most effective of these models uses pairwise spatial relationships between regions. 3) In Section V, we compare this model with three other generative models: an exact model that models the full joint distribution of the scene type and every semantic region in the image, one that models co-occurrence of these regions while ignoring the actual spatial relations, and one that treats these regions independently. 4) Finally, we compare our model with a discriminative model that uses high-level features and with one that uses low-level features.…”

mentioning

confidence: 99%

Scene Parsing Using Region-Based Generative Models

Boutell, Luo, Brown

2007

IEEE Trans. Multimedia

Self Cite

Abstract: Semantic scene classification is a challenging problem in computer vision. In contrast to the common approach of using low-level features computed from the whole scene, we propose "scene parsing" utilizing semantic object detectors (e.g., sky, foliage, and pavement) and region-based scene-configuration models. Because semantic detectors are faulty in practice, it is critical to develop a region-based generative model of outdoor scenes based on characteristic objects in the scene and spatial relationships between them. Since a fully connected scene configuration model is intractable, we chose to model pairwise relationships between regions and estimate scene probabilities using loopy belief propagation on a factor graph. We demonstrate the promise of this approach on a set of over 2000 outdoor photographs, comparing it with existing discriminative approaches and those using low-level features.

“…As in [2], we convert the image to LST space and split the image into blocks formed by an NxN grid. We then compute the mean and variance of each block's color band.…”

Section: Raw Feature Extraction

mentioning

confidence: 99%
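The block-wise feature described in the statement above (convert to LST, split into an NxN grid, take the mean and variance of each block's color bands) can be sketched as follows. This is a rough illustration under stated assumptions, not the cited implementation: the LST conversion formula is an assumption (the quoted text does not define it), the function names are hypothetical, and the image is a plain nested list of RGB tuples with at least N rows and N columns.

```python
import math
from typing import List, Sequence, Tuple

Pixel = Tuple[float, float, float]


def rgb_to_lst(pixel: Pixel) -> Pixel:
    """Assumed LST definition: a luminance band plus two chrominance bands."""
    r, g, b = pixel
    return (
        (r + g + b) / math.sqrt(3),
        (r - b) / math.sqrt(2),
        (r - 2 * g + b) / math.sqrt(6),
    )


def spatial_color_moments(image: Sequence[Sequence[Pixel]], n: int) -> List[float]:
    """Mean and variance of each LST band in each cell of an n x n grid.

    Returns n * n * 3 * 2 values, ordered block by block (row-major),
    then band by band, as [mean, variance] pairs.
    """
    height, width = len(image), len(image[0])
    features: List[float] = []
    for by in range(n):
        y0, y1 = by * height // n, (by + 1) * height // n
        for bx in range(n):
            x0, x1 = bx * width // n, (bx + 1) * width // n
            block = [
                rgb_to_lst(image[y][x])
                for y in range(y0, y1)
                for x in range(x0, x1)
            ]
            for band in range(3):
                values = [p[band] for p in block]
                mean = sum(values) / len(values)
                variance = sum((v - mean) ** 2 for v in values) / len(values)
                features.extend([mean, variance])
    return features


# Tiny synthetic example: a uniform gray 4x4 image on a 2x2 grid.
gray = [[(30.0, 30.0, 30.0)] * 4 for _ in range(4)]
features = spatial_color_moments(gray, 2)
print(len(features))
```

For a uniform image, every block has near-zero variance and zero chrominance, which is the kind of homogeneous, low-information region the later statements say weakens this feature indoors.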

“…Spatial color moments are a state-of-the-art feature used to distinguish outdoor scenes [2][10]; we use them as a baseline feature for comparison, even though color is expected to be more salient for outdoor scenes than indoor ones. As in [2], we convert the image to LST space and split the image into blocks formed by an NxN grid.…”

Section: Raw Feature Extraction

mentioning

confidence: 99%

“…Distinguishing between common rooms in indoor environments such as homes is a much more challenging problem. Features typically used for outdoor scenes in the literature, such as spatial color moments [2] [10], often fail in indoor environments because colors are a weak predictor of room type. Also, home interiors contain much extraneous data common to most or all room types that is useless for classification: homogeneous areas corresponding to blank walls, floors, and ceilings and generic "objects" such as doors or corners.…”

Section: Introduction and Related Work

mentioning

confidence: 99%
