2000
DOI: 10.1117/12.387181
|View full text |Cite
|
Sign up to set email alerts
|

<title>Ground truth for training and evaluation of automatic main subject detection</title>

Abstract: A consumer photograph, or snapshot, is a medium for conveying to a viewer, one's interest in one or more main subjects. A methodology is presented for collecting ground truth data useful for training and evaluating algorithms designed to automatically detect the main subject of a consumer photograph. For a database of 100 images, 16 observers provided polygonal approximations to the image areas that comprise the main subject. Results from all observers are combined to form a truth image that is considered the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2001
2001
2011
2011

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 1 publication
0
9
0
Order By: Relevance
“…The sharp regions consisting of sharp blocks are the main subject regions. for j = 1 to n do 5: Compute the DCT coefficient T of B ij 6: Compute β (2,2)i,j of T 7: if β (2,2)i,j > ξ max then 8: β (2,2)i,j = ξ max 9: end if 10: if β (2,2)i,j < θ DCT then if Dist(β (2,2)i,j , µ C1 ) > Dist(β (2,2)i,j , µ C2 ) then 14: if Dist(β (2,2)i,j , µ C1 ) < Dist(β (2,2)i,j , µ C2 ) then …”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The sharp regions consisting of sharp blocks are the main subject regions. for j = 1 to n do 5: Compute the DCT coefficient T of B ij 6: Compute β (2,2)i,j of T 7: if β (2,2)i,j > ξ max then 8: β (2,2)i,j = ξ max 9: end if 10: if β (2,2)i,j < θ DCT then if Dist(β (2,2)i,j , µ C1 ) > Dist(β (2,2)i,j , µ C2 ) then 14: if Dist(β (2,2)i,j , µ C1 ) < Dist(β (2,2)i,j , µ C2 ) then …”
Section: Discussionmentioning
confidence: 99%
“…But this method need construct a training set, and require the expert to distinguish the two different regions. Luo, Etz, Singhal [2] [3] proposed a Bayes neural network to detect the main subject region, and it is also a supervised learning method. Li,Wang et al [4] [5] analyze the statistics of the high-frequency wavelet coefficients to segment the focused regions in an image, then detect the main subject regions.…”
Section: Introductionmentioning
confidence: 99%
“…The central zone of the image is then classified as the main subject and all the other zones are classified as background. Since our application consists of consumer-type photographic images, there is strong evidence that main subject regions are located more towards the center of the image rather than towards the borders [10]. The performance of the Czone predictor (compared to the ground truth) sets the baseline for the Bayesian network and neural network classifiers.…”
Section: The Classifiersmentioning
confidence: 99%
“…We define centrality by computing the integral of a probability density function (PDF) of main subject location over the area of a given region. The PDF is derived from the ground truth data [10]. In doing so, every pixel of a given region, not just the centroid, contributes to the centrality measure of the region to a varying degree depending on its location.…”
Section: Structural Feature Extractionmentioning
confidence: 99%
“…Testing and performance evaluation of all these algorithms requires MROI and MLOI database. An experiment for developing an image-set and its ground-truth for training and evaluation of automatic main subject detection (people detection) algorithms were proposed in [2]. The authors used 100 images of consumer photographs to develop ground-truth of salient regions.…”
Section: Introductionmentioning
confidence: 99%