CVPR 2011 2011
DOI: 10.1109/cvpr.2011.5995453
|View full text |Cite
|
Sign up to set email alerts
|

Learning and matching multiscale template descriptors for real-time detection, localization and tracking

Abstract: We describe a system to learn an object template from a video stream, and localize and track the corresponding object in live video. The template is decomposed into a number of local descriptors, thus enabling detection and tracking in spite of partial occlusion. Each local descriptor aggregates contrast invariant statistics (normalized intensity and gradient orientation) across scales, in a way that enables matching under significant scale variations. Lowlevel tracking during the training video sequence enabl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(26 citation statements)
references
References 20 publications
0
26
0
Order By: Relevance
“…This is the fundamental link that ties together sensing and control: Without control, no (time-and risk-optimal) sensing can be performed in the presence of line-of-sight and quantization phenomena, and clearly without sensing no (feedback) control could be performed. Follow-up work includes the course notes [68], a characterization of detachable object detection, [7,6], a characterization of the controlled recognition bounds [70], of sufficient exploration [62], and the optimal descriptor TST/BTD [49]. .…”
Section: Summary and Discussionmentioning
confidence: 99%
“…This is the fundamental link that ties together sensing and control: Without control, no (time-and risk-optimal) sensing can be performed in the presence of line-of-sight and quantization phenomena, and clearly without sensing no (feedback) control could be performed. Follow-up work includes the course notes [68], a characterization of detachable object detection, [7,6], a characterization of the controlled recognition bounds [70], of sufficient exploration [62], and the optimal descriptor TST/BTD [49]. .…”
Section: Summary and Discussionmentioning
confidence: 99%
“…However in many cases, the use of a more complicated model for similarity measurement results in higher computational cost. Further, sliding window search is also a costly stage for such methods [14]. While there exist known algorithms for fast search of object instances in an image using branch-and-bound techniques [13], in our particular problem, methods of this type have two crucial limitations.…”
Section: Introductionmentioning
confidence: 99%
“…Then the normalized density score with respect to the spatially, uniformly distributed response is shown in Eqn. (14).…”
Section: Matching Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Table 1: 30 kinds of food ingredients in the data set. types ingredients fish (5) tuna, squid, octopus, shrimp, salmon meat (6) beef, pork, chicken, minced meat, sausage, ham vegetable mushroom, potato, eggplant, carrot, radish, (13) tomato, cucumber, cabbage, green onions, onion, Chinese cabbage, lettuce, Shiitake mushroom fruit (6) apple, strawberry, pineapple, orange, banana, grapefruit…”
Section: Data Collection and Experimental Settingmentioning
confidence: 99%