2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2015
DOI: 10.1109/cvprw.2015.7301299
|View full text |Cite
|
Sign up to set email alerts
|

A cloud infrastructure for target detection and tracking using audio and video fusion

Abstract: This paper presents a Cloud-based architecture for detecting and tracking multiple moving targets from airborne videos combined with the audio assistance, which is called Cloudbased Audio-Video (CAV) fusion. The CAV system innovation is a method for user-based voice-to-text color feature descriptor track matching with an automated hue feature extraction from image pixels. The introduced CAV approach is general purpose for detecting and tracking different valuable targets' movement for suspicious behavior recog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 49 publications
(42 reference statements)
0
7
0
Order By: Relevance
“…Generally, application-specific visual features are extracted from the eye and mouth regions. Some of the visual features available in the literature include model-based, motion-based, image-based, and geometry-based features [5,6,8]. In most instances, the modality information is merged only after the feature extraction.…”
Section: Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Generally, application-specific visual features are extracted from the eye and mouth regions. Some of the visual features available in the literature include model-based, motion-based, image-based, and geometry-based features [5,6,8]. In most instances, the modality information is merged only after the feature extraction.…”
Section: Featuresmentioning
confidence: 99%
“…There is a range of application areas using multimodal data convergence and fusion. Some of the applications include the following: (i) biomedical systems for emergency care, (ii) health monitoring system, (iii) smart outdoor environment monitoring [6], (iv) multimodal video retrieval, and (v) emotion recognition [7].…”
mentioning
confidence: 99%
“…The comparative technique includes the NSA [23], Exponential Weighted Moving Average (EWMA) [29], NSA + EWMA, NSA + NARX. The results of the proposed SSDM + ENN are compared with the other existing techniques to highlight the dominance of the techniques.…”
Section: Comparative Techniquesmentioning
confidence: 99%
“…[21][22][23] Through the use of image quality, various image processing methods have been developed for cloud architectures. 24,25 An open research question is the alignment of machine-level image interpretability with that of human observers, 26,27 although initial comparisons suggest the human perception and machine-level processing are sensitive to different image characteristics. 28,29 Many examples to compute the NIIRS have been reported 11 and updates are included in the Motion Imagery Standards Board.…”
Section: Introductionmentioning
confidence: 99%