GPU-based MapReduce for large-scale near-duplicate video retrieval

Wang, Hanli; Zhu, Fengkuangtian; Wang, Lei; Jiang, Yu‐Gang

doi:10.1007/s11042-014-2185-x

“…Hence this approach is challenging to parallelize accurately and efficiently even for the state-of-the-art big-data frameworks. Wang et al [168] proposed a novel MapReduce framework called Multimedia and Intelligent Computing Cluster for near-duplicate video retrieval for large-scale multimedia data processing by joining the computing power of CPU's and GPU's to speed up the video data processing. They extract the keyframes using uniform sampling, store the keyframes to HDFS, perform local feature extraction using the Hessian-Affine detector [169] to detect interest points.…”

Section: A Content-based Video Retrieval (mentioning)

confidence: 99%

Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues

Alam, Ullah, Lee

2020


The proliferation of multimedia devices over the Internet of Things (IoT) generates an unprecedented amount of data. Consequently, the world has stepped into the era of big data. Recently, on the rise of distributed computing technologies, video big data analytics in the cloud has attracted the attention of researchers and practitioners. The current technology and market trends demand an efficient framework for video big data analytics. However, the current work is too limited to provide a complete survey of recent research work on video big data analytics in the cloud, including the management and analysis of a large amount of video data, the challenges, opportunities, and promising research directions. To serve this purpose, we present this study, which conducts a broad overview of the state-of-the-art literature on video big data analytics in the cloud. It also aims to bridge the gap among large-scale video analytics challenges, big data solutions, and cloud computing. In this study, we clarify the basic nomenclatures that govern the video analytics domain and the characteristics of video big data while establishing its relationship with cloud computing. We propose a service-oriented layered reference architecture for intelligent video big data analytics in the cloud. Then, a comprehensive and keen review has been conducted to examine cutting-edge research trends in video big data analytics. Finally, we identify and articulate several open research issues and challenges, which have been raised by the deployment of big data technologies in the cloud for video big data analytics. To the best of our knowledge, this is the first study that presents the generalized view of the video big data analytics in the cloud. This paper provides the research studies and technologies advancing the video analyses in the era of big data and cloud computing. 
INDEX TERMS big data, intelligent video analytics, cloud-based video analytics system, video analytics survey, deep learning, distributed computing, intermediate results orchestration, cloud computing.


“…Hence this approach is challenging to parallelize accurately and efficiently even for the state-of-the-art big-data frameworks. Wang et al [168] proposed a novel MapReduce framework called Multimedia and Intelligent Computing Cluster for near-duplicate video retrieval for large-scale multimedia data processing by joining the computing power of CPU's and GPU's to speed up the video data processing. They extract the keyframes using uniform sampling, store the keyframes to HDFS, perform local feature extraction using the Hessian-Affine detector [169] to detect interest points.…”

Section: A Content-based Video Retrieval (mentioning)

confidence: 99%

Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues

Alam, Ullah, Lee

2020

Preprint



“…In recent years, research involving video retrieval has been increased, mainly due to the popularity of sites for video sharing and viewing over the Web, e.g., YouTube. However, according to [1], several videos are either identical or almost identical when they are compared to each other. Usually, they differ in the format, encoding parameters, and the use of edition operations for the addition and/or removal of frames.…”

Section: Introduction (mentioning)

confidence: 99%

“…Some methods are also based on hash approaches to match similar videos and solve NDVR problem using global or local visual features as video representation [6]. As the amount of data grows, some works address the near-duplicate video detection problem by using parallel solutions such as MapReduce framework or GPU for large-scale NDVR problem [1]. When using the video-level strategy, the common approach to identify near-duplicate videos is based on the following: (i) videos are segmented into shots that are represented by key-frames; (ii) each key-frame is described by a signature (or descriptor), usually belonging to a high-dimensional space; (iii) then a global signature is computed over the key-frame descriptors to represent the whole video; and (iv) the similarity among videos can be computed by using either the set of signatures of key-frames or the global signature.…”

Section: Introduction (mentioning)

confidence: 99%
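
The four video-level steps listed in the statement above can be sketched roughly as follows. This is only an illustrative toy, not the method of any cited paper: uniform sampling stands in for shot segmentation, a gray-level histogram stands in for the key-frame descriptor, the global signature is a plain average, and every function name here is hypothetical.

```python
import math
from typing import List, Sequence

# Assumption: a frame is a flat sequence of gray levels in 0..255.
Frame = Sequence[int]


def sample_keyframes(frames: Sequence[Frame], step: int) -> List[Frame]:
    """Step (i), simplified: take every `step`-th frame as a key-frame."""
    return list(frames[::step])


def describe(frame: Frame, bins: int = 4) -> List[float]:
    """Step (ii): describe a key-frame by a normalized gray-level histogram."""
    hist = [0.0] * bins
    for pixel in frame:
        hist[min(pixel * bins // 256, bins - 1)] += 1.0
    total = sum(hist) or 1.0
    return [h / total for h in hist]


def global_signature(descriptors: Sequence[Sequence[float]]) -> List[float]:
    """Step (iii): average the key-frame descriptors into one video signature."""
    n = len(descriptors)
    return [sum(column) / n for column in zip(*descriptors)]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Step (iv): cosine similarity between two signatures."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def video_signature(frames: Sequence[Frame], step: int = 2, bins: int = 4) -> List[float]:
    """Run steps (i) to (iii) for one video."""
    keyframes = sample_keyframes(frames, step)
    return global_signature([describe(f, bins) for f in keyframes])


# Toy data: video_b is a slightly re-encoded copy of video_a; video_c is unrelated.
video_a = [[0, 10, 200, 250]] * 4
video_b = [[5, 12, 210, 240]] * 4
video_c = [[100, 110, 120, 130]] * 4

sim_ab = cosine(video_signature(video_a), video_signature(video_b))
sim_ac = cosine(video_signature(video_a), video_signature(video_c))
print(sim_ab > sim_ac)
```

In a real system the descriptor would be far higher-dimensional, and comparing all pairs of signatures this way is quadratic in the number of videos, which is the cost the parallel and approximate approaches cited on this page try to reduce.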

Near-duplicate video detection based on an approximate similarity self-join strategy

Silva, Patrocínio, Gravier et al. 2016

2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)


The huge amount of redundant multimedia data, like video, has become a problem in terms of both space and copyright. Usually, the methods for identifying near-duplicate videos are neither adequate nor scalable to find pairs of similar videos. Similarity self-join operation could be an alternative to solve this problem in which all similar pairs of elements from a video dataset are retrieved. Nonetheless, methods for similarity self-join have poor performance when applied to high-dimensional data. In this work, we propose a new approximate method to compute similarity self-join in sub-quadratic time in order to solve the near-duplicate video detection problem. Our strategy is based on clustering techniques to find out groups of videos which are similar to each other.


GPU-based MapReduce for large-scale near-duplicate video retrieval

Cited by 10 publications

References 24 publications
