2014
DOI: 10.1007/s11042-014-2185-x
|View full text |Cite
|
Sign up to set email alerts
|

GPU-based MapReduce for large-scale near-duplicate video retrieval

Abstract: With the exponential growth of multimedia data, people are overwhelmed with massive amount of online videos, of which Near-Duplicate Videos (NDVs) occupy a large portion. In this paper, we present a novel framework for NDV retrieval, which explores the parallel power of two promising techniques: Graphics Processing Unit (GPU) and MapReduce. With the power of the proposed framework, various key algorithms in the field of computer vision, such as K-Means clustering, bag of features, inverted file index with hamm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Hence this approach is challenging to parallelize accurately and efficiently even for the stateof-the-art big-data frameworks. Wang et al [168] proposed a novel MapReduce framework called Multimedia and Intelligent Computing Cluster for near-duplicate video retrieval for large-scare multimedia data processing by joining the computing power of CPU's and GPU's to speed up the video data processing. They extract the keyframes using uniform sampling, store the keyframes to HDFS, perform local feature extraction using the Hessian-Affine detector [169] to detect interest points.…”
Section: A Content-based Video Retrievalmentioning
confidence: 99%
“…Hence this approach is challenging to parallelize accurately and efficiently even for the stateof-the-art big-data frameworks. Wang et al [168] proposed a novel MapReduce framework called Multimedia and Intelligent Computing Cluster for near-duplicate video retrieval for large-scare multimedia data processing by joining the computing power of CPU's and GPU's to speed up the video data processing. They extract the keyframes using uniform sampling, store the keyframes to HDFS, perform local feature extraction using the Hessian-Affine detector [169] to detect interest points.…”
Section: A Content-based Video Retrievalmentioning
confidence: 99%
“…Hence this approach is challenging to parallelize accurately and efficiently even for the state-of-the-art big-data frameworks. Wang et al [168] proposed a novel MapReduce framework called Multimedia and Intelligent Computing Cluster for nearduplicate video retrieval for large-scare multimedia data processing by joining the computing power of CPU's and GPU's to speed up the video data processing. They extract the keyframes using uniform sampling, store the keyframes to HDFS, perform local feature extraction using the Hessian-Affine detector [169] to detect interest points.…”
Section: A Content-based Video Retrievalmentioning
confidence: 99%
“…In recent years, research involving video retrieval has been increased, mainly due to the popularity of sites for video sharing and viewing over the Web, e.g., YouTube. However, according to [1], several videos are either identical or almost identical when they are compared to each other. Usually, they differ in the format, encoding parameters, and the use of edition operations for the addition and/or removal of frames.…”
Section: Introductionmentioning
confidence: 99%
“…Some methods are also based on hash approaches to match similar videos and solve NDVR problem using global or local visual features as video representation [6]. As the amount of data grows, some works address the near-duplicate video detection problem by using parallel solutions such as MapReduce framework or GPU for large-scale NDVR problem [1]. When using the video-level strategy, the common approach to identify near-duplicate videos is based on the following: (i) videos are segmented into shots that are represented by key-frames; (ii) each key-frame is described by a signature (or descriptor), usually belonging to a high-dimensional space; (iii) then a global signature is computed over the key-frame descriptors to represent the whole video; and (iv) the similarity among videos can be computed by using either the set of signatures of key-frames or the global signature.…”
Section: Introductionmentioning
confidence: 99%