2013
DOI: 10.1007/s11760-013-0424-7
|View full text |Cite
|
Sign up to set email alerts
|

A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 11 publications
0
4
0
Order By: Relevance
“…ST&V hash is the XOR operation between ST and V hashes. This fingerprint is robust against some temporal attacks (frame dropping, frame rate conversion, and frame exchanging) and signal processing attacks (Gaussian noising, median filtering, histogram equalization, among others).CST-SURF [27]. CST-SURF is part of a spatiotemporal registration framework.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…ST&V hash is the XOR operation between ST and V hashes. This fingerprint is robust against some temporal attacks (frame dropping, frame rate conversion, and frame exchanging) and signal processing attacks (Gaussian noising, median filtering, histogram equalization, among others).CST-SURF [27]. CST-SURF is part of a spatiotemporal registration framework.…”
Section: Resultsmentioning
confidence: 99%
“…For instance, in [20] four detectors are combined: SIFT, SURF, DCT for visual component and WASF for audio component. Other methods that combine visual and acoustic information are proposed in [27, 28]. In [29], the combination of robust local and global visual feature representations with a time-variant jitter synchronization gives robustness against scale, orientation and affine transforms.…”
Section: Related Workmentioning
confidence: 99%
“…Here, we used this small dataset to assess the robustness of features we used to guard against possible video distortions. The ReTRiEVED [77] Dataset was created to evaluate methods that require video quality assessment in transmissions. The ReTRiEVED dataset contains 176 test videos obtained from 8 source videos by applying the transmission parameters listed in Table 16.…”
Section: Retrieved Datasetmentioning
confidence: 99%
“…Since the match numbers differ greatly under different thresholds, we use the 12-step threshold values to find better conditions for SIFT. Figure 12 shows Here, we compared the SIFT features against some related methods: CST−SURF [77], CC [78], and {Th; CC; ORB} [24] for video retrieval. In Table 17, the average detection F1-scores are presented.…”
Section: Attacksmentioning
confidence: 99%