2011 IEEE 11th International Conference on Computer and Information Technology 2011
DOI: 10.1109/cit.2011.61

A Novel Adjustable Matrix Bloom Filter-Based Copy Detection System for Digital Libraries

Abstract: With the growing volume of online literature and the ease of finding and downloading data, plagiarism, the dishonest use of others' findings, is becoming more widespread. A copy detection system is therefore needed to address this problem efficiently. Most current systems focus on a single goal: estimating similarity with the highest possible accuracy, i.e. 100%. In some real applications, however, it can be useful to take into account other factors such as query spee…

Cited by 11 publications (21 citation statements)
References 25 publications
“…A query encryption method for encrypted database is proposed by Arai et al [11]. Bloom filter is also applied for copy detection system by Geravand et al [12]. Still more, there are some researches [13]- [19] for enhancing security with Bloom filter.…”
Section: B Bloom Filter (mentioning)
confidence: 98%
“…Typically, to realize the minimum false positive rate, the bit utilization in a standard BF is only 50%. Therefore, multiple works have been conducted to tackle this issue [99] [100] [101] [102] [103].…”
Section: Space Efficiency (mentioning)
confidence: 99%
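
The 50% figure quoted above follows from the standard Bloom filter analysis. As a brief sketch, assume a filter of m bits holding n items with k independent hash functions (these symbols are introduced here for illustration and do not appear in the excerpt):

```latex
% Approximate false positive rate of a standard Bloom filter
f \approx \left(1 - e^{-kn/m}\right)^{k}

% Minimizing f over k gives the optimal number of hash functions
k^{*} = \frac{m}{n}\ln 2

% At k = k^{*}, the probability that a given bit is still 0 is
e^{-k^{*} n/m} = e^{-\ln 2} = \frac{1}{2}

% so about half of the bits are set, and the minimum rate is
f_{\min} \approx \left(\frac{1}{2}\right)^{k^{*}}
```

In other words, the false positive rate is lowest when roughly half of the bits are 1 and half are 0, which is the 50% bit utilization the statement refers to.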
“…Matrix BF. A Matrix BF is a bit matrix in which each bit can be set or reset to detect copy-paste contents in a literature library [103]. The matrix BF consists of N rows each of which records the contents in one document.…”
Section: Space Efficiency (mentioning)
confidence: 99%
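
The row-per-document layout described above can be illustrated with a minimal Python sketch. It is not the cited paper's implementation: the class name `MatrixBloomFilter`, the helper `shingles`, the use of word-level shingles, the SHA-256 double hashing, and all default parameters are assumptions made for this example, and the "adjustable" aspect of the original system is not modeled.

```python
import hashlib


def shingles(text, size=3):
    """Return the set of word-level shingles (runs of `size` consecutive words)."""
    words = text.lower().split()
    if not words:
        return set()
    last_start = max(len(words) - size, 0)
    return {" ".join(words[i:i + size]) for i in range(last_start + 1)}


class MatrixBloomFilter:
    """Hypothetical matrix of Bloom filter rows, one row per document."""

    def __init__(self, num_bits=1024, num_hashes=3, shingle_size=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.shingle_size = shingle_size
        self.rows = []  # each row is a Python int used as a bit vector

    def _positions(self, item):
        # Double hashing over one SHA-256 digest (an illustrative choice).
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add_document(self, text):
        """Record a document's shingles in a new row and return its row index."""
        row = 0
        for shingle in shingles(text, self.shingle_size):
            for pos in self._positions(shingle):
                row |= 1 << pos
        self.rows.append(row)
        return len(self.rows) - 1

    def overlap(self, text):
        """For each row, the fraction of query shingles the row reports as present."""
        query = [self._positions(s) for s in shingles(text, self.shingle_size)]
        if not query:
            return [0.0] * len(self.rows)
        scores = []
        for row in self.rows:
            hits = sum(
                all((row >> pos) & 1 for pos in positions) for positions in query
            )
            scores.append(hits / len(query))
        return scores


library = MatrixBloomFilter()
library.add_document("the quick brown fox jumps over the lazy dog")
print(library.overlap("the quick brown fox sat down"))
```

Because a Bloom filter has false positives but no false negatives, each score is an upper bound on the true shingle overlap: a fully copied document always scores 1.0, while an unrelated one may score slightly above zero.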
“…For example, use of probabilistic data structure called Bloom filter [7] was suggested as a way to increase the speed of text comparison [8] [9]. The approach provided reasonable results in terms of the speed and quality of the detection but has an obvious applicability limitation as it assumes storing shingles in the RAM.…”
Section: Scaling Up Various Solutions For Text Comparison (mentioning)
confidence: 99%