2016 International Conference on Platform Technology and Service (PlatCon) 2016
DOI: 10.1109/platcon.2016.7456789

Detecting Text Similarity on a Scalable No-SQL Database Platform

Abstract: The paper examines the platform scalability problem for near-similar document detection tasks. Application areas for the proposed approach include plagiarism detection and text filtering in data leak prevention systems. The paper reviews the limitations of current solutions based on relational DBMSs and suggests a data structure suitable for implementation in no-SQL databases on highly scalable clustered platforms. The proposed data structure is based on the "key-value" model and does not depend on …
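The abstract is truncated before it describes the key-value layout, so the following is only a minimal sketch of how such an index could look, not the authors' design. It assumes (hypothetically) that keys are hashes of word shingles and values are sets of document IDs; the names `shingle_keys`, `index_document`, and `find_similar` are illustrative, and a plain in-memory dictionary stands in for a clustered no-SQL store.

```python
import hashlib
from collections import defaultdict


def shingle_keys(text, k=3):
    """Return the set of hashed k-word shingles for a text (assumed key scheme)."""
    words = text.lower().split()
    if len(words) < k:
        grams = [" ".join(words)] if words else []
    else:
        grams = [" ".join(words[i:i + k]) for i in range(len(words) - k + 1)]
    # Truncated SHA-1 digests serve as compact keys for the key-value store.
    return {hashlib.sha1(g.encode("utf-8")).hexdigest()[:16] for g in grams}


def index_document(store, doc_id, text, k=3):
    """Add doc_id to the value set stored under each shingle key."""
    for key in shingle_keys(text, k):
        store[key].add(doc_id)


def find_similar(store, text, k=3):
    """Score indexed documents by the fraction of query shingles they share."""
    keys = shingle_keys(text, k)
    hits = defaultdict(int)
    for key in keys:
        # One key lookup per shingle; no joins or secondary indexes needed.
        for doc_id in store.get(key, ()):
            hits[doc_id] += 1
    return {doc_id: count / len(keys) for doc_id, count in hits.items()}


# In-memory stand-in for the key-value database.
store = defaultdict(set)
index_document(store, "doc-a", "the quick brown fox jumps over the lazy dog")
index_document(store, "doc-b", "an entirely unrelated sentence about databases")
scores = find_similar(store, "the quick brown fox jumps over a sleeping cat")
```

Because every operation is a single-key read or write, a layout of this kind can in principle be partitioned across cluster nodes by key, which is the scalability property the abstract points to.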

Cited by 2 publications (2 citation statements)
References 8 publications
“…One final note about knowledge mobilization, is that we observed that the most prolific among the authors whose work we analyzed was Butakov, who authored or co-authored six research outputs, including one conference proceeding (Butakov 2014), one book chapter (Butakov, Shcherbinin, Diagilev and Tskhay 2013) and four peer-reviewed articles (Butakov and Barber 2012; Butakov, Dyagilev and Tskha 2012, Butakov, Murzintsev and Tskhai 2016; Butakov and Shcherbinin 2009), including both analytic and descriptive contributions. In our analysis, we found the work conducted by Butakov and his colleagues to be among the most technical, with a focus on computer science and software development.…”
Section: Types Of Knowledge Mobilization
Citation type: mentioning
confidence: 99%
“…Sub-theme 2b: Prevention. Related to professional development, a similar body of work addressed the issue of prevention, and was dominated by the work of one researcher (Butakov) in collaboration with others interested in the development of software to detect plagiarism (Butakov and Barber 2012; Butakov, Murzintsev and Tskhai, 2016; Butakov and Shcherbinin 2009; Butakov, Shcherbinin, Diagilev and Tskhay 2013). Interestingly, these studies seemed to stand apart from all the others in that their focus was highly technical and focused intently on computer science.…”
Section: Theme 2: Professional Development and Prevention
Citation type: mentioning
confidence: 99%