2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2019
DOI: 10.1109/jcdl.2019.00026
|View full text |Cite
|
Sign up to set email alerts
|

Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations

Abstract: Identifying academic plagiarism is a pressing task for educational and research institutions, publishers, and funding agencies. Current plagiarism detection systems reliably find instances of copied and moderately reworded text. However, reliably detecting concealed plagiarism, such as strong paraphrases, translations, and the reuse of nontextual content and ideas is an open research problem. In this paper, we extend our prior research on analyzing mathematical content and academic citations. Both are promisin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 31 publications
(25 citation statements)
references
References 31 publications
1
23
0
Order By: Relevance
“…Approaches analyzing nontextual content features, such as academic citations [16,28], images [25,11], and mathematical content [26,29], complement the text analysis approaches to improve the detection of concealed plagiarism.…”
Section: Related Workmentioning
confidence: 99%
“…Approaches analyzing nontextual content features, such as academic citations [16,28], images [25,11], and mathematical content [26,29], complement the text analysis approaches to improve the detection of concealed plagiarism.…”
Section: Related Workmentioning
confidence: 99%
“…Nontextual content features in academic documents are a valuable source of semantic information that are largely independent of natural language text. Considering these sources of semantic information for similarity analysis raises the effort plagiarists must invest for obfuscating reused content [16,22].…”
Section: Background and Related Workmentioning
confidence: 99%
“…However, only a few studies have addressed the detection of plagiarism in digital mathematical libraries [19,21,22] regardless of the detection approach. We briefly describe the main findings of these studies hereafter.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Fundamental knowledge on frequency distributions of math formulae is beneficial for numerous applications in MathIR, ranging from educational purposes [3] to math recommendation systems, search engines [22,25], and even automatic plagiarism detection systems [29,39,41]. For example, students can search for the conventions to write certain quantities in formulae; document preparation systems can integrate an auto-completion or auto-correction service for math inputs; search or recommendation engines can adjust their ranking scores with respect to standard notations; and plagiarism detection systems can estimate whether two identical formulae indicate potential plagiarism or are just using the conventional notations in a particular subject area.…”
Section: Introductionmentioning
confidence: 99%