2020
DOI: 10.33774/coe-2020-7zd6k-v2
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Phrase indexing and the identification of related academic research content

Abstract: Work to automate the identification of related articles in corpora of academic research content is described. Pairs of related articles are recognised on the basis of the phrases they contain, using a similarity measure that emphasizes the importance of phrase overlap. Phrases are weighted according to their significance, evaluated in terms of statistical under-or over-representation relative to corpus-level frequency, and the significance scores of n-grams with higher n values are boosted. The measure proves … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 9 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?