2019
DOI: 10.24200/sci.2019.50426.1685
|View full text |Cite
|
Sign up to set email alerts
|

Variance-based Features for Keyword Extraction in Persian and English Text Documents

Abstract: This paper addresses automatic keyword extraction in Persian and English text documents. Generally, to extract keywords from a text, a weight is assigned to each token, and words characterized by higher weights are selected as the keywords. This study proposed four methods for weighting the words and compared these methods with ve previous weighting techniques. The previous methods used in this paper include Term Frequency (TF), Term Frequency Inverse Document Frequency (TF-IDF), variance, Discriminative Featu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…This approach considers variance of all features only without providing a clear explanation of the threshold setting. Veisi et al [23] proposes a keyword extraction approach from Persian and English text documents. Keywords are selected according to token weighting depending on the corresponding variance.…”
Section: Variance-based Feature Selectionmentioning
confidence: 99%
“…This approach considers variance of all features only without providing a clear explanation of the threshold setting. Veisi et al [23] proposes a keyword extraction approach from Persian and English text documents. Keywords are selected according to token weighting depending on the corresponding variance.…”
Section: Variance-based Feature Selectionmentioning
confidence: 99%