2016
DOI: 10.4018/ijirr.2016040102
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Step Iterative Algorithm for Feature Selection on Dynamic Documents

Abstract: The authors propose clustering based multistep iterative algorithm. The important step is where terms are grouped by synonyms. It takes advantage of semantic relativity measure between the terms. Term frequency is computed of the group of synonyms by considering the relativity measure of the terms appearing in the document from the parent term in the group. This increases the importance of terms which though individually appear less frequently but together show their strong presence. The authors tried experime… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…Group Frequency of this synset group is computed based on the frequency of occurrence of different terms in the synset group. For example, if customer, client and consumer have frequency of occurrence 6, 3 and 2, respectively, then the frequency of the group is computed as (6 × 1) + (3 × 0.7) + (2 × 0.6) = 6 + 2.1 + 1.2 = 9 (Bafna et al , 2016).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Group Frequency of this synset group is computed based on the frequency of occurrence of different terms in the synset group. For example, if customer, client and consumer have frequency of occurrence 6, 3 and 2, respectively, then the frequency of the group is computed as (6 × 1) + (3 × 0.7) + (2 × 0.6) = 6 + 2.1 + 1.2 = 9 (Bafna et al , 2016).…”
Section: Methodsmentioning
confidence: 99%
“…Text mining techniques like clustering, feature extraction and so on can be beneficial. In text mining, major terms (frequently occurring) are considered as features representing a document (Bafna et al , 2016; Sun and Vasarhelyi, 2018). Clustering helps in grouping documents based on the similarity between the terms present in the document.…”
Section: Introductionmentioning
confidence: 99%
“…Synonyms and Meronyms are identified for each term and assembled into groups to form the synsets. The synsets having group frequency greater than the threshold frequency are chosen to represent the columns and reviews are placed in rows [20]. The synset document matrix containing the group frequency count is normalized to form the feature matrix.…”
Section: Review Based Feature Matrix Generationmentioning
confidence: 99%
“…Entropy is consistent even for large data size. Document Management System (DMS) facilitates to access the documents in a fast and easy way and turn increases the productivity of the work [52][53]. Grouping of documents is one of the most important steps towards document management, which helps in identifying replicas of documents, clustering search engine results, and so on [54][55].…”
Section: Introductionmentioning
confidence: 99%