2021
DOI: 10.1108/ijwis-11-2020-0067
|View full text |Cite
|
Sign up to set email alerts
|

A new neutrosophic TF-IDF term weighting for text mining tasks: text classification use case

Abstract: Purpose This paper aims to present a new term weighting approach for text classification as a text mining task. The original method, neutrosophic term frequency – inverse term frequency (NTF-IDF), is an extended version of the popular fuzzy TF-IDF (FTF-IDF) and uses the neutrosophic reasoning to analyze and generate weights for terms in natural languages. The paper also propose a comparative study between the popular FTF-IDF and NTF-IDF and their impacts on different machine learning (ML) classifiers for docum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 34 publications
0
5
0
Order By: Relevance
“…Text feature extraction is a fundamental task in text classification, which is divided into two methods: machine learning and deep learning. The machine learning method was usually adopted word frequency statistics, e.g., Term Frequency-Inverse Document Frequency (TF-IDF) [23] can characterize text features by considering the word frequency and inverse document frequency, which can better classify the texts. The machine learning methods cannot mine more profound into the contextual semantic information of text and do not perform well on text classification tasks with high abstraction, such as the threat behavior recognition task in this paper.…”
Section: Text Feature Extraction and Classification Methodsmentioning
confidence: 99%
“…Text feature extraction is a fundamental task in text classification, which is divided into two methods: machine learning and deep learning. The machine learning method was usually adopted word frequency statistics, e.g., Term Frequency-Inverse Document Frequency (TF-IDF) [23] can characterize text features by considering the word frequency and inverse document frequency, which can better classify the texts. The machine learning methods cannot mine more profound into the contextual semantic information of text and do not perform well on text classification tasks with high abstraction, such as the threat behavior recognition task in this paper.…”
Section: Text Feature Extraction and Classification Methodsmentioning
confidence: 99%
“…5 that is a screenshot depicting polarity and subjectivity measures of each statement on downloaded files. The second task of the first phase is applying "neutrosophic term frequencyinverse term frequency (NTF-IDF)", that "is an extended version of the popular fuzzy TF-IDF (FTF-IDF) and uses the neutrosophic reasoning to analyze and generate weights for terms in natural languages" as a text mining approach [47]. We use the hereunder formulation described on the work of Majumdar to extract explicitly out of tacticity using, the extracted polarity, subjectivity, and recency matrix.…”
Section: Fig 2 Research Methodology and Proposed Approachmentioning
confidence: 99%
“…There are plenty of text mining approaches available at the works of Michael et al, [31] and Feng et al, [32] and quite comprehensive number of techniques, methods, tools, libraries and packges that one can get more information reading the work of Ignatow and Mihalcea [33]. The neutrosophic term weighing technique developed by Bounabi et al, is considered as the main text mining analysis method used to execute to be defined methodological approach [34].…”
Section: Background Of Text Miningmentioning
confidence: 99%
“…Whether to model language in a connectionist or symbolic manner hinges also on its inherent compositionality. 17 Von Rueden et al [152] propose a taxonomy for integrating prior knowledge into learning systems. This is an extensive work covering types of knowledge and knowledge representations, neuro-symbolic integration approaches, motivations for each approach, challenges and future directions.…”
Section: Related Workmentioning
confidence: 99%
“…Given that our work deals with natural language as input, we are only concerned with logic rules (which we subdivide into rules and logic) and knowledge graphs (which we subdivide into frames and semantic networks) -see Section 6.2.2. 16 https://www.britannica.com/topic/morphology-linguistics 17 According to Noam Chomsky's theory of language, language is compositional, in the sense that a sentence is composed of phrases, which are in turn composed of sub-phrases, and so on, in a recursive manner. This idea enables the construction of infinite possibilities from finite means.…”
Section: Related Workmentioning
confidence: 99%