Transductive Classification through Term Network (TCTN) is an interesting and accurate approach to perform semi-supervised learning based on term networks for text classification. TCTN can surpass the accuracies obtained by transductive classification approach considering texts represented in other types of networks or vector space model. Also, TCTN can surpass the accuracies obtained by inductive supervised learning algorithms. Besides, the term networks in TCTN can have their size decreased while still keeps its classification performance. This implies a less computational cost than other semi-supervised learning approaches based on networks. Originally, TCTN considered just manually defined hyper-parameters. However, even better results can be achieved with a more carefully chosen hyper-parameters values. Thus, in this article, we present a genetic algorithm that (GA) can be used for finding better hyper-parameter values for TCTN. The proposed approach is called GA-TCTN. Our approach is applied in 25 text collections, and results demonstrate that a GA can be useful together with TCTN for semi-supervised text classification. Besides this contribution, comparisons among hyper-parameters distributions are performed to identify some pattern in its structure. The results indicate that TCTN and GA-TCTN tend to generate a similar set of hyper-parameters. However, GA-TCTN still allows the use of more specific hyper-parameters values being more flexible and practical than TCTN with manually defined parameters. Besides, GA-TCTN obtained better results than TCTN with statistically significant differences.
Transductive Classification through Term Network (TCTN) is an interesting and accurate approach to perform semi-supervised learning based on term networks for text classification. TCTN can surpass the accuracies obtained by transductive classification approach considering texts represented in other types of networks or vector space model. Also, TCTN can surpass the accuracies obtained by inductive supervised learning algorithms. Besides, the term networks in TCTN can have their size decreased while still keeps its classification performance. This implies a less computational cost than other semi-supervised learning approaches based on networks. Originally, TCTN considered just manually defined hyper-parameters. However, even better results can be achieved with a more carefully chosen hyper-parameters values. Thus, in this article, we present a genetic algorithm that (GA) can be used for finding better hyper-parameter values for TCTN. The proposed approach is called GA-TCTN. Our approach is applied in 25 text collections, and results demonstrate that a GA can be useful together with TCTN for semi-supervised text classification. Besides this contribution, comparisons among hyper-parameters distributions are performed to identify some pattern in its structure. The results indicate that TCTN and GA-TCTN tend to generate a similar set of hyper-parameters. However, GA-TCTN still allows the use of more specific hyper-parameters values being more flexible and practical than TCTN with manually defined parameters. Besides, GA-TCTN obtained better results than TCTN with statistically significant differences.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.