Gao Bo scite author profile

Gao Bo

3Publications

25Citation Statements Received

43Citation Statements Given

How they've been cited

How they cite others

Affiliations

Ministry of Education of the People's Republic of China, Beijing University of Chemical Technology

Publications

Order By: Most citations

Text Classification Using Novel Term Weighting Scheme-Based Improved TF-IDF for Internet Media Reports

Jiang

et al. 2021

Mathematical Problems in Engineering

View full text Add to dashboard Cite

With the rapid development of the internet technology, a large amount of internet text data can be obtained. The text classification (TC) technology plays a very important role in processing massive text data, but the accuracy of classification is directly affected by the performance of term weighting in TC. Due to the original design of information retrieval (IR), term frequency-inverse document frequency (TF-IDF) is not effective enough for TC, especially for processing text data with unbalanced distributions in internet media reports. Therefore, the variance between the DF value of a particular term and the average of all DFs DF ¯ , namely, the document frequency variance (ADF), is proposed to enhance the ability in processing text data with unbalanced distribution. Then, the normal TF-IDF is modified by the proposed ADF for processing unbalanced text collection in four different ways, namely, TF-IADF, TF-IADF+, TF-IADFnorm, and TF-IADF+norm. As a result, an effective model can be established for the TC task of internet media reports. A series of simulations have been carried out to evaluate the performance of the proposed methods. Compared with TF-IDF on state-of-the-art classification algorithms, the effectiveness and feasibility of the proposed methods are confirmed by simulation results.

show abstract

An Improved Term Weighting Method for Content Analysis on Chinese Internet Media Contents

Jiang

Tian

et al. 2020

View full text Add to dashboard Cite

Separation Modeling Of the Internal Air-Launch Rocket from a Cargo Aircraft

Bo¹,

Tang²,

Xu³

2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Gao Bo

Text Classification Using Novel Term Weighting Scheme-Based Improved TF-IDF for Internet Media Reports

An Improved Term Weighting Method for Content Analysis on Chinese Internet Media Contents

Separation Modeling Of the Internal Air-Launch Rocket from a Cargo Aircraft

Contact Info

Product

Resources

About