In today's information-rich world, text classification is a challenging process because of the large number of training cases and features present in text data. One of the most difficult problems in text classification is the high dimensionality of the feature space. Because many real-world text classification tasks have not been modeled or are too difficult to model, this paper proposes a real-world text classification approach based on one of the properties of David Merrill's First Principles of Instruction (FPI). The objective is to introduce a method that improves the effectiveness, efficiency, and accuracy of text classification. In this methodology, we categorize text into a predefined group of categories by providing the categories with a proper training set, based on the Application phase of FPI. The algorithm involves parsing, text categorization, and text analysis.
General Terms: Pattern Recognition, Text Mining, et al.