Text mining has gained considerable popularity over the last few years. Since the awareness of mobile media, streaming media, print media, and many other outlets are now accessible to consumers. Due to the large availability of text in many respects, research experts registered many unstructured data and found various ways in the literature of converting this dispersed text into a given, organized volume. Compared to the short text, the emphasis on complete classification (complete news, big records, long text, etc.) is prevalent. We addressed in this paper the process of text classification, grading, and various methodologies for feature extraction in short texts, i.e., news classification based on their headlines. Existing classification is compared and their operating methodologies presented efficiently. This work serves the purpose of classifying different types of Bangla Newspaper articles into 10 specific categories. The classification task is performed on Bengali text from three renowned newspapers of Kolkata. We have used advanced data tokenization techniques and unsupervised 'GloVe' vectorization for better classification performance. We applied LSTM and CNN as our main feature extractors. Comparing with other models like binary SVM classifier, standard LSTM, BiLSTM, CNN, or ANN, this proposed work gives better accuracy of 87%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.