Summary
Information mining and semantic analysis have gained significant attention over recent years to obtain appropriate information from unstructured data. Several approaches have been introduced for web information mining. However, the expected accuracy is not reached by these approaches. Therefore, hybrid fuzzy clustering and enhanced latent Dirichlet allocation (ELDA) are proposed for the accuracy increment in this work. The information clustering process is performed using the hybrid fuzzy clustering algorithm called fuzzy‐C‐medoids optimized using improved whale algorithm. The clustering procedure entails grouping data points into more similar clusters than data points from other clusters. Finally, the context of the text is recognized by analyzing the semantic information with ELDA, which offers a suitable index for accurate and fast data extraction. PYTHON tool is used to develop the proposed web mining model, and the simulation analysis of the proposed model is carried out using the BibTex dataset and compared with baseline models. The performance of the proposed method is evaluated using purity, normalized mutual information, accuracy, and precision metrics. Also, it is compared with different existing algorithms. Thus, the analysis of the results proved that the proposed approach achieves better outcomes than the existing approaches.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.