Summary
Information mining and semantic analysis have gained significant attention over recent years to obtain appropriate information from unstructured data. Several approaches have been introduced for web information mining. However, the expected accuracy is not reached by these approaches. Therefore, hybrid fuzzy clustering and enhanced latent Dirichlet allocation (ELDA) are proposed for the accuracy increment in this work. The information clustering process is performed using the hybrid fuzzy clustering algorithm called fuzzy‐C‐medoids optimized using improved whale algorithm. The clustering procedure entails grouping data points into more similar clusters than data points from other clusters. Finally, the context of the text is recognized by analyzing the semantic information with ELDA, which offers a suitable index for accurate and fast data extraction. PYTHON tool is used to develop the proposed web mining model, and the simulation analysis of the proposed model is carried out using the BibTex dataset and compared with baseline models. The performance of the proposed method is evaluated using purity, normalized mutual information, accuracy, and precision metrics. Also, it is compared with different existing algorithms. Thus, the analysis of the results proved that the proposed approach achieves better outcomes than the existing approaches.