The effective evaluation of the impact of a scholarly article is a significant endeavor; for this reason, it has garnered attention. From the perspective of knowledge flow, this paper extracted various knowledge flow patterns concealed in articles citation counts to describe the citation impact of the articles. First, the intensity characteristic of knowledge flow was investigated to distinguish the different citation vitality of articles. Second, the knowledge diffusion capacity was examined to differentiate the size of the scope of articles’ influences on the academic environment. Finally, the knowledge transfer capacity was discussed to investigate the support degree of articles on the follow-up research. Experimental results show that articles got more citations recently have a higher knowledge flow intensity. The articles have various impacts on the academic environment and have different supporting effects on the follow-up research, representing the differences in their knowledge diffusion and knowledge transfer capabilities. Compared with the single quantitative index of citation frequency, these knowledge flow patterns can carefully explore the citation value of articles. By integrating the three knowledge flow patterns to examine the total citation impact of articles, we found that the articles exhibit distinct value of citation impact even if they were published in the same field, in the same year, and with similar citation frequencies.
The topic of one article reflects its main semantic content, which is also the main guidance for researchers to choose reference literature. In order to explore whether the topic of an article will affect its citation trend in future, this paper establishes a machine learning framework to study the role of topic characteristics in the prediction of future high influential articles. Articles from four different disciplines are collected as experimental samples to verify whether the framework proposed in this paper can be applied to the prediction task in different disciplines. The Latent Dirichlet Allocation (LDA) is used to determine the topic characteristics of sample articles. LDA can map sample articles to current hot topics and generate the mapping probability of sample articles under different hot topics. The maximum mapping probability of the sample article under the hot topics is extracted as the topic feature of the article. Then the feature space for the prediction task is constructed by combining the topic feature and some bibliometrics indices of articles. Three feature selection algorithms, Fisher Score, Relief-F and Spectral Feature Selection (SPEC), are taken to select the important features in the feature space. The prediction performance of these features is finally tested by three classifiers, SVM, KNN and Bagging. The experimental results show that the topic characteristics of article, the early citation characteristics of article, and the reputation of the author are the key factors that determine whether an article can grow into a highly influential one. The important value of topic characteristics in articles' citation activities shows that the content of the article is an important factor in attracting more citations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.