This paper proposes a methodology which discriminates the articles by the target authors ("true" articles) from those by other homonymous authors ("false" articles). Author name searches for 2,595 "source" authors in six subject fields retrieved about 629,000 articles. In order to extract true articles from the large amount of the retrieved articles, including many false ones, two filtering stages were applied. At the first stage any retrieved article was eliminated as false if either its affiliation addresses had little similarity to those of its source article or there was no citation relationship between the journal of the retrieved article and that of its source article. At the second stage, a sample of retrieved articles was subjected to manual judgment, and utilizing the judgment results, discrimination functions based on logistic regression were defined. These discrimination functions demonstrated both the recall ratio and the precision of about 95% and the accuracy (correct answer ratio) of 90-95%. Existence of common coauthor(s), address similarity, title words similarity, and interjournal citation relationships between the retrieved and source articles were found to be the effective discrimination predictors. Whether or not the source author was from a specific country was also one of the important predictors. Furthermore, it was shown that a retrieved article is almost certainly true if it was cited by, or cocited with, its source article. The method proposed in this study would be effective when dealing with a large number of articles whose subject fields and affiliation addresses vary widely.
This article surveys studies on bibliometrics and related subjects in Japan. Reviewed articles are classified according to the following categories: (a) studies on bibliometrics—including bibliometric laws, citation studies, scientific communication, and software tools for bibliometrics; and (b) application of bibliometrics—including policies for scientific research, bibliometrics and information retrieval, and databases in oriental languages. An interesting characteristic in the Japanese studies is that databases of texts in oriental languages such as Japanese and Chinese have been developed. Applications of fuzzy set theory to document retrieval using bibliometric techniques are also observed. We emphasize the models and methods used in common between bibliometrics and other fields of sciences.
For those who are not experts in a particular scientific field, it is difficult to understand scientific research trends. Although studies on the extraction of research trends have been conducted, most focus on extracting global trends from large-scale data, and the methods are often complicated. The purpose of this study is to develop a method of obtaining overviews of a scientific field for non-experts by capturing research trends simply and then to verify the method. To extract research topics which should express research trends, text analysis was performed using abstracts over 12 years of articles on high-temperature superconductors. We characterised three topics for the extracted word groups that frequently occurred. For these topics, we studied their appropriateness using a method that has been little used: examining research articles, review literature and co-citations among research articles used to extract the words, comparisons with controlled index terms assigned to the articles and confirming that there were no contradictions. Based on the established method, we have also applied this method to another research field: ‘simulation and modelling’. Although the method used in this article is simple, important topics were extracted, and the relations with the original articles are clear, which can lead to further investigation of the extracted topics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.