Topic mining of scientific literature can accurately capture the contextual structure of a topic, track research hotspots within a field, and improve the availability of information about the literature. This paper introduces a multi-dimensional topic mining method based on a hierarchical semantic graph model. The main innovations include (1) the hierarchical extraction of feature terms and construction of a corresponding semantic graph and (2) multi-dimensional topic mining based on graph segmentation and structure analysis. The process of semantic graph construction is based primarily on hierarchical feature term extraction, which can effectively reveal the hierarchical structural distribution of feature terms within documents. Our graph model also takes into account the complementarity of content-and context-related feature terms in documents while avoiding the loss of textual information. In addition, the multi-dimensional features of the topic can be mined effectively via an in-depth analysis of the constructed graph, resulting in a quantitative visualization of the many-to-many association between the topic and feature terms. A variety of experiments on existing document datasets demonstrate that the proposed approach is able to outperform state-of-the-art methods in terms of accuracy and efficacy. INDEX TERMS Topic mining, multi-dimensional topic, hierarchical semantic graph.
PurposeAs academic social Q&A networking websites become more popular, scholars are increasingly using them to meet their information needs by asking academic questions. However, compared with other types of social media, scholars are less active on these sites, resulting in a lower response quantity for some questions. This paper explores the factors that help explain how to ask questions that generate more responses and examines the impact of different disciplines on response quantity.Design/methodology/approachThe study examines 1,968 questions in five disciplines on the academic social Q&A platform ResearchGate Q&A and explores how the linguistic characteristics of these questions affect the number of responses. It uses a range of methods to statistically analyze the relationship between these linguistic characteristics and the number of responses, and conducts comparisons between disciplines.FindingsThe findings indicate that some linguistic characteristics, such as sadness, positive emotion and second-person pronouns, have a positive effect on response quantity; conversely, a high level of function words and first-person pronouns has a negative effect. However, the impacts of these linguistic characteristics vary across disciplines.Originality/valueThis study provides support for academic social Q&A platforms to assist scholars in asking richer questions that are likely to generate more answers across disciplines, thereby promoting improved academic communication among scholars.
With the continuous development of academic social network sites, academic social Q&A platforms have also become a venue for scholars to obtain academic information. This study explores the factors that influence the response quantity for questions on an academic social Q&A platform. Using 130 questions from the library and information services domain on ResearchGate Q&A, we adopt content analysis and multiple linear regression analysis to investigate the relationship between question characteristics and response quantity. We find that, for academic questions, the questioner's authority does not affect response quantity but questions that provide academic resources and contain fewer personal opinions from the questioner tend to attract more answers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.