Information shared online by the web users, is increasingly becoming a good source for others to learn from it. In many cases, the shared information is reflecting users' experiences and opinions about events or use of certain products. The volume of this shared information is huge. It is humanly very time consuming to read all this information and make an informed decision. The challenge is to analyze shared contents automatically, find dimensions of discussions, associated opinions and summarize them so that the user can have an extensive overview of the information for decision making. The research work presented in this paper addresses this challenge of information diversification by using probabilistic topic modeling and opinion extraction techniques. The proposed method automatically extracts dimensions of a particular discussion and combines it with the opinions presented in the discussion for information diversification purpose. Experiments on the real-world dataset indicates that our method is able to extract dimensions of a discussion and successfully associate it with the opinions expressed against these dimensions. The method presents users with both positive and negative opinions against a certain discussion to give an overview of the discussion.
Authors' Contribution MI and MR collected the samples. MI compliled the data. JK supervised the research and wrote the manuscript. ZR peformed statistical analysis. MI, MR and KK presented methodology.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.