Background and aim: Recently, the use of data science techniques in healthcare has been increased remarkably. Community detection as one the important methods of data science is utilized in the health domain.
Methods: This paper detects disease areas based on combination of big data and graph mining methods on drug prescriptions. At first, network of prescription is designed, and Louvain algorithm is applied for community detection of 50000 Iranian prescriptions in 2014 gathered from the Iranian Health Insurance Organization. We use modularity metric for validation of the results and the experts’ opinion as the external validation of communities.
Results: The outputs are consist of six communities. These communities are labeled based on experts’ opinion that present the disease fields.
Conclusion: The Louvain algorithm has the ability to detect the major communities of the prescription database with an acceptable accuracy. We have proven that these communities present the disease fields.
In recommender systems (RSs), explicit information is often preferred over implicit because it is much more accurate than implicit or predicted information; for example, the user can enter information about his interests directly into the system, and the system will generate accurate recommendations for him. Receiving explicit information, however, may be difficult for a system. Explicit demographic information might be uncomfortable for some users, and extremely common questions, such as race, gender, income, and age, can lead to bias and unfair recommendations. As a result, in this study, we propose a method, in which information collected from a new user does not contain demographic information, and enquired explicit information is data driven. Users’ interest in tourism activities is used to identify seven categories of tourism. The mapping between extracted categories and activities is established with a multilabel classification (MLC) algorithm. The user’s interest in 18 tourism activities is predicted by rating only seven tourism categories. Common MLC algorithms with different classifiers were used to evaluate the proposed method. The best result relates to binary relevance with the Naïve Bayes classifier, which also outperforms the entitled algorithms in collaborative filtering (CF) systems as baseline models. The proposed method can capture users’ interests and develop their profiles without receiving demographic information. Also, compared to CF, in addition to a slight advantage in metrics, it only requires seven ratings to predict user interest in 18 activities. In contrast, CF algorithms require at least 15 user ng records to predict user interest in unknown activities (3-4 activities) to achieve a performance close to the proposed method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.