As one of the most prevalent cancers among women worldwide, breast cancer has attracted the most attention by researchers. It has been verified that an accurate and early detection of breast cancer can increase the chances for the patients to take the right treatment plan and survive for a long time. Nowadays, numerous classification methods have been utilized for breast cancer diagnosis. However, most of these classification models have concentrated on maximum the classification accuracy, failed to take into account the unequal misclassification costs for the breast cancer diagnosis. To the best of our knowledge, misclassifying the cancerous patient as non-cancerous has much higher cost compared to misclassifying the non-cancerous as cancerous. Consequently, in order to tackle this deficiency and further improve the classification accuracy of the breast cancer diagnosis, we propose an improved cost-sensitive support vector machine classifier (ICS-SVM) for the diagnosis of breast cancer. In the proposed approach, we take full account of unequal misclassification costs of breast cancer intelligent diagnosis and provide more reasonable results over previous works and conventional classification models. To evaluate the performance of the proposed approach, Wisconsin Breast Cancer (WBC) and Wisconsin Diagnostic Breast Cancer (WDBC) breast cancer datasets obtained from the University of California at Irvine (UCI) machine learning repository have been studied. The experimental results demonstrate that the proposed hybrid algorithm outperforms all the existing methods. Promisingly, the proposed method can be regarded as a useful clinical tool for breast cancer diagnosis and could also be applied to other illness diagnosis.
Predicting postoperative survival of lung cancer patients (LCPs) is an important problem of medical decision-making. However, the imbalanced distribution of patient survival in the dataset increases the difficulty of prediction. Although the synthetic minority oversampling technique (SMOTE) can be used to deal with imbalanced data, it cannot identify data noise. On the other hand, many studies use a support vector machine (SVM) combined with resampling technology to deal with imbalanced data. However, most studies require manual setting of SVM parameters, which makes it difficult to obtain the best performance. In this paper, a hybrid improved SMOTE and adaptive SVM method is proposed for imbalance data to predict the postoperative survival of LCPs. The proposed method is divided into two stages: in the first stage, the cross-validated committees filter (CVCF) is used to remove noise samples to improve the performance of SMOTE. In the second stage, we propose an adaptive SVM, which uses fuzzy self-tuning particle swarm optimization (FPSO) to optimize the parameters of SVM. Compared with other advanced algorithms, our proposed method obtains the best performance with 95.11% accuracy, 95.10% G -mean, 95.02% F1, and 95.10% area under the curve (AUC) for predicting postoperative survival of LCPs.
PurposeQualitative methods are not suitable to process high volumes of policy texts for exploring policy evolution. Therefore, it is hard to use qualitative methods to systematically analyze the characteristics of complex policy networks. So the authors propose a bibliometric research study for exploring policy evolution from time–agency–theme perspectives to excavate the rules and existing problems of China's medical informatization policy and to provide suggestions for formulating and improving the future medical informatization policies.Design/methodology/approachInitially, 615 valid samples are obtained by retrieving related China's medical informatization policy documents, and the joint policy-making agency network and the co-occurrence network models of medical informatization policies are defined, and then the authors research China's medical informatization policies from single-dimension and multi-dimension view.FindingsThe analysis results reveal that China's medical informatization policy process can be divided into four stages; the policy-making agencies are divided into four subgroups by community detection analysis according to the fast unfolding algorithm; the core policy theme keywords are identified based on the eigenvector centrality of the nodes in those networks; the focuses of theme terms are varied in different stages and the correlations between agencies and themes are gradually decentralized.Practical implicationsThese findings provide experience and evidence on leveraging informatics in the medical and healthcare field of China. Also, they can help scholars and practitioners better understand the current status and future directions of medical and healthcare informatics development in China and provide a reference to formulate and improve China's future medical informatization policies.Originality/valueThis study proposes a quantitative bibliometric-based research framework to describe transitions and trends of China's medical informatization policy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.