Attribute Value Weighted Average of One-Dependence Estimators

Yu, Liangjun; Jiang, Liangxiao; Wang, Dianhong; Zhang, Lungan

doi:10.3390/e19090501

Cited by 29 publications

(15 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The ensemble classification algorithms include adaptive boosting (AdaBoost) [ 30 ], bootstrap aggregating (bagging) [ 31 ], and random forest [ 32 , 33 ]. The algorithm extensions include hidden naïve Bayes (HNB) and aggregating one-dependence estimators (AODE) [ 19 – 22 ].…”

Section: Methodsmentioning

confidence: 99%

“…In principle, ML has four steps: problem definition, data collection and preparation, model building, and model prediction. There are 11 state-of-the-art machine learning techniques [ 18 – 22 ], namely, logistic regression (LR), k-nearest neighbor (kNN), support vector machine (SVM), naïve Bayes (NB), Bayesian network (BN), decision tree (C4.5), AdaBoosting, bagging, random forest (RF), hidden naïve Bayes (HNB), and aggregating one-dependence estimators (AODE).…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China

Shen

et al. 2018

BioMed Research International

100

View full text Add to dashboard Cite

Background Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. MethodsA cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. ResultsAmong the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion Novel machine learning techniques may have screening and predictive value for NAFLD.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China

Shen

et al. 2018

BioMed Research International

100

View full text Add to dashboard Cite

show abstract

“…Bayesian network (BN) [1]- [4] provides a powerful tool for knowledge representation and inference under conditions of uncertainty. Since the 1990s, the study of Bayesian network classifier (BNC) for classification has attracted tremendous attention after the success of Naive Bayes (NB) [5]- [8].…”

Section: Introductionmentioning

confidence: 99%

“…• Model weighting [4], [17] computes the weight associated with each SPODE to combine their probability estimates linearly.…”

Section: Introductionmentioning

confidence: 99%

“…We argue that overfitting to the testing instance will help improve rather than harm the generalization performance, and the significance of each SPODE should vary while classifying different instances, especially for highly predictive SPODEs. Yu et al [4] considered the specific characteristics of each testing instance and adjusted the weights to different SPODEs adaptively by computing the correlation between the root attribute value and the class. However, the notion that the weight of each SPODE is irrelevant to non-root attribute values is not convincing.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Self-Adaptive Attribute Value Weighting for Averaged One-Dependence Estimators

et al. 2020

View full text Add to dashboard Cite

Of numerous proposals for weakening the attribute independence assumption of Naive Bayes, averaged one-dependence estimators (AODE) learns by extrapolation from marginal to full-multivariate probability distributions, and has demonstrated reasonable improvement in terms of classification performance. However, all the one-dependence estimators in AODE are assigned with the same weight, and their probability estimates are combined linearly. This work presents an efficient and effective attribute value weighting approach that assigns discriminative weights to different super-parent one-dependence estimators for different instances by identifying the differences among these one-dependence estimators in terms of log likelihood. The proposed approach is validated on widely used benchmark datasets from UCI machine learning repository. Experimental results show that the proposed approach achieves bias-variance trade-off and is a competitive alternative to state-of-the-art Bayesian and non-Bayesian learners (e.g., tree augmented Naive Bayes and logistic regression).

show abstract