Increase statistical reliability without losing predictive power by merging classes and adding variables

Huang, Wenxue; Li, Xiaofeng; Pan, Yuanyi

doi:10.3934/bdia.2016014

BDIA

2017

DOI: 10.3934/bdia.2016014

|View full text |Cite

Increase statistical reliability without losing predictive power by merging classes and adding variables

Wenxue Huang¹,

Xiaofeng Li²,

Yuanyi Pan

Abstract: It is usually true that adding explanatory variables into a probability model increases association degree yet risks losing statistical reliability. In this article, we propose an approach to merge classes within the categorical explanatory variables before the addition so as to keep the statistical reliability while increase the predictive power step by step.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2017

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

A category-based probabilistic approach to feature selection

Rodrigues¹,

Huang²,

Pan

2017

BDIA

View full text Add to dashboard Cite

<abstract> <p>A high dimensional and large sample categorical data set with a response variable may have many noninformative or redundant categories in its explanatory variables. Identifying and removing these categories usually improve the association but also give rise to significantly higher statistical reliability of selected features. A category-based probabilistic approach is proposed to achieve this goal. Supportive experiments are presented.</p> </abstract>

show abstract

A category-based probabilistic approach to feature selection

Rodrigues¹,

Huang²,

Pan

2017

BDIA

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Increase statistical reliability without losing predictive power by merging classes and adding variables

Cited by 1 publication

References 16 publications

A category-based probabilistic approach to feature selection

A category-based probabilistic approach to feature selection

Contact Info

Product

Resources

About