William Perrizo scite author profile

Classification analysis of microarray gene expression data has been widely used to uncover biological features and to distinguish closely related cell types that often appear in the diagnosis of cancer. However, the number of dimensions of gene expression data is often very high, e.g., in the hundreds or thousands. Accurate and efficient classification of such high-dimensional data remains a contemporary challenge. In this paper, we propose a comprehensive vertical sample-based KNN/LSVM classification approach with weights optimized by genetic algorithms for high-dimensional data. Experiments on common gene expression datasets demonstrated that our approach can achieve high accuracy and efficiency at the same time. The improvement of speed is mainly related to the vertical data representation, P-tree,Patents are pending on the P-tree technology. This work is partially supported by GSA Grant ACT#:K96130308. and its optimized logical algebra. The high accuracy is due to the combination of a KNN majority voting approach and a local support vector machine approach that makes optimal decisions at the local level. As a result, our approach could be a powerful tool for high-dimensional gene expression data analysis.

show abstract

The P-tree algebra

Ding

Khan

Roy

et al. 2002

View full text Add to dashboard Cite

Association Rule Mining on Remotely Sensed Images Using P-trees

Ding

Perrizo

2002

View full text Add to dashboard Cite

Unique ergodicity of flows on homogeneous spaces

Ellis

Perrizo

1978

Israel J. Math.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

William Perrizo

k-nearest Neighbor Classification on Spatial Data Streams Using P-trees

Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis

The P-tree algebra

Association Rule Mining on Remotely Sensed Images Using P-trees

Unique ergodicity of flows on homogeneous spaces

Contact Info

Product

Resources

About