Abstract-During recent years, machine learning techniques have been attracting significant attentions in molecular biology and genomic era. They have become increasingly important to solve real-world problems such as elucidating protein function. An important step in the search for knowledge of protein function is to predict its cellular localization sites. Many computational methods that try to solve this problem have been developed over the years but the imbalanced distribution of proteins in cellular locations enormously influences the behavior of these methods. Hence, the performance and efficiency of the existing prediction methods still need to be improved. A computational method for efficiently predicting protein cellular localization is highly required. In this paper, we explore the use of four supervised machine learning algorithms in predicting the cellular localization sites of proteins from the primary sequence information. Our experiments were performed using Naï ve Bayesian, k-Nearest Neighbor and feed-forward Neural Network classifiers. The experts were evaluated with and without cross-validation on E.coli and Yeast benchmarks and combined using majority voting rule for improving classification accuracy on each dataset. The experimental results show that the proposed combination system significantly outperforms the best individual classifier.Index Terms-Protein localization, naï ve Bayesian classifier, k-nearest neighbor classifier, neural network classifier, combination of classifiers, E.coli, yeast.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.