Key-Value stores play an important role in today's large-scale, high-performance cloud applications. Elastic scaling and load rebalancing with low cost live data migration are critical to enabling the elasticity of Key/Value stores in the cloud. Learning how to reduce the migration cost is one difficult problem that cloud providers are trying to solve. Many existing works try to address this issue in non-virtual environments. These approaches, however, may not be well suited for cloud-based Key/Value stores. To address this challenge, the study tackles the data migration problem under a load rebalancing scenario. The paper builds a one cost model that could be aware of the underlying VM interference and trade-off between migration time and performance impact. A cost-aware migration algorithm is designed to utilize the cost model and balance rate to guide the choice of possible migration actions. Our evaluation using Yahoo! Cloud Serving Benchmark shows the effectiveness of the approach.
Aiming at the characteristics of high dimension and small samples in microarray data, this paper proposes a selective ensemble method to classify microarray data. Firstly, kruskal-wallis test is used to filter irrelevant genes with classification task and to obtain a set of genes, and then a reduced training set is produced from original training set according to gene subset obtained. Secondly, multiple gene subsets are generated by using neighborhood rough set model with different radius and used to construct training subsets on above reduced training set. Thirdly, every constructed training subset is used to train a classifier by using SVM algorithm, and then multiple classifiers are produced as base classifiers. Finally, a set of base classifiers are selected by using teachinglearning-based optimization and build an ensemble classifier by weighted voting. Five benchmarks tumor microarray datasets are applied to evaluate performance of our proposed method. Experimental results indicate our proposed method is very effective and efficient for classifying microarray data, and it improves not only classification accuracy, but also decrease memory costs and computation times.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.