Software effort estimation plays an important role in the software development process: inaccurate estimation leads to poor utilization of resources and possibly to software project failure. Many software effort estimation techniques have been tried in an effort to develop models that generate optimal estimation accuracy, one of which is machine learning. It is crucial in machine learning to use a model that will maximize accuracy and minimize uncertainty for the purposes of software effort estimation. However, the process of selecting the best algorithm for estimation is complex and expert-dependent. This paper proposes an approach to analyzing datasets, automatically building estimation models with various machine learning techniques, and evaluating and comparing their results to find the model that produces the most accurate and surest estimates for a specific dataset. The proposed approach to automated model selection combines the Bayesian information criterion, correlation coefficients, and PRED measures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.