The classification of fresh fruits according to their visual ripeness is typically a subjective and tedious task; consequently, there is a growing interest in the use of non-contact techniques to automate this process. Machine learning techniques, such as artificial neural networks, support vector machines (SVMs), decision trees, and K-nearest neighbor algorithms, have been successfully applied for classification problems in the literature, particularly for images of fruit. However, the particularities of each classification problem make it difficult, if not impossible, to select a general technique that is applicable to all types of fruit. In this paper, the combinations of four machine learning techniques and three color spaces (RGB, HSV, and L*a*b*) were evaluated with regard to their ability to classify Cape gooseberry fruits. To this end, 925 Cape gooseberry fruit samples were collected, and each fruit was manually classified into one of seven different classes according to its level of ripeness. The color values of each fruit image in the three color spaces and their corresponding ripening stages were organized for training and validation following a fivefold cross-validation strategy in an iterative process repeated 100 times. According to the results, the classification of Cape gooseberry fruits by their ripeness level was sensitive to both the color space and the classification technique used. The models based on the L*a*b* color space and the SVM classifier showed the highest f-measure regardless of the color space, and the principal component analysis combination of color spaces improved the performance of the models at the expense of increased complexity. INDEX TERMS Cape gooseberry, artificial neural networks, support vector machines, decision trees, K-nearest neighbors, color spaces, PCA, multiclass confusion matrix.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.