A new algorithm using ant colony optimization (ACO) for selection of variables in linear discriminant analysis (LDA) is presented. The role of ACO is explored in the context of LDA classification in which spectral variable multicollinearity is a known cause of generalization problems. The proposed ACO-LDA presents a metaheuristic that mimics the ant's cooperative behavior, randomly depositing pheromones at vector elements corresponding to the most relevant variables. Such cooperative ant-like behavior, which is absent in the genetic algorithm, increases the probability of discarding noninformative variables, favoring construction of more parsimonious models than genetic algorithm-linear discriminate analysis (GA-LDA). The classification performance of ACO-LDA is assessed in two case studies: (i) classification of edible vegetable oils (with respect to base oil) via ultraviolet-visible (UV-Vis) spectrometry and (ii) simultaneous classification of tea samples with respect to type and geographic origin via near-infrared (NIR) spectrometry. In the first study, ACO-LDA was tested in a data set involving wide absorption bands in the UV region with low-resolution and strong spectral overlapping. In the second study, its capacity to manage a data matrix with high dimensionality was evaluated. In both studies, ACO-LDA selected a small subset of variables, which led to correct classifications for almost all of the samples, achieving a performance level similar to the well-established partial least squares-discriminant analysis (PLS-DA), and considerably better than GA-LDA. The use of ACO to select LDA classification variables can minimize generalization problems commonly associated with multicollinearity.
The Contributor assigns to Wiley-Blackwell, during the full term of copyright and any extensions or renewals, all copyright in and to the Contribution, and all rights therein, including but not limited to the right to publish, republish, transmit, sell, distribute and otherwise use the Contribution in whole or in part in electronic and print editions of the Journal and in derivative works throughout the world, in all languages and in all media of expression now known or later developed, and to license or permit others to do so.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.