The analysis of lignocellulosic materials is crucial to optimizing the conversion efficiencies in biorefineries and to studying crop residue input to soil nutrient cycles. Mid-infrared (MIR) and near-infrared (NIR) spectroscopies are rapid, simple, and nondestructive methods for the determination of biomass compositions. However, the analysis of a small set of plant biomass is not generally possible with conventional methods of data processing, such as partial least squares. Additionally, IR spectra do not distribute spherically in the data space. To overcome these problems, we propose a weighted-covariance factor fuzzy C-means clustering method combined with bootstrapping. The algorithm can classify spherical and nonspherical clusters, in contrast to classic fuzzy C-means, which is only adapted to spherical clusters. Bootstrapping enables resampling of the available spectra to generate several datasets on which the classification is performed. This unsupervised clustering methodology was tested to classify a small set of maize roots in soil according to genotype or period of their biodegradation process based on their NIR and MIR spectra. This methodology is applied to determine the optimal pretreatment of IR spectra, to study the contribution of the combination of MIR and NIR spectra and to compare the results on spectral and chemical data. The results show that the best methods of pretreatment are the first-order Savitzky-Golay derivative followed by standard normal variate. The MIR spectra produce a better result than NIR spectra for the initial characterization and for dynamic samples, while MIR spectra acquired on raw samples, without soluble extraction, provided better classification than wet chemistry
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.