Visible and Near-Infrared Reflectance Spectroscopy for Investigating Soil Mineralogy: A Review

Fang, Qian; Hong, Hanlie; Zhao, Lulu; Kukolich, Stephanie; Yin, Kunlong; Wang, Chaowen

doi:10.1155/2018/3168974

Cited by 121 publications

(74 citation statements)

References 95 publications

(141 reference statements)

“…The measured properties include soil texture (sand, silt and clay), organic carbon (OC) and cation exchange capacity (CEC). The soil particle size was quantified by the pipette method, as described in Donagema et al (2011). The method consists of using a 0.1 M NaOH solution as a dispersing agent under high-speed mechanical stirring for 10 min.…”

Section: Data Set and Chemical Analysis (mentioning)

confidence: 99%

The influence of training sample size on the accuracy of deep learning models for the prediction of soil properties with near-infrared spectroscopy data

Minasny, Mendes, et al. 2020. SOIL.

115

Abstract. The number of samples used in the calibration data set affects the quality of the generated predictive models using visible, near and shortwave infrared (VIS–NIR–SWIR) spectroscopy for soil attributes. Recently, the convolutional neural network (CNN) has been regarded as a highly accurate model for predicting soil properties on a large database. However, it has not yet been ascertained how large the sample size should be for a CNN model to be effective. This paper investigates the effect of the training sample size on the accuracy of deep learning and machine learning models. It aims at providing an estimate of how many calibration samples are needed to improve the model performance of soil properties predictions with CNN as compared to conventional machine learning models. In addition, this paper also looks at a way to interpret the CNN models, which are commonly labelled as a black box. It is hypothesised that the performance of machine learning models will increase with an increasing number of training samples, but it will plateau when it reaches a certain number, while the performance of CNN will keep improving. The performances of two machine learning models (partial least squares regression – PLSR; Cubist) are compared against the CNN model. A VIS–NIR–SWIR spectra library from Brazil, containing 4251 unique sites with averages of two to three samples per depth (a total of 12 044 samples), was divided into calibration (3188 sites) and validation (1063 sites) sets. Subsets of the calibration data set were then created to represent smaller calibration data sets of 125, 300, 500, 1000, 1500, 2000, 2500 and 2700 unique sites, equivalent to sample sizes of approximately 350, 840, 1400, 2800, 4200, 5600, 7000 and 7650. All three models (PLSR, Cubist and CNN) were generated for each sample size of the unique sites for the prediction of five different soil properties, i.e. cation exchange capacity, organic carbon, sand, silt and clay content. These calibration subset sampling processes and modelling were repeated 10 times to provide a better representation of the model performances. Learning curves showed that the accuracy increased with an increasing number of training samples. At a lower number of samples (< 1000), PLSR and Cubist performed better than CNN. The performance of CNN exceeded that of the PLSR and Cubist models at sample sizes of 1500 and 1800, respectively. It can be recommended that deep learning is most efficient for spectra modelling for sample sizes above 2000. The accuracy of the PLSR and Cubist models seems to reach a plateau above sample sizes of 4200 and 5000, respectively, while the accuracy of CNN has not plateaued. A sensitivity analysis of the CNN model demonstrated its ability to determine important wavelength regions that affected the predictions of various soil attributes.
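
The repeated calibration subsetting described in this abstract can be sketched as follows. This is only an illustration, not the authors' code: the function name `subsample_by_site`, the toy site labels, and the use of the repeat number as a random seed are all assumptions. It draws whole sites rather than individual samples, so every sample from a chosen site stays together, and repeats each draw 10 times as the abstract describes.

```python
import random

def subsample_by_site(site_ids, n_sites, seed):
    """Return indices of all samples belonging to n_sites randomly chosen sites."""
    unique_sites = sorted(set(site_ids))
    if n_sites > len(unique_sites):
        raise ValueError("n_sites exceeds the number of unique sites")
    rng = random.Random(seed)  # seeded so each repeat is reproducible
    chosen = set(rng.sample(unique_sites, n_sites))
    return [i for i, site in enumerate(site_ids) if site in chosen]

# Toy data (hypothetical): 20 sites with 3 samples each, not the Brazilian library.
site_ids = [site for site in range(20) for _ in range(3)]

# One list of sample indices per (subset size, repeat) combination.
subsets = {
    n_sites: [subsample_by_site(site_ids, n_sites, seed=repeat) for repeat in range(10)]
    for n_sites in (5, 10, 15)
}
```

Each entry of `subsets` would then feed one model fit, and the spread across the 10 repeats gives the variability of a point on the learning curve.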

“…Better generalisation can be made by training the model in a larger data set. However, several studies demonstrated that the performance of the machine learning model did not increase significantly, or it even plateaued, as the calibration sample size increased (Figueroa et al, 2012; Ramirez-Lopez et al, 2014; Ng et al, 2018).…”

Section: Introduction (mentioning)

confidence: 99%

“…Among the clay minerals, kaolinite, montmorillonite, illite, vermiculite, and chlorite are the common clay minerals in the hydrocarbon microseepage environments. Clay minerals exhibit diagnostic absorption features near 1400 nm (caused by OH overtones), 1900 nm (overtones caused by water molecules), 2200 nm (due to Al-OH combination tones) and some weaker absorptions in the 2300-2500 nm range due to the presence of Fe- or Mg-OH (Clark 1999; Zhao et al 2018; Fang et al 2018). Carbonate minerals show characteristic spectral absorptions near 2350 and 2500 nm (Hunt and Salisbury, 1971).…”

Section: Reflectance Spectroscopy (mentioning)

confidence: 99%
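
The band positions quoted in this statement lend themselves to a simple lookup. The sketch below is illustrative only: the function name `candidate_assignments` and the 30 nm matching tolerance are assumptions, not values from the cited papers. It returns every assignment from the statement that is compatible with an observed absorption position, which makes the overlap between the weak Fe-/Mg-OH range and the carbonate bands explicit.

```python
# Band centres (nm) and assignments taken from the quoted statement.
DIAGNOSTIC_FEATURES = [
    (1400, "OH overtones (clay minerals)"),
    (1900, "water molecule overtones (clay minerals)"),
    (2200, "Al-OH combination tones (clay minerals)"),
    (2350, "carbonate minerals"),
    (2500, "carbonate minerals"),
]
FE_MG_OH_RANGE = (2300, 2500)  # weaker clay absorptions, per the statement

def candidate_assignments(position_nm, tolerance_nm=30):
    """List assignments compatible with an absorption at position_nm.

    tolerance_nm is a hypothetical matching window, not a published value.
    """
    matches = [
        label
        for centre, label in DIAGNOSTIC_FEATURES
        if abs(position_nm - centre) <= tolerance_nm
    ]
    if FE_MG_OH_RANGE[0] <= position_nm <= FE_MG_OH_RANGE[1]:
        matches.append("Fe-OH or Mg-OH (weaker clay absorptions)")
    return matches
```

An absorption near 2340 nm, for example, matches both the carbonate band and the Fe-/Mg-OH range, so position alone cannot separate the two.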

Mineralogical and geochemical characterization of hydrocarbon microseepage-induced sediments in part of Assam-Arakan Fold Belt, Cachar area, NE India

Garain, Mitra, Das. 2020. Preprint.

Long-term hydrocarbon microseepages create a reducing environment on the surface rocks and sediments, which induces an array of mineralogical alterations. Among these alterations, the reduction of ferric iron minerals to ferrous iron minerals and the precipitation of clay and carbonates are significant. Several studies have been carried out to characterize these hydrocarbon-induced rocks and sediments. Almost all of these studies have been carried out in arid to semi-arid climatic regions. The present study attempts to characterize the geochemical properties of the hydrocarbon-induced sediments in part of the Assam-Arakan Fold Belt (AAFB), NE India, a region characterized by heavy rainfall. Reflectance spectroscopy, X-ray diffraction (XRD), X-ray fluorescence (XRF) and inductively coupled plasma mass spectrometry (ICP-MS) studies have been carried out on the sediments. The reflectance spectroscopy reveals that microseepage-induced sediments have higher clay content and lower ferric iron mineral content. Geochemical indices also suggest that the hydrocarbon-affected sediments are relatively more altered than the unaffected ones. Studies of trace element patterns indicate that the hydrocarbon-induced sediments are, on average, enriched in Be, V, Cu, Zn, Ga, Zr, and Mo and depleted in Li, Cr, Co, Ni, Rb, Sr, Sc, and Y. The normalized rare earth element (REE) distribution patterns are the same for both the microseepage-affected and unaffected sediments, though the microseepage-induced sediments are slightly depleted in the REEs. The present study thus points out that hydrocarbon microseepage-induced alterations are also evident in high-precipitation terrains, though the alteration levels are less pronounced than in arid to semi-arid climatic regions because abundant surface water and groundwater mobilize the minerals and elements from the microseepage system and tend to homogenize the compositions.

“…Reflectance spectra of these 200 powder samples in 0.5 mm grain size were obtained under the same condition as the sample in Figure 2. The preprocessing method of continuum removal (CR) was employed to remove background noise of the reflectance curves and isolate particular absorption features [17].…”

Section: Spectra and Categories Acquisition of Modeling Samples (mentioning)

confidence: 99%
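
Continuum removal, as mentioned in this statement, is commonly done by dividing each reflectance value by an upper convex hull fitted over the spectrum. The sketch below shows that common variant under stated assumptions: wavelengths are strictly increasing, reflectance is positive, and the function names and the synthetic test spectrum are made up for illustration. It is not necessarily the exact procedure used in the citing paper.

```python
import math

def upper_hull_indices(x, y):
    """Indices of the upper convex hull vertices, scanning left to right."""
    hull = []
    for i in range(len(x)):
        # Drop the last vertex while it lies on or below the chord to point i.
        while len(hull) >= 2:
            a, b = hull[-2], hull[-1]
            cross = (x[b] - x[a]) * (y[i] - y[a]) - (y[b] - y[a]) * (x[i] - x[a])
            if cross >= 0:
                hull.pop()
            else:
                break
        hull.append(i)
    return hull

def continuum_removed(x, y):
    """Divide reflectance y by its hull continuum, interpolated linearly."""
    hull = upper_hull_indices(x, y)
    result = []
    seg = 0  # index of the hull segment that currently brackets x[i]
    for i in range(len(x)):
        while hull[seg + 1] < i:
            seg += 1
        a, b = hull[seg], hull[seg + 1]
        t = (x[i] - x[a]) / (x[b] - x[a])
        continuum = y[a] + t * (y[b] - y[a])
        result.append(y[i] / continuum)
    return result

# Synthetic spectrum (hypothetical): flat 0.5 baseline with a dip at 2200 nm.
wavelengths = [2000 + 2 * i for i in range(201)]
reflectance = [0.5 - 0.1 * math.exp(-(((w - 2200) / 20) ** 2)) for w in wavelengths]
cr = continuum_removed(wavelengths, reflectance)
band_centre = wavelengths[cr.index(min(cr))]
```

After this normalisation the background sits at 1 and each absorption feature appears as a dip whose depth can be compared across samples.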

Characterization and Identification of Coal and Carbonaceous Shale Using Visible and Near-Infrared Reflectance Spectroscopy

Yang, Wang. 2018. Journal of Spectroscopy.

Because of the high organic carbon concentration in carbonaceous shale, a large proportion of carbonaceous shales are often misclassified as coals when visible and near-infrared (VIS-NIR) reflectance spectroscopy is used for coal-gangue identification in hyperspectral remote sensing of coal mines. In order to study the spectral characterization of coal and carbonaceous shale, three bituminite samples and three carbonaceous shales were collected from a coal mine in China, and their spectral reflectance curves were obtained by a field spectrometer in the wavelength range of 350-2500 nm. Only one carbonaceous shale could be easily distinguished from the three bituminite samples according to obvious absorption valleys near 1400 nm, 1900 nm, and 2200 nm in its reflectance curve, while the other two carbonaceous shales have reflectance curves similar to those of the three bituminite samples. The effect of carbon concentration on the reflectance curve was simulated by mixed powders of ultralow ash bituminite and clay in 0.5 mm grain size under various mixing ratios. It was found that the absorption valleys near 1400 nm, 1900 nm, and 2200 nm of the mixed powder become indistinct when the bituminite content is more than 30%. In order to establish an effective identification method for coal and carbonaceous shale, 250 other samples collected from the same coal mine were divided into 150 training samples and 100 prediction samples. Principal component analysis (PCA) and Gauss radial basis kernel principal component analysis (GRB-KPCA) were employed to extract principal components (PCs) of continuum removed (CR) spectra of the training samples in eight selected wavelength regions which are related to the main mineral and organic compositions. Two support vector machine (SVM)-based models, PCA-SVM and GRB-KPCA-SVM, were established. The results showed that the GRB-KPCA-SVM model had better identification accuracies of 94% and 92% for powder and nature block prediction samples, respectively.
