This study compares the application of two variable selection methods in partial least squares regression (PLSR), the variable importance in projection (VIP) method and the selectivity ratio (SR) method. For this purpose, three different data sets were analysed: (a) physiochemical water quality parameters related to sensorial data, (b) gas chromatography-mass spectrometry (GC-MS) chemical (organic compound) profiles from fossil sea sediment samples related to sea surface temperature (SST) changes, and (c) exposed genes of Daphnia magna female samples related to their total offspring production. Correlation coefficients (r), levels of significance (p-value) and interpretation of the underlying experimental phenomena allowed the discussion about the best approach for variable selection in each case. The comparison of the two variable selection methods in the first water quality data set showed that the SR method is more accurate for sensorial prediction. For the climate data set, when raw total ion current (TIC) GC-MS chromatograms were considered, variables selected using the VIP method were easier to interpret compared with those selected by the SR method. However, when only some chromatographic peak areas (concentrations) were considered, the SR method was more efficient for prediction, and the VIP method selected the most relevant variables for the interpretation of SST changes. Finally, for the transcriptomic data set, the SR method was found again to be more reliable for prediction purposes.
A new liquid chromatography mass spectrometry (LC-MS) metabolomics strategy coupled to chemometric evaluation, including variable and biomarker selection, has been assessed as a tool to discriminate between control and stressed Saccharomyces cerevisiae yeast samples. Metabolic changes occurring during yeast culture at different temperatures (30 and 42°C) were analysed and the complex data generated in profiling experiments were evaluated by different chemometric multivariate approaches. Multivariate curve resolution alternating least squares (MCR-ALS) was applied to full spectral scan LC-MS preprocessed data multisets arranged in augmented column-wise data matrices. The results showed that sectioning the MS-chromatograms in different windows and analysing them by MCR-ALS enabled the proper resolution of very complex coeluted chromatographic peaks. The investigation of possible relationships between MCR-ALS resolved chromatographic peak areas and culture temperature was then investigated by partial least squares discriminant analysis (PLS-DA). Selection of most relevant resolved chromatographic peaks associated to yeast culture temperature changes was achieved according to PLS-DA-Variable Importance in Projection scores. A metabolite identification workflow was developed utilizing MCR-ALS resolved pure MS spectra and high-resolution accurate mass measurements to confirm assigned structures based on entries in metabolite databases. A total of 65 metabolites were identified. A preliminary interpretation of these results indicates that the strategy described in this study can be proposed as a general tool to facilitate biomarker identification and modelling in similar untargeted metabolomic studies.
Copper containing fungicides are used to protect vineyards from fungal infections. Higher residues of copper in grapes at toxic concentrations are potentially toxic and affect the microorganisms living in vineyards, such as Saccharomyces cerevisiae. In this study, the response of the metabolic profiles of S. cerevisiae at different concentrations of copper sulphate (control, 1 mM, 3 mM and 6 mM) was analysed by liquid chromatography coupled to mass spectrometry (LC-MS) and multivariate curve resolution-alternating least squares (MCR-ALS) using an untargeted metabolomics approach. Peak areas of the MCR-ALS resolved elution profiles in control and in Cu(ii)-treated samples were compared using partial least squares regression (PLSR) and PLS-discriminant analysis (PLS-DA), and the intracellular metabolites best contributing to sample discrimination were selected and identified. Fourteen metabolites showed significant concentration changes upon Cu(ii) exposure, following a dose-response effect. The observed changes were consistent with the expected effects of Cu(ii) toxicity, including oxidative stress and DNA damage. This research confirmed that LC-MS based metabolomics coupled to chemometric methods are a powerful approach for discerning metabolomics changes in S. cerevisiae and for elucidating modes of toxicity of environmental stressors, including heavy metals like Cu(ii).
This study focuses on the extraction of climate signals and processes using a combined approach which includes the analysis of a high number of lipid molecules in marine sediments, and the chemometric analysis of the acquired data. Neutral and acidic fractions of marine sediments from site IODP-U1318 (south-west of the UK, Porcupine Seabight) were quantified by GC-MS. The alkenone unsaturation index, U(k')37, was estimated from the composition of C37 alkenones and it was then used for the estimation of sea surface temperatures (SST) for reference. Principal component analysis (PCA), explained 77.45% of the total data variance, and differentiated neutral fraction GC-MS total ion current (TIC) profiles according to SST values of the different sediment sections. GC-MS TIC chromatograms were correlated to sea surface temperatures (SST) by partial least squares regression (PLSR). The compounds more robustly in line with SST values at each sediment section explained 93% of the SST variance and they were identified using the variable importance in projection (VIP) scores method. The proposed approach enables an objective identification of organic compounds sensitive to SST variability throughout complete chromatographic profiles. As a result of this multivariate unbiased approach, lipid composition of sediments was differentiated between compounds of marine (long chain n-alkanes, long chain n-alkan-1-ols) and terrestrial (short chain n-alkan1-ols, alkenols, cholesterol, squalene) origin, whose concentrations were directly and inversely correlated to SST, respectively.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.