Description |
The obtained GC/MS raw data in. D format were transferred to .abf format via software Analysis Base File Converter for quick retrieval of data. Then, data were imported into software MS-DIAL, which performs peak detection, peak identification, MS2Dec deconvolution, characterization, peak alignment, wave filtering, and missing value interpolation. Metabolite characterization was based on LUG database.A data matrix was derived. The three-dimensional matrix includes: sample information, the name of the peak of each substance, retention time, retention index, mass-to-charge ratio, and signal intensity. In each sample, all peak signal intensities were segmented and normalized according to the internal standards with RSD greater than 0.1 after screening.After the data was normalized, redundancy removal and peak merging were conducted to obtain the data matrix.The matrix was imported in R to carry out Principle Component Analysis (PCA) to observe the overall distribution among the samples and the stability of the whole analysis process. Orthogonal Partial Least-Squares-Discriminant Analysis (OPLS-DA) and Partial Least-Squares-Discriminant Analysis (PLS-DA) were utilized to distinguish the metabolites that differ between groups.To prevent overfitting, 7-fold cross-validation and 200 Response Permutation Testing (RPT) were used to evaluate the quality of the model. Variable Importance of Projection (VIP) values obtained from the OPLS-DA model were used to rank the overall contribution of each variable to group discrimination. |