Jing Huang: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China.
Xuenan Zhang: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China.
Hang Yang: School of Intelligent Systems Science and Engineering, Jinan University, Zhuhai 519070, China.
Zhenbiao Li: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China.
Zhengfang Xue: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China.
Qingqing Wang: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China. ORCID
Xinyuan Zhang: School of Intelligent Systems Science and Engineering, Jinan University, Zhuhai 519070, China.
Shenghua Ding: Hunan Agricultural Product Processing Institute, Hunan Academy of Agricultural Sciences, Changsha 410125, China.
Zisheng Luo: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China.
Yanqun Xu: College of Biosystems Engineering and Food Science, Key Laboratory of Agro-Products Postharvest Handling of Ministry of Agriculture and Rural Affairs, Zhejiang University, Hangzhou 310058, China. ORCID
Volatile organic compounds (VOCs) are closely associated with the maturity and variety of strawberries. However, the complexity of VOCs hinders their potential application in strawberry classification. This study developed a novel classification workflow using strawberry VOC profiles and machine learning (ML) models for precise fruit classification. A comprehensive VOC dataset was rapidly collected using gas chromatography-ion mobility spectrometry (GC-IMS) from five strawberry varieties at four maturity stages (n = 300) and visualized through principal component analysis (PCA). Five ML models were developed, including partial least squares discriminant analysis (PLS-DA), decision trees, support vector machines (SVM), Xgboost and neural networks (NN). The accuracy of all models ranged from 90.00% to 98.33%, with the NN model demonstrating the best performance. Specifically, it achieved 96.67% accuracy for single-maturity classification, 98.33% for single-variety classification, and 96.67% for dual maturity and variety classification, along with 98.09% precision, 97.92% recall, and 97.91% F1 score. Feature importance analysis indicated that the NN model exhibited the most balanced reliance on various VOCs, contributing to its optimal performance with the broad-spectrum VOC detection method, GC-IMS. Overall, these findings underscore the potential of NN modeling for accurate and efficient fruit classification based on integrated VOC profiles.