Development and assessment of a lysophospholipid-based deep learning model to discriminate geographical origins of white rice.

Nguyen Phuoc Long, Dong Kyu Lim, Changyeun Mo, Giyoung Kim, Sung Won Kwon
Author Information
  1. Nguyen Phuoc Long: Research Institute of Pharmaceutical Sciences and College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea.
  2. Dong Kyu Lim: Research Institute of Pharmaceutical Sciences and College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea. ORCID
  3. Changyeun Mo: National Institute of Agricultural Sciences, Rural Development Administration, Jeonju, 54875, Republic of Korea.
  4. Giyoung Kim: National Institute of Agricultural Sciences, Rural Development Administration, Jeonju, 54875, Republic of Korea.
  5. Sung Won Kwon: Research Institute of Pharmaceutical Sciences and College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea. swkwon@snu.ac.kr.

Abstract

Geographical origin determination of white rice has become the major issue of food industry. However, there is still lack of a high-throughput method for rapidly and reproducibly differentiating the geographical origins of commercial white rice. In this study, we developed a method that employed lipidomics and deep learning to discriminate white rice from Korea to China. A total of 126 white rice of 30 cultivars from different regions were utilized for the method development and validation. By using direct infusion-mass spectrometry-based targeted lipidomics, 17 lysoglycerophospholipids were simultaneously characterized within minutes per sample. Unsupervised data exploration showed a noticeable overlap of white rice between two countries. In addition, lysophosphatidylcholines (lysoPCs) were prominent in white rice from Korea while lysophosphatidylethanolamines (lysoPEs) were enriched in white rice from China. A deep learning prediction model was built using 2014 white rice and validated using two different batches of 2015 white rice. The model accurately discriminated white rice from two countries. Among 10 selected predictors, lysoPC(18:2), lysoPC(14:0), and lysoPE(16:0) were the three most important features. Random forest and gradient boosting machine models also worked well in this circumstance. In conclusion, this study provides an architecture for high-throughput classification of white rice from different geographical origins.

References

  1. Trends Biotechnol. 2004 May;22(5):245-52 [PMID: 15109811]
  2. Front Plant Sci. 2016 Sep 22;7:1419 [PMID: 27713752]
  3. J Environ Sci (China). 2013 Jan 1;25(1):144-54 [PMID: 23586309]
  4. J Agric Food Chem. 2014 Jul 16;62(28):6600-7 [PMID: 24953871]
  5. Cell Mol Life Sci. 2012 Oct;69(19):3225-43 [PMID: 22885821]
  6. Anal Chim Acta. 2015 Jul 23;885:1-16 [PMID: 26231889]
  7. Metabolomics. 2017;13(2):12 [PMID: 28090198]
  8. J Assoc Physicians India. 2009 Feb;57:127-33 [PMID: 19582980]
  9. Mol Syst Biol. 2016 Jul 29;12(7):878 [PMID: 27474269]
  10. Anal Chim Acta. 2015 Jun 16;879:10-23 [PMID: 26002472]
  11. Food Chem. 2015 Oct 1;184:154-9 [PMID: 25872438]
  12. Health Psychol Behav Med. 2014 Jan 1;2(1):723-734 [PMID: 25750814]
  13. Plant Signal Behav. 2011 Nov;6(11):1720-31 [PMID: 22041989]
  14. Neural Netw. 2015 Jan;61:85-117 [PMID: 25462637]
  15. Nucleic Acids Res. 2009 Jul;37(Web Server issue):W652-60 [PMID: 19429898]
  16. Food Chem. 2013 Dec 15;141(4):3504-9 [PMID: 23993513]
  17. Curr Protoc Bioinformatics. 2016 Sep 07;55:14.10.1-14.10.91 [PMID: 27603023]
  18. Brief Bioinform. 2017 Sep 1;18(5):851-869 [PMID: 27473064]
  19. J Agric Food Chem. 2010 Mar 24;58(6):3801-7 [PMID: 20170108]
  20. Food Chem. 2008 Jul 15;109(2):470-5 [PMID: 26003374]
  21. Mol Pharm. 2016 Jul 5;13(7):2524-30 [PMID: 27200455]
  22. J Pharm Biomed Anal. 2016 May 30;124:120-128 [PMID: 26942336]
  23. Crit Rev Food Sci Nutr. 2008 Jun;48(6):553-98 [PMID: 18568860]
  24. J Agric Food Chem. 2011 Oct 12;59(19):10435-41 [PMID: 21916514]
  25. Ecology. 2007 Nov;88(11):2783-92 [PMID: 18051647]
  26. J Agric Food Chem. 2017 Mar 29;65(12):2628-2634 [PMID: 28245645]
  27. Food Chem. 2013 Aug 15;139(1-4):1133-45 [PMID: 23561219]

MeSH Term

China
Deep Learning
Geography
Lysophosphatidylcholines
Lysophospholipids
Mass Spectrometry
Oryza
Reproducibility of Results
Republic of Korea
Species Specificity

Chemicals

Lysophosphatidylcholines
Lysophospholipids
lysophosphatidylethanolamine
We 201

Word Cloud

Created with Highcharts 10.0.0whitericemethodgeographicaloriginsdeeplearningdifferentusingtwomodelhigh-throughputstudylipidomicsdiscriminateKoreaChinacountrieslysoPCGeographicalorigindeterminationbecomemajorissuefoodindustryHoweverstilllackof arapidlyreproduciblydifferentiatingcommercialdevelopedemployedtotal12630cultivarsregionsutilizeddevelopmentvalidationdirectinfusion-massspectrometry-basedtargeted17lysoglycerophospholipidssimultaneouslycharacterizedwithinminutespersampleUnsuperviseddataexplorationshowednoticeableoverlapadditionlysophosphatidylcholineslysoPCsprominentlysophosphatidylethanolamineslysoPEsenrichedpredictionbuilt2014validatedbatches2015accuratelydiscriminatedAmong10selectedpredictors18:214:0lysoPE16:0threeimportantfeaturesRandomforestgradientboostingmachinemodelsalsoworkedwellcircumstanceconclusionprovidesarchitectureclassificationDevelopmentassessmentlysophospholipid-based

Similar Articles

Cited By