Assessment of Soybean Lodging Using UAV Imagery and Machine Learning.

Shagor Sarkar, Jing Zhou, Andrew Scaboo, Jianfeng Zhou, Noel Aloysius, Teng Teeh Lim
Author Information
  1. Shagor Sarkar: Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
  2. Jing Zhou: Department of Biological Systems Engineering, University of Wisconsin-Madison, Madison, WI 53705, USA. ORCID
  3. Andrew Scaboo: Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
  4. Jianfeng Zhou: Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA. ORCID
  5. Noel Aloysius: Department of Chemical & Biomedical Engineering, University of Missouri, Columbia, MO 65211, USA.
  6. Teng Teeh Lim: Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.

Abstract

Plant lodging is one of the most essential phenotypes for soybean breeding programs. Soybean lodging is conventionally evaluated visually by breeders, which is time-consuming and subject to human errors. This study aimed to investigate the potential of unmanned aerial vehicle (UAV)-based imagery and machine learning in assessing the lodging conditions of soybean breeding lines. A UAV imaging system equipped with an RGB (red-green-blue) camera was used to collect the imagery data of 1266 four-row plots in a soybean breeding field at the reproductive stage. Soybean lodging scores were visually assessed by experienced breeders, and the scores were grouped into four classes, i.e., non-lodging, moderate lodging, high lodging, and severe lodging. UAV images were stitched to build orthomosaics, and soybean plots were segmented using a grid method. Twelve image features were extracted from the collected images to assess the lodging scores of each breeding line. Four models, i.e., extreme gradient boosting (XGBoost), random forest (RF), K-nearest neighbor (KNN) and artificial neural network (ANN), were evaluated to classify soybean lodging classes. Five data preprocessing methods were used to treat the imbalanced dataset to improve classification accuracy. Results indicate that the preprocessing method SMOTE-ENN consistently performs well for all four (XGBoost, RF, KNN, and ANN) classifiers, achieving the highest overall accuracy (OA), lowest misclassification, higher F1-score, and higher Kappa coefficient. This suggests that Synthetic Minority Oversampling-Edited Nearest Neighbor (SMOTE-ENN) may be a good preprocessing method for using unbalanced datasets and the classification task. Furthermore, an overall accuracy of 96% was obtained using the SMOTE-ENN dataset and ANN classifier. The study indicated that an imagery-based classification model could be implemented in a breeding program to differentiate soybean lodging phenotype and classify lodging scores effectively.

Keywords

References

  1. Front Med (Lausanne). 2022 Mar 07;9:730748 [PMID: 35321465]
  2. Plant Methods. 2019 Aug 21;15:98 [PMID: 31452674]
  3. PeerJ Comput Sci. 2021 May 19;7:e536 [PMID: 34141878]
  4. Plant Phenomics. 2019 Dec 31;2019:5704154 [PMID: 33313529]
  5. Front Plant Sci. 2022 Aug 04;13:957870 [PMID: 35991436]
  6. Front Plant Sci. 2017 Jun 30;8:1111 [PMID: 28713402]
  7. J Stat Softw. 2018;85(11):1-20 [PMID: 30505247]
  8. Front Genet. 2020 Oct 02;11:820 [PMID: 33133122]
  9. Scientifica (Cairo). 2021 Apr 20;2021:8810279 [PMID: 33968461]
  10. Open Biol. 2022 Jun;12(6):210353 [PMID: 35728624]
  11. Sci Rep. 2016 Aug 24;6:31890 [PMID: 27552909]
  12. Photogramm Eng Remote Sensing. 2010 Oct;76(10):1159-1168 [PMID: 21643433]
  13. IEEE Trans Image Process. 2001;10(2):266-77 [PMID: 18249617]
  14. Comput Intell Neurosci. 2022 Apr 30;2022:8735201 [PMID: 35535180]
  15. Sci Rep. 2022 Apr 15;12(1):6256 [PMID: 35428863]
  16. PLoS One. 2014 Nov 11;9(11):e112894 [PMID: 25386696]

Word Cloud

Created with Highcharts 10.0.0lodgingsoybeanbreedingUAVscoresSoybeanusingmethodANNpreprocessingclassificationaccuracySMOTE-ENNevaluatedvisuallybreedersstudyimageryuseddataplotsfourclassesieimagesimagefeaturesXGBoostRFKNNclassifydatasetoverallhigherPlantoneessentialphenotypesprogramsconventionallytime-consumingsubjecthumanerrorsaimedinvestigatepotentialunmannedaerialvehicle-basedmachinelearningassessingconditionslinesimagingsystemequippedRGBred-green-bluecameracollect1266four-rowfieldreproductivestageassessedexperiencedgroupednon-lodgingmoderatehighseverestitchedbuildorthomosaicssegmentedgridTwelveextractedcollectedassesslineFourmodelsextremegradientboostingrandomforestK-nearestneighborartificialneuralnetworkFivemethodstreatimbalancedimproveResultsindicateconsistentlyperformswellclassifiersachievinghighestOAlowestmisclassificationF1-scoreKappacoefficientsuggestsSyntheticMinorityOversampling-EditedNearestNeighbormaygoodunbalanceddatasetstaskFurthermore96%obtainedclassifierindicatedimagery-basedmodelimplementedprogramdifferentiatephenotypeeffectivelyAssessmentLodgingUsingImageryMachineLearningcrophigh-throughputphenotypingremotesensing

Similar Articles

Cited By