Comparison between optimized MaxEnt and random forest modeling in predicting potential distribution: A case study with Quasipaa boulengeri in China.

Ziyi Zhao, Nengwen Xiao, Mei Shen, Junsheng Li
Author Information
  1. Ziyi Zhao: State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China; School of Ecology, Lanzhou University, Lanzhou 730000, China.
  2. Nengwen Xiao: State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China. Electronic address: xiaonw@craes.org.cn.
  3. Mei Shen: State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China.
  4. Junsheng Li: State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China.

Abstract

Random forest (RF) and MaxEnt models are shallow machine learning approaches that perform well in predicting species' potential distributions. RF models can produce robust results with the default automatic configuration in most cases, but it is necessary for MaxEnt to optimize the model settings to improve the performance, and the predictive performance difference between optimized MaxEnt and RF is uncertain. To explore this issue, the potential distribution of the endangered amphibian Quasipaa boulengeri in China was predicted using optimized MaxEnt and RF models. A total of 408 occurrence data were selected, 1000 locations were generated as pseudo-absence data by the geographic distance method, and 10,000 sites were selected as background data by creating a bias file. Partial ROC at different thresholds and success rate curves were used to compare the predictive performances between optimized MaxEnt and RF. Our results showed that the RF and optimized MaxEnt models both had good performance in predicting the potential distribution of Q. boulengeri, with the RF model performing slightly better whether based on partial ROC or success rate curves. Furthermore, the core suitable habitat regions of Q. boulengeri identified by RF and MaxEnt were similar and were all located in the Sichuan, Chongqing, Hubei, Hunan, and Guizhou provinces. However, the RF model produced a habitat suitability map with higher discrimination and greater heterogeneity. Temperature annual range, mean temperature of the driest quarter, and annual precipitation were the vital environmental variables limiting the distribution of Q. boulengeri. The RF model is the stronger machine learner. We believe it may be more applicable in predicting the native potential distributions of species with sufficient occurrence data, given the additional predictive detail, the simplicity of use, the computational time involved, and the operational complexity.

Keywords

MeSH Term

Animals
Anura
China
Ecosystem
Temperature

Word Cloud

Created with Highcharts 10.0.0RFMaxEntboulengerimodelspotentialmodeloptimizedpredictingdistributiondataforestperformancepredictiveQuasipaaQRandommachinedistributionsresultsChinaoccurrenceselectedROCsuccessratecurveshabitatannualshallowlearningapproachesperformwellspecies'canproducerobustdefaultautomaticconfigurationcasesnecessaryoptimizesettingsimprovedifferenceuncertainexploreissueendangeredamphibianpredictedusingtotal4081000locationsgeneratedpseudo-absencegeographicdistancemethod10000sitesbackgroundcreatingbiasfilePartialdifferentthresholdsusedcompareperformancesshowedgoodperformingslightlybetterwhetherbasedpartialFurthermorecoresuitableregionsidentifiedsimilarlocatedSichuanChongqingHubeiHunanGuizhouprovincesHoweverproducedsuitabilitymaphigherdiscriminationgreaterheterogeneityTemperaturerangemeantemperaturedriestquarterprecipitationvitalenvironmentalvariableslimitingstrongerlearnerbelievemayapplicablenativespeciessufficientgivenadditionaldetailsimplicityusecomputationaltimeinvolvedoperationalcomplexityComparisonrandommodelingdistribution:casestudyOptimizedOverfittingSpecies

Similar Articles

Cited By (25)