Robust Inference from Conditional Logistic Regression Applied to Movement and Habitat Selection Analysis.

Marie-Caroline Prima, Thierry Duchesne, Daniel Fortin
Author Information
  1. Marie-Caroline Prima: Département de Biologie, Université Laval, Québec, Québec, Canada.
  2. Thierry Duchesne: Département de mathématiques et de statistique, Université Laval, Québec, Québec, Canada.
  3. Daniel Fortin: Département de Biologie, Université Laval, Québec, Québec, Canada.

Abstract

Conditional logistic regression (CLR) is widely used to analyze habitat selection and movement of animals when resource availability changes over space and time. Observations used for these analyses are typically autocorrelated, which biases model-based variance estimation of CLR parameters. This bias can be corrected using generalized estimating equations (GEE), an approach that requires partitioning the data into independent clusters. Here we establish the link between clustering rules in GEE and their effectiveness in removing statistical bias from variance estimates of CLR parameters. The current lack of guidelines has led to broad variation in clustering rules among studies (e.g., 14-450 clusters), with unknown consequences for the robustness of statistical inference. We simulated datasets reflecting conditions typical of field studies. Longitudinal data were generated from several habitat selection parameters, with varying strength of autocorrelation and with some individuals having more observations than others. We then evaluated how changing the number of clusters affected the effectiveness of variance estimators. Simulations revealed that 30 clusters were sufficient to obtain unbiased and relatively precise estimates of the variance of parameter estimates. Destructive sampling, used to increase the number of independent clusters, successfully removed statistical bias, but only when observations were temporally autocorrelated and inter-individual heterogeneity was weak. GEE also provided robust variance estimates across different magnitudes of imbalance in the datasets. Our simulations demonstrate that GEE should be estimated by treating each individual as one cluster when at least 30 animals are followed, or by using destructive sampling for studies with fewer individuals that show an intermediate level of behavioural plasticity in selection and temporally autocorrelated observations.
These simulations provide guidance for building reliable habitat selection and movement models that support robust statistical inference without discarding excessive amounts of ecological information.
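
The effect of clustering on variance estimation can be illustrated with a minimal sketch. It is not the authors' code and it does not fit a CLR model: as a simplifying assumption it uses an intercept-only model (a sample mean), for which the GEE robust (sandwich) variance under an independence working correlation has a closed form. The function name, the variable names and the toy data are all hypothetical.

```python
from collections import defaultdict


def naive_and_robust_variance(values, cluster_ids):
    """Return (mean, naive_var, robust_var) for the mean of `values`.

    naive_var treats every observation as independent (model-based).
    robust_var sums the residuals within each cluster before squaring,
    which is the sandwich estimator for an intercept-only model with an
    independence working correlation.
    """
    n = len(values)
    mean = sum(values) / n
    residuals = [v - mean for v in values]

    # Model-based variance of the mean: s^2 / n.
    naive_var = sum(r * r for r in residuals) / (n - 1) / n

    # Cluster-level score contributions: sum of residuals per cluster.
    cluster_scores = defaultdict(float)
    for r, c in zip(residuals, cluster_ids):
        cluster_scores[c] += r

    # Sandwich variance: squared cluster scores divided by n^2.
    robust_var = sum(s * s for s in cluster_scores.values()) / (n * n)
    return mean, naive_var, robust_var


# Toy data (hypothetical): three animals, four strongly correlated
# observations each, with each animal treated as one cluster.
values = [1, 1, 1, 1, 3, 3, 3, 3, 5, 5, 5, 5]
cluster_ids = ["a"] * 4 + ["b"] * 4 + ["c"] * 4
mean, naive_var, robust_var = naive_and_robust_variance(values, cluster_ids)
print(mean, naive_var, robust_var)
```

In this toy case the observations within an animal are identical, so the model-based variance is smaller than the cluster-based one. With only three clusters the robust estimate itself would be unreliable, which is the kind of situation the simulations above address.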


MeSH Term

Animal Migration
Animals
Ecosystem
Logistic Models
Models, Theoretical
